Towards Systematic and Cost-Effective Monitoring of Large-Scale Software Systems
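Basic Information
- Grant number: RGPIN-2021-03900
- Principal investigator:
- Amount: $17,500
- Host institution:
- Host institution country: Canada
- Project category: Discovery Grants Program - Individual
- Fiscal year: 2022
- Funding country: Canada
- Project period: 2022-01-01 to 2023-12-31
- Project status: Completed
- Source:
- Keywords:
Project Abstract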
Failures of large-scale software systems can have significant impacts on our lives and safety, as well as on the economy and security of our society. For example, failures of the Canada Revenue Agency (CRA) website in March 2017 and March 2019 blocked many Canadians from filing their taxes for days. To detect and address such failures quickly and reduce their impact on users, it is crucial to gain an understanding of the internal state and runtime behavior of these systems. Software observability (i.e., the extent to which the internal state of a system can be inferred) allows software engineers to gain such an understanding, to verify whether a system behaves as expected, and to detect and diagnose runtime failures. Software monitoring (e.g., logging and tracing) is key to ensuring the observability of large-scale software systems. Despite the importance of software monitoring, several challenges complicate the monitoring of large-scale software systems. First, large-scale software systems are usually composed of many components (e.g., microservices) that may be developed by different organizations using different programming languages. Furthermore, these components are often evolving. This heterogeneous and evolving nature makes it challenging to ensure consistent and up-to-date monitoring of these systems. Second, as the scale and complexity of software increase, so do the requirements on the computing resources needed to produce and manage the monitoring data, which could introduce significant overhead (e.g., performance and storage overhead) to the operations of large-scale software systems. Third, large-scale software systems usually generate a very large amount of monitoring data from multiple sources (e.g., from different web services and applications), which poses challenges for the analysis and utilization of these data.
My long-term research goal is to discover, design, and develop a comprehensive and systematic solution to improve software monitoring and increase the observability of large-scale software systems. To achieve this long-term goal, the proposed research program will tackle the challenges of software monitoring from three mutually complementary perspectives: software development, software execution, and analysis of monitoring data. First, we aim to improve the quality of software monitoring by systematically considering the monitoring aspect across the entire software development lifecycle. Second, we aim to increase the cost-effectiveness of software monitoring by dynamically optimizing the monitoring intensity during software execution. Third, we will develop generic approaches to simplify common workflows of analyzing monitoring data. The proposed research program will be a pioneering attempt to systematically improve the monitoring and observability of large-scale software systems, which will benefit research and practice in Canada in producing high-quality software.
Project Outcomes
- Journal articles: 0
- Monographs: 0
- Research awards: 0
- Conference papers: 0
- Patents: 0
Other publications by Li, Heng
Amplifying fluorescence sensing based on inverse opal photonic crystal toward trace TNT detection
- DOI: 10.1039/c0jm02554b
- Published: 2011-01-01
- Journal:
- Impact factor: 0
- Authors: Li, Heng; Wang, Jingxia; Jiang, Lei
- Corresponding author: Jiang, Lei
Bacterial content and characterization of antibiotic resistant Staphylococcus aureus in Danish sushi products and association with food inspector rankings
- DOI: 10.1016/j.ijfoodmicro.2019.108244
- Published: 2019-09-16
- Journal:
- Impact factor: 5.4
- Authors: Li, Heng; Stegger, Marc; Leisner, Jorgen J.
- Corresponding author: Leisner, Jorgen J.
Multi-focus image fusion algorithm based on supervised learning for fully convolutional neural network
- DOI: 10.1016/j.patrec.2020.11.014
- Published: 2021-01-01
- Journal:
- Impact factor: 5.1
- Authors: Li, Heng; Zhang, Liming; Li, Yulong
- Corresponding author: Li, Yulong
Exploring the structure characteristics and major channels of cytochrome P450 2A6, 2A13, and 2E1 with pilocarpine
- DOI: 10.1002/bip.23108
- Published: 2018-04-01
- Journal:
- Impact factor: 2.9
- Authors: Fan, Jing-Rong; Li, Heng; Zheng, Qing-Chuan
- Corresponding author: Zheng, Qing-Chuan
An accurate 3D-FE based radius prediction model for in-plane roll-bending of strip considering spread effects
- DOI: 10.1016/j.commatsci.2010.09.032
- Published: 2010-12
- Journal:
- Impact factor: 3.3
- Authors: Li, Zhijun; Yang, He; Li, Heng; Xu, Jie
- Corresponding author: Xu, Jie
Other grants held by Li, Heng
Towards Systematic and Cost-Effective Monitoring of Large-Scale Software Systems
- Grant number: DGECR-2021-00239
- Fiscal year: 2021
- Funding amount: $17,500
- Project category: Discovery Launch Supplement
Towards Systematic and Cost-Effective Monitoring of Large-Scale Software Systems
- Grant number: RGPIN-2021-03900
- Fiscal year: 2021
- Funding amount: $17,500
- Project category: Discovery Grants Program - Individual
Similar international grants
Cost-effectiveness analysis of systematic screening for atrial fibrillation
- Grant number: 23K19868
- Fiscal year: 2023
- Funding amount: $17,500
- Project category: Grant-in-Aid for Research Activity Start-up
Towards Systematic and Cost-Effective Monitoring of Large-Scale Software Systems
- Grant number: DGECR-2021-00239
- Fiscal year: 2021
- Funding amount: $17,500
- Project category: Discovery Launch Supplement
The effectiveness and cost-effectiveness of population-level policies to reduce alcohol use: a systematic review
- Grant number: 452586
- Fiscal year: 2021
- Funding amount: $17,500
- Project category: Operating Grants
Towards Systematic and Cost-Effective Monitoring of Large-Scale Software Systems
- Grant number: RGPIN-2021-03900
- Fiscal year: 2021
- Funding amount: $17,500
- Project category: Discovery Grants Program - Individual
Cost-optimized planning of changes in manufacturing of safety-critical components through systematic manufacturing change management
- Grant number: 437763513
- Fiscal year: 2020
- Funding amount: $17,500
- Project category: Research Grants
Systematic Review and Cost-Effectiveness Analysis of Wandering Prevention for Dementia Elderly
- Grant number: 20K18896
- Fiscal year: 2020
- Funding amount: $17,500
- Project category: Grant-in-Aid for Early-Career Scientists
Cost-effectiveness of palivizumab prophylaxis for respiratory syncytial virus (RSV): A systematic review
- Grant number: 400020
- Fiscal year: 2019
- Funding amount: $17,500
- Project category:
The impact of supplementary prescription drug insurance and cost-sharing on drug use, health services utilization and health in Canada: a systematic review
- Grant number: 358603
- Fiscal year: 2016
- Funding amount: $17,500
- Project category: Operating Grants
Effectiveness and cost-effectiveness of systematic screening for Lynch Syndrome (LS) in Australia
- Grant number: nhmrc : GNT1080246
- Fiscal year: 2015
- Funding amount: $17,500
- Project category: Project Grants
Effectiveness and cost-effectiveness of systematic screening for Lynch Syndrome (LS) in Australia
- Grant number: nhmrc : 1080246
- Fiscal year: 2015
- Funding amount: $17,500
- Project category: Project Grants