Collaborative Research: SHF: Core: Medium: Causal Performance Debugging for Highly-Configurable Systems

协作研究:SHF:核心:中:高度可配置系统的因果性能调试

基本信息

  • 批准号:
    2106853
  • 负责人:
  • 金额:
    $ 40.86万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2021
  • 资助国家:
    美国
  • 起止时间:
    2021-10-01 至 2024-09-30
  • 项目状态:
    已结题

项目摘要

Software performance is critical for most software systems to achieve scale and limit operating costs and energy consumption. As modern software systems, such as big data and machine-learning systems, are increasingly built by composing many reusable infrastructure components and deployed on distributed and heterogeneous hardware, developers have powerful tools and abstractions at their fingertips, and as a result face immense configuration complexity. Software and hardware need to be selected and configured carefully to achieve high performance for a given system and task. Unfortunately, in practice, performance faults and misconfigurations are common, where a system performs much worse than expected, not achieving its mission or simply wasting cost and energy. In large configuration spaces, end-users and developers face severe challenges in understanding and fixing performance faults by changing software configuration, changing hardware deployment, or modifying the software's code itself. Current approaches that model system performance by analyzing correlations among performance measurements and options are slow and may produce misleading results, obfuscating the actual causes of performance faults. Even if they can fix the problem, most of them cannot explain why (1) the obtained configurations are the real cause of the problem, and (2) a user/developer should consider the proposed recommendations. In both cases, the lack of explainability is a big issue. The project is intended to initiate a paradigm shift in today's testing and debugging methodology for complex, highly configurable systems, thereby positively impacting a broad range of industrial sectors relying on complex, highly configurable systems. Specifically, the project contributes to substantial energy savings and reduced carbon emissions, especially for the many big-data and machine-learning systems that operate at a massive scale. Finally, the research is providing valuable training for involved students from diverse backgrounds in research and generating high-quality researchers and practitioners for society. This project develops and evaluates foundations and tools for a causal approach to performance modeling and performance debugging. This project introduces the new concept of causal performance models that are learned using causal structure learning by intervening over configuration options and observing system performance regarding (multiple) performance objectives, rather than just analyzing correlations. Causal models enable causal inference and counterfactual reasoning for numerous tasks, including debugging performance faults and misconfigurations. Based on a solid technical foundation of causal modeling and extensive experience with performance modeling for configurable systems, this project develops innovations in three thrusts: (1) It designs and refines a causal modeling approach for software performance of systems composed of multiple configurable components, using innovations in sampling strategies, code analysis, compositional reasoning, and transfer learning to build accurate causal models efficiently. (2) It develops and evaluates user-facing tool support, based on causal models, to help users select well-performing configurations for their specific tasks and hardware and resolve misconfiguration faults with configuration changes, highlighting the (causal) performance impact of configuration decisions and providing a Pareto analysis of involved tradeoffs. (3) It develops and evaluates developer-facing tool support to foster code-level debugging and documentation. Finally, all contributions are being evaluated end-to-end with developers on real performance faults, showing how both users and developers benefit from causal models and related tools.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
软件性能对于大多数软件系统实现规模和限制运营成本和能耗至关重要。随着现代软件系统(如大数据和机器学习系统)越来越多地通过组成许多可重用的基础设施组件并部署在分布式和异构硬件上来构建,开发人员拥有强大的工具和抽象,因此面临巨大的配置复杂性。需要仔细选择和配置软件和硬件,以实现给定系统和任务的高性能。不幸的是,在实践中,性能故障和错误配置是常见的,其中系统的性能比预期差得多,没有实现其使命或只是浪费成本和能量。在大型配置空间中,最终用户和开发人员在通过更改软件配置、更改硬件部署或修改软件代码本身来理解和修复性能故障方面面临严峻的挑战。目前的方法,通过分析性能测量和选项之间的相关性来建模系统性能是缓慢的,可能会产生误导性的结果,混淆性能故障的实际原因。即使他们可以解决问题,他们中的大多数人也无法解释为什么(1)获得的配置是问题的真实的原因,以及(2)用户/开发人员应该考虑提出的建议。在这两种情况下,缺乏可解释性是一个大问题。该项目旨在为当今复杂、高度可配置的系统的测试和调试方法带来范式转变,从而对依赖复杂、高度可配置系统的广泛工业部门产生积极影响。具体而言,该项目有助于节省大量能源和减少碳排放,特别是对于许多大规模运行的大数据和机器学习系统。最后,这项研究为来自不同背景的学生提供了宝贵的培训,并为社会培养了高素质的研究人员和从业人员。这个项目开发和评估的基础和工具的因果关系的方法,性能建模和性能调试。该项目引入了因果性能模型的新概念,通过干预配置选项和观察系统性能(多个)性能目标,而不仅仅是分析相关性,使用因果结构学习来学习。因果模型使因果推理和反事实推理的许多任务,包括调试性能故障和错误配置。基于坚实的因果建模技术基础和丰富的可配置系统性能建模经验,本项目在三个方面进行了创新:(1)设计和改进了一种用于多个可配置组件组成的系统的软件性能的因果建模方法,使用采样策略,代码分析,组合推理和迁移学习的创新来有效地建立准确的因果模型。(2)它基于因果模型开发和评估面向用户的工具支持,以帮助用户为其特定任务和硬件选择性能良好的配置,并通过配置更改解决错误配置故障,突出配置决策的(因果)性能影响,并提供对所涉及权衡的帕累托分析。(3)它开发和评估面向开发人员的工具支持,以促进代码级调试和文档编制。最后,所有的贡献都将在真实的性能故障上与开发人员进行端到端的评估,展示用户和开发人员如何从因果模型和相关工具中受益。该奖项反映了NSF的法定使命,并被认为值得通过使用基金会的智力价值和更广泛的影响审查标准进行评估来支持。

项目成果

期刊论文数量(1)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
On Debugging the Performance of Configurable Software Systems: Developer Needs and Tailored Tool Support
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Christian Kastner其他文献

The Leadership factor: A study of leadership-styles in transformation
领导因素:转型中的领导风格研究
  • DOI:
  • 发表时间:
    2019
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Christian Kastner
  • 通讯作者:
    Christian Kastner
Multi-laboratory evaluation of the reproducibility of polymer biodegradation assessments applying standardized and modified respirometry methods
应用标准化和改良呼吸测定法对聚合物生物降解评估的再现性的多实验室评估
  • DOI:
    10.1016/j.scitotenv.2023.166339
  • 发表时间:
    2023-11-25
  • 期刊:
  • 影响因子:
    8.000
  • 作者:
    Kathleen McDonough;Glauco Battagliarin;Jennifer Menzies;Jared Bozich;Marlies Bergheim;Bjorn Hidding;Christian Kastner;Bahar Koyuncu;Georg Kreutzer;Hans Leijs;Yash Parulekar;Meera Raghuram;Nathalie Vallotton
  • 通讯作者:
    Nathalie Vallotton
The Role of a Leader: Transformational Efforts in Innovation and Change
领导者的角色:创新和变革中的转型努力
  • DOI:
    10.1007/978-3-030-57642-4_6
  • 发表时间:
    2021
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Christian Kastner
  • 通讯作者:
    Christian Kastner
MAREG and WinMAREG A tool for marginal regression models
MAREG 和 WinMAREG 边际回归模型工具
  • DOI:
  • 发表时间:
    1997
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Christian Kastner;Andreas Fieger;C. Heumann
  • 通讯作者:
    C. Heumann

Christian Kastner的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Christian Kastner', 18)}}的其他基金

Collaborative Proposal: SaTC: Frontiers: Enabling a Secure and Trustworthy Software Supply Chain
协作提案:SaTC:前沿:实现安全可信的软件供应链
  • 批准号:
    2206859
  • 财政年份:
    2022
  • 资助金额:
    $ 40.86万
  • 项目类别:
    Continuing Grant
Collaborative Research: DASS: Policy Design for Holding AI-Supported Systems Accountable
合作研究:DASS:让人工智能支持的系统承担责任的政策设计
  • 批准号:
    2131477
  • 财政年份:
    2021
  • 资助金额:
    $ 40.86万
  • 项目类别:
    Standard Grant
NSF Student and Early-Career Faculty Travel Grant for IEEE International Conference on Software Engineering 2020 (ICSE)
NSF 学生和早期职业教师 2020 年 IEEE 国际软件工程会议 (ICSE) 旅费补助
  • 批准号:
    2002420
  • 财政年份:
    2020
  • 资助金额:
    $ 40.86万
  • 项目类别:
    Standard Grant
NSF Student and Early-Career Faculty Travel Grant for IEEE International Conference on Software Engineering 2019 (ICSE)
NSF 学生和早期职业教师 2019 年 IEEE 国际软件工程会议 (ICSE) 旅费补助
  • 批准号:
    1922878
  • 财政年份:
    2019
  • 资助金额:
    $ 40.86万
  • 项目类别:
    Standard Grant
SHF: SMALL: Streamlining Fork-Based Software Development
SHF:小型:简化基于分叉的软件开发
  • 批准号:
    1813598
  • 财政年份:
    2018
  • 资助金额:
    $ 40.86万
  • 项目类别:
    Standard Grant
SaTC: CORE: Small: Trustworthy Dependency Management
SaTC:核心:小型:值得信赖的依赖管理
  • 批准号:
    1717022
  • 财政年份:
    2017
  • 资助金额:
    $ 40.86万
  • 项目类别:
    Standard Grant
CAREER:VARIATIONAL EXECUTION
职业:变量执行
  • 批准号:
    1552944
  • 财政年份:
    2016
  • 资助金额:
    $ 40.86万
  • 项目类别:
    Continuing Grant
SHF: Small: Reverse Engineering Variability Implementations
SHF:小型:逆向工程可变性实施
  • 批准号:
    1318808
  • 财政年份:
    2013
  • 资助金额:
    $ 40.86万
  • 项目类别:
    Standard Grant

相似国自然基金

Research on Quantum Field Theory without a Lagrangian Description
  • 批准号:
    24ZR1403900
  • 批准年份:
    2024
  • 资助金额:
    0.0 万元
  • 项目类别:
    省市级项目
Cell Research
  • 批准号:
    31224802
  • 批准年份:
    2012
  • 资助金额:
    24.0 万元
  • 项目类别:
    专项基金项目
Cell Research
  • 批准号:
    31024804
  • 批准年份:
    2010
  • 资助金额:
    24.0 万元
  • 项目类别:
    专项基金项目
Cell Research (细胞研究)
  • 批准号:
    30824808
  • 批准年份:
    2008
  • 资助金额:
    24.0 万元
  • 项目类别:
    专项基金项目
Research on the Rapid Growth Mechanism of KDP Crystal
  • 批准号:
    10774081
  • 批准年份:
    2007
  • 资助金额:
    45.0 万元
  • 项目类别:
    面上项目

相似海外基金

Collaborative Research: SHF: Medium: Differentiable Hardware Synthesis
合作研究:SHF:媒介:可微分硬件合成
  • 批准号:
    2403134
  • 财政年份:
    2024
  • 资助金额:
    $ 40.86万
  • 项目类别:
    Standard Grant
Collaborative Research: SHF: Small: LEGAS: Learning Evolving Graphs At Scale
协作研究:SHF:小型:LEGAS:大规模学习演化图
  • 批准号:
    2331302
  • 财政年份:
    2024
  • 资助金额:
    $ 40.86万
  • 项目类别:
    Standard Grant
Collaborative Research: SHF: Small: LEGAS: Learning Evolving Graphs At Scale
协作研究:SHF:小型:LEGAS:大规模学习演化图
  • 批准号:
    2331301
  • 财政年份:
    2024
  • 资助金额:
    $ 40.86万
  • 项目类别:
    Standard Grant
Collaborative Research: SHF: Small: Efficient and Scalable Privacy-Preserving Neural Network Inference based on Ciphertext-Ciphertext Fully Homomorphic Encryption
合作研究:SHF:小型:基于密文-密文全同态加密的高效、可扩展的隐私保护神经网络推理
  • 批准号:
    2412357
  • 财政年份:
    2024
  • 资助金额:
    $ 40.86万
  • 项目类别:
    Standard Grant
Collaborative Research: SHF: Medium: Enabling Graphics Processing Unit Performance Simulation for Large-Scale Workloads with Lightweight Simulation Methods
合作研究:SHF:中:通过轻量级仿真方法实现大规模工作负载的图形处理单元性能仿真
  • 批准号:
    2402804
  • 财政年份:
    2024
  • 资助金额:
    $ 40.86万
  • 项目类别:
    Standard Grant
Collaborative Research: SHF: Medium: Tiny Chiplets for Big AI: A Reconfigurable-On-Package System
合作研究:SHF:中:用于大人工智能的微型芯片:可重新配置的封装系统
  • 批准号:
    2403408
  • 财政年份:
    2024
  • 资助金额:
    $ 40.86万
  • 项目类别:
    Standard Grant
Collaborative Research: SHF: Medium: Toward Understandability and Interpretability for Neural Language Models of Source Code
合作研究:SHF:媒介:实现源代码神经语言模型的可理解性和可解释性
  • 批准号:
    2423813
  • 财政年份:
    2024
  • 资助金额:
    $ 40.86万
  • 项目类别:
    Standard Grant
Collaborative Research: SHF: Medium: Enabling GPU Performance Simulation for Large-Scale Workloads with Lightweight Simulation Methods
合作研究:SHF:中:通过轻量级仿真方法实现大规模工作负载的 GPU 性能仿真
  • 批准号:
    2402806
  • 财政年份:
    2024
  • 资助金额:
    $ 40.86万
  • 项目类别:
    Standard Grant
Collaborative Research: SHF: Medium: Differentiable Hardware Synthesis
合作研究:SHF:媒介:可微分硬件合成
  • 批准号:
    2403135
  • 财政年份:
    2024
  • 资助金额:
    $ 40.86万
  • 项目类别:
    Standard Grant
Collaborative Research: SHF: Medium: Tiny Chiplets for Big AI: A Reconfigurable-On-Package System
合作研究:SHF:中:用于大人工智能的微型芯片:可重新配置的封装系统
  • 批准号:
    2403409
  • 财政年份:
    2024
  • 资助金额:
    $ 40.86万
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了