HECURA: Toward Automated Problem Analysis of Large Scale Storage Systems

HECURA:迈向大规模存储系统的自动化问题分析

基本信息

  • 批准号:
    0621508
  • 负责人:
  • 金额:
    $ 99万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2006
  • 资助国家:
    美国
  • 起止时间:
    2006-07-15 至 2009-06-30
  • 项目状态:
    已结题

项目摘要

CMU proposes to explore methodologies and algorithms for automating analysis of failures and performance degradations in large-scale storage systems. Problem analysis includes such crucial tasks as identifying which component(s) misbehaved, likely root causes, and supporting evidence for any conclusions. Combining statistical tools with appropriate instrumentation, we hope to dramatically reduce the difficulty of analyzing performance and reliability problems in deployed storage systems. Such tools, integrated with automated reaction logic, also provide an essential building block for the longer-term goal of self-healing.Automating problem analysis is crucial to achieving cost-effective storage at the scales needed for tomorrow's high-end computing systems. The number of hardware and software components will make problems common rather than anomalous, so it must be possible to quickly move from problem to fix with little-to-no system downtime for analysis. Further, the distributed software complexity of such systems make by-hand analysis increasingly untenable. More nuanced, but perhaps of most concern, implementors of these storage systems are increasingly unable to test in representative high-end computing environments because they simply cannot afford to recreate the necessary system scale. As a result, scale-related problems must be analyzed in the field to allow improvements to be made, which introduces delays and productivity reductions for customers/users plus issues of clearance for systems deployed to support highly sensitive activities. Current designs and tools fall far short of what is needed.
CMU提出探索在大规模存储系统中自动分析故障和性能下降的方法和算法。问题分析包括这样一些关键的任务,如确定哪些组件行为不当,可能的根本原因,以及支持任何结论的证据。将统计工具与适当的仪器相结合,我们希望大大降低分析已部署存储系统的性能和可靠性问题的难度。这些工具与自动反应逻辑集成在一起,还为实现自我修复的长期目标提供了必要的构建块。自动化问题分析对于实现未来高端计算系统所需规模的经济高效存储至关重要。硬件和软件组件的数量将使问题变得常见,而不是异常,因此必须能够快速地从问题转移到修复,几乎不需要系统停机时间进行分析。此外,这种系统的分布式软件复杂性使得手工分析越来越站不住脚。更微妙的是,这些存储系统的实现者越来越无法在具有代表性的高端计算环境中进行测试,因为他们根本负担不起重新创建必要的系统规模。因此,必须在现场分析与规模有关的问题,以便进行改进,这给客户/用户带来了延迟和生产力降低,以及为支持高度敏感活动而部署的系统的许可问题。目前的设计和工具远远不能满足需要。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Priya Narasimhan其他文献

Priya Narasimhan的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Priya Narasimhan', 18)}}的其他基金

CAREER: Integrated Fault Tolerance and Real-Time Support for Middleware Applications
职业:中间件应用程序的集成容错和实时支持
  • 批准号:
    0238381
  • 财政年份:
    2003
  • 资助金额:
    $ 99万
  • 项目类别:
    Continuing Grant

相似国自然基金

Toward a general theory of intermittent aeolian and fluvial nonsuspended sediment transport
  • 批准号:
  • 批准年份:
    2022
  • 资助金额:
    55 万元
  • 项目类别:

相似海外基金

Toward an automated analysis of bifurcations of dynamical systems
动力系统分岔的自动分析
  • 批准号:
    23K17657
  • 财政年份:
    2023
  • 资助金额:
    $ 99万
  • 项目类别:
    Grant-in-Aid for Challenging Research (Exploratory)
Collaborative Research: SaTC: CORE: Medium: Audacity of Exploration: Toward Automated Discovery of Security Flaws in Networked Systems through Intelligent Documentation Analysis
协作研究:SaTC:核心:中:大胆探索:通过智能文档分析自动发现网络系统中的安全缺陷
  • 批准号:
    2409269
  • 财政年份:
    2023
  • 资助金额:
    $ 99万
  • 项目类别:
    Standard Grant
Toward Automated Uncertainty Quantification in Causal Inference
因果推理中的自动化不确定性量化
  • 批准号:
    2310831
  • 财政年份:
    2023
  • 资助金额:
    $ 99万
  • 项目类别:
    Continuing Grant
SHF: Small: Toward Fully Automated Formal Software Verification
SHF:小型:迈向全自动形式软件验证
  • 批准号:
    2210243
  • 财政年份:
    2022
  • 资助金额:
    $ 99万
  • 项目类别:
    Standard Grant
Collaborative Research: SaTC: CORE: Medium: Audacity of Exploration: Toward Automated Discovery of Security Flaws in Networked Systems through Intelligent Documentation Analysis
协作研究:SaTC:核心:中:大胆探索:通过智能文档分析自动发现网络系统中的安全缺陷
  • 批准号:
    2154138
  • 财政年份:
    2022
  • 资助金额:
    $ 99万
  • 项目类别:
    Standard Grant
Collaborative Research: SaTC: CORE: Medium: Audacity of Exploration: Toward Automated Discovery of Security Flaws in Networked Systems through Intelligent Documentation Analysis
协作研究:SaTC:核心:中:大胆探索:通过智能文档分析自动发现网络系统中的安全缺陷
  • 批准号:
    2154199
  • 财政年份:
    2022
  • 资助金额:
    $ 99万
  • 项目类别:
    Standard Grant
Collaborative Research: SaTC: CORE: Medium: Audacity of Exploration: Toward Automated Discovery of Security Flaws in Networked Systems through Intelligent Documentation Analysis
协作研究:SaTC:核心:中:大胆探索:通过智能文档分析自动发现网络系统中的安全缺陷
  • 批准号:
    2154078
  • 财政年份:
    2022
  • 资助金额:
    $ 99万
  • 项目类别:
    Standard Grant
Toward Self-Driving Safety: Chassis Dynamics Domain Control for Automated Vehicles
迈向自动驾驶安全:自动驾驶车辆的底盘动力学域控制
  • 批准号:
    21F21362
  • 财政年份:
    2021
  • 资助金额:
    $ 99万
  • 项目类别:
    Grant-in-Aid for JSPS Fellows
Toward Automated Video Quality Assessment of Ultrasound
超声自动化视频质量评估
  • 批准号:
    2431522
  • 财政年份:
    2020
  • 资助金额:
    $ 99万
  • 项目类别:
    Studentship
SaTC: CORE: Small: Toward Fully Automated Data-Driven Analysis of Web Censorship
SaTC:核心:小型:迈向网络审查的全自动数据驱动分析
  • 批准号:
    1814817
  • 财政年份:
    2018
  • 资助金额:
    $ 99万
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了