CAREER: From Nonstop-Monitoring to Nano-ISA: An Adaptive Multi-Dimensional Framework for Processor Reliability

职业生涯:从不间断监控到 Nano-ISA:处理器可靠性的自适应多维框架

基本信息

  • 批准号:
    0954211
  • 负责人:
  • 金额:
    $ 42.77万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Continuing Grant
  • 财政年份:
    2010
  • 资助国家:
    美国
  • 起止时间:
    2010-08-01 至 2017-07-31
  • 项目状态:
    已结题

项目摘要

Successful use of computing technologies has enhanced quality of life. In fact, it is hard to point to even one aspect of societal functioning that is not impacted positively by computer systems. The tremendous advances in computer system performance while simultaneously lowering the computing cost have been the primary reason for the success of computing. However, shrinking device sizes have recently lead to new challenges in system reliability. System reliability is an uncompromising concern and is the primary focus of this proposed research. Rather than addressing one single reliability concern, such as soft errors or process variations, this proposal takes a multi-dimensional approach to improve Mean-Time-To-Failure (MTTF). Reliability degrades extremely slowly over time and hence the solutions proposed in this research are also low cost solutions. In order to develop low cost solutions, the first step is to non-intrusively monitor the health of a processor to understand its aging process before taking proactive measures for detecting errors. When the error is detected, instead of employing expensive hardware solutions, this proposal uses low cost and flexible software mechanisms to correct the errors. While any one approach will certainly extend MTTF, the true benefits of the proposed research will bear fruition when error monitoring, detection and correction are employed in a hierarchical framework based on reliability needs. The broader impacts of this proposal are on two fronts. The proposed research is motivated by industrial concerns regarding system reliability. On the technology front, the low cost solutions developed will be transferred for industry adoption through close industry-academia interactions. Most of the proposed research ideas will be designed and implemented by the research team as research prototypes. These prototypes will be shared with industrial partners for further evaluations in an industrial setting. Woman and minority student recruitment will be one of the key driving force to encourage broader participation in the proposed research. This objective will be achieved through active participation and involvement of USC's Family of Schools in Los Angeles.
计算机技术的成功使用提高了生活质量。事实上,很难指出社会功能的任何一个方面没有受到计算机系统的积极影响。计算机系统性能的巨大进步,同时降低计算成本已经成为计算成功的主要原因。然而,不断缩小的设备尺寸最近导致了系统可靠性方面的新挑战。系统可靠性是一个不容妥协的问题,是本研究的主要重点。而不是解决一个单一的可靠性问题,如软错误或工艺变化,该建议采取了多维的方法来提高平均故障时间(MTTF)。随着时间的推移,可靠性下降非常缓慢,因此在这项研究中提出的解决方案也是低成本的解决方案。为了开发低成本的解决方案,第一步是在采取主动措施检测错误之前,非侵入式地监控处理器的健康状况,以了解其老化过程。当检测到错误时,该建议使用低成本和灵活的软件机制来纠正错误,而不是采用昂贵的硬件解决方案。 虽然任何一种方法肯定会延长MTTF,真正的好处,所提出的研究将结出硕果时,错误的监控,检测和纠正采用的层次框架的基础上的可靠性需求。这一提议的更广泛影响体现在两个方面。所提出的研究是出于对系统可靠性的工业关注。在技术方面,开发的低成本解决方案将通过密切的产学互动转移给行业采用。大部分提出的研究思路将由研究团队作为研究原型进行设计和实施。这些原型将与工业合作伙伴共享,以便在工业环境中进行进一步评估。招收妇女和少数民族学生将是鼓励更广泛地参与拟议研究的主要推动力之一。这一目标将通过南加州大学在洛杉矶的学校家庭的积极参与和参与来实现。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Murali Annavaram其他文献

A privacy mechanism for mobile-based urban traffic monitoring
  • DOI:
    10.1016/j.pmcj.2014.12.007
  • 发表时间:
    2015-07-01
  • 期刊:
  • 影响因子:
  • 作者:
    Chi Wang;Hua Liu;Kwame-Lante Wright;Bhaskar Krishnamachari;Murali Annavaram
  • 通讯作者:
    Murali Annavaram
Differentially Private Next-Token Prediction of Large Language Models
大型语言模型的差分隐私下一个标记预测

Murali Annavaram的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Murali Annavaram', 18)}}的其他基金

SHF: Small: ML Accelerator Cohort Architecture
SHF:小型:ML 加速器群组架构
  • 批准号:
    2224319
  • 财政年份:
    2022
  • 资助金额:
    $ 42.77万
  • 项目类别:
    Standard Grant
Student Travel Support for the 2018 International Symposium on Computer Architecture (ISCA)
2018 年计算机体系结构国际研讨会 (ISCA) 学生旅行支持
  • 批准号:
    1812942
  • 财政年份:
    2018
  • 资助金额:
    $ 42.77万
  • 项目类别:
    Standard Grant
SHF:Small: Accelerating Graph Analytics Through Coordinated Storage, Memory and Computing Advances
SHF:Small:通过协调存储、内存和计算进步加速图形分析
  • 批准号:
    1719074
  • 财政年份:
    2017
  • 资助金额:
    $ 42.77万
  • 项目类别:
    Standard Grant
SHF:Small: Benchmarking of Transient and Intermittent Errors and Their Application to Microarchitecture
SHF:Small:瞬态和间歇性错误的基准测试及其在微架构中的应用
  • 批准号:
    1219186
  • 财政年份:
    2012
  • 资助金额:
    $ 42.77万
  • 项目类别:
    Standard Grant
IEEE International Symposium on Workload Characterization (IISWC) Student Subsidy Proposal
IEEE 国际工作负载表征研讨会 (IISWC) 学生资助提案
  • 批准号:
    1104542
  • 财政年份:
    2011
  • 资助金额:
    $ 42.77万
  • 项目类别:
    Standard Grant
CSR-PSCE,SM: Trade-offs Between Static Power, Performance and Reliability in Future Chip Multiprocessors
CSR-PSCE,SM:未来芯片多处理器静态功耗、性能和可靠性之间的权衡
  • 批准号:
    0834799
  • 财政年份:
    2008
  • 资助金额:
    $ 42.77万
  • 项目类别:
    Standard Grant
CSR-PSCE,SM: A Holistic Design Approach to Reliability Using 3D Stacked
CSR-PSCE,SM:使用 3D 堆叠的可靠性整体设计方法
  • 批准号:
    0834798
  • 财政年份:
    2008
  • 资助金额:
    $ 42.77万
  • 项目类别:
    Standard Grant
CT-ISG: A Game Theoretic Framework for Privacy Preservation in Community-Based Mobile Applications
CT-ISG:基于社区的移动应用程序中隐私保护的博弈论框架
  • 批准号:
    0831545
  • 财政年份:
    2008
  • 资助金额:
    $ 42.77万
  • 项目类别:
    Standard Grant

相似海外基金

Mechanistic investigations on the role of the ribosome-bound chaperones RAC and Ssb during nonstop- and polylysine protein expression
核糖体结合伴侣 RAC 和 Ssb 在不间断和多聚赖氨酸蛋白表达过程中作用的机制研究
  • 批准号:
    244586127
  • 财政年份:
    2013
  • 资助金额:
    $ 42.77万
  • 项目类别:
    Research Grants
Mechanisms of Nonstop Extension Mutations in Tumor Suppressor Genes
抑癌基因不间断延伸突变的机制
  • 批准号:
    397982491
  • 财政年份:
  • 资助金额:
    $ 42.77万
  • 项目类别:
    Research Grants
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了