NeTS: Small: Understanding the Impact of Unreliable Hardware on the Resilience of Networked Systems

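Basic information

  • Award number:
    1117049
  • Principal investigator:
  • Amount:
    $440,000
  • Host institution:
  • Host institution country:
    United States
  • Award type:
    Standard Grant
  • Fiscal year:
    2011
  • Funding country:
    United States
  • Duration:
    2011-08-15 to 2015-07-31
  • Status:
    Completed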

Project abstract

Networked systems have always been designed to operate even in the presence of failures, especially in communication links and storage. Until recently, other components of such systems had relatively low failure probabilities, and for most networked systems the desired level of resilience could be achieved using minimal redundancy added in an ad hoc manner. Two opposing trends are likely to make achieving resilience significantly more difficult in the coming years: (a) increasing hardware failure probabilities: with the move towards finer nano-scale fabrication, chips are increasingly vulnerable to soft errors caused by external noise and are increasingly likely to fail early due to fatigue; (b) higher resilience requirements: as critical services continue to migrate to clouds, service providers are compelled into more stringent service-level agreements (SLAs), including higher reliability, higher availability, and tighter guarantees on service times. This combination can dramatically increase the overhead of existing approaches for achieving desired levels of resilience.

Intellectual merit: The first outcome of this project will be a holistic roadmap for the resilience of networked systems. This resilience roadmap will take the roadmaps for nano-scale CMOS (trends in chip cost, functionality, performance, power, and the resilience attainable at chip level) and attempt to realistically project the future cost of currently used networking and systems techniques for achieving the desired level of resilience. The second outcome will be resilience methods that scale gracefully in the face of increasing hardware failures. Such techniques will use novel partitioned redundancy strategies that achieve reliability at different levels across hardware and software layers.

Broader impacts: The resilience roadmap will provide unprecedented understanding of the trends in resilience and a uniquely realistic assessment of challenges and opportunities. This will significantly influence research in both the hardware and networking communities. A systematic design of scalable resilience methods will lead to significantly higher levels of resilience, lower costs, both capital (equipment) and recurring (especially energy), and/or higher levels of performance. The utilitarian gains to society from the proposed project are likely to be substantial, since networked systems now constitute one of our most critical infrastructures and consume an increasingly large proportion of our resources. This project will draw upon two different disciplines, hardware architecture and networked systems, and will involve detailed case studies and the development of completely new theory and techniques. It will therefore provide unique educational and training opportunities for students and working professionals in these fields.

Budget Impact Statement: The item numbers in this paragraph refer to those in Figure 9 and Section 3.2 (entitled 'Proposed Research Tasks and Plan') of our original proposal. We will undertake all tasks and sub-tasks proposed in item-1 (and all its sub-items). In item-2, we will undertake the development of a general framework to consider all basic redundancy schemes and alternative ways of deploying them (sub-item-2.1). We will also characterize the associated tradeoffs (sub-item-2.2) and the consequences of realistic constraints (sub-item-2.3). However, we will pursue the development of prototype tools (as outlined in sub-item-2.4) only to the extent necessary to demonstrate the benefits of our approach and to conduct case studies (described in item-3). Finally, we will undertake the case studies as originally proposed in item-3.

Project outcomes

Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)

Other publications by Sandeep Gupta

Collaborative circuit designs using the CRAFT repository
  • DOI:
    10.1016/j.future.2018.01.018
  • Published:
    2018
  • Journal:
  • Impact factor:
    0
  • Authors:
    Adam Brinckman;E. Deelman;Sandeep Gupta;J. Nabrzyski;Soowang Park;Rafael Ferreira da Silva;I. Taylor;K. Vahi
  • Corresponding author:
    K. Vahi
Seismotectonics and crustal stress field in the
  • DOI:
  • Published:
    2015
  • Journal:
  • Impact factor:
    0
  • Authors:
    P. Mahesh;Sandeep Gupta
  • Corresponding author:
    Sandeep Gupta
Chronic infection in the aetiology of atherosclerosis--focus on Chlamydia pneumoniae.
  • DOI:
  • Published:
    1999
  • Journal:
  • Impact factor:
    5.3
  • Authors:
    Sandeep Gupta
  • Corresponding author:
    Sandeep Gupta
Case Studies on Biological Treatment of Tannery Effluents in India
Locally recurrent renal cell carcinoma involving large gut presenting with life threatening Gastrointestinal bleed: A rare presentation and review of literature
  • DOI:
  • Published:
    2020
  • Journal:
  • Impact factor:
    0
  • Authors:
    M. Arya;A. Singhal;Yogendra Shyoran;Sandeep Gupta;A. Gandhi;M. Sonwal;Rakesh Maan
  • Corresponding author:
    Rakesh Maan

Other grants held by Sandeep Gupta

SHF: Small: New models, design, and test methods for long-term aging of nanometer VLSI
  • Award number:
    1719047
  • Fiscal year:
    2017
  • Amount:
    $440,000
  • Award type:
    Standard Grant
Theory, methods, and tools for cross-layered design of uniquely efficient failure-resistant systems
  • Award number:
    1255951
  • Fiscal year:
    2013
  • Amount:
    $440,000
  • Award type:
    Continuing Grant
Verification of closed loop feedback/feed-forward control actions for safe medical devices
  • Award number:
    1231590
  • Fiscal year:
    2012
  • Amount:
    $440,000
  • Award type:
    Standard Grant
CSR: Small: Understanding and Modeling the Trade-Offs in Data Centers for Next-Generation Sustainable Management
  • Award number:
    1218505
  • Fiscal year:
    2012
  • Amount:
    $440,000
  • Award type:
    Standard Grant
SHB: Small: Toward Verifying Smart-Health Infrastructure Safety from their Impact on Human Physiology
  • Award number:
    1116385
  • Fiscal year:
    2011
  • Amount:
    $440,000
  • Award type:
    Standard Grant
TC: Small: EDICT: Evaluation and Design of IC's for Trustworthiness
  • Award number:
    1018937
  • Fiscal year:
    2010
  • Amount:
    $440,000
  • Award type:
    Standard Grant
II-EN: BlueTool: Infrastructure for Innovative Cyberphysical Data Center Management Research
  • Award number:
    0855277
  • Fiscal year:
    2009
  • Amount:
    $440,000
  • Award type:
    Continuing Grant
CT-ISG: Physiological Value based Security for Body Area Networks
  • Award number:
    0831544
  • Fiscal year:
    2008
  • Amount:
    $440,000
  • Award type:
    Standard Grant
EMT/MISC: Theory and methods for design and synthesis of approximate logic circuits and systems: a paradigm for emerging technologies
  • Award number:
    0829946
  • Fiscal year:
    2008
  • Amount:
    $440,000
  • Award type:
    Standard Grant
CSR-DMSS, SM: Next-Generation Thermal-Aware, Energy-Efficient Resource Management for Data Centers
  • Award number:
    0834797
  • Fiscal year:
    2008
  • Amount:
    $440,000
  • Award type:
    Standard Grant

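Similar grants from the National Natural Science Foundation of China (NSFC)

Forensic application of circadian-rhythm small RNAs in estimating the time of bloodstain formation
  • Award number:
  • Approval year:
    2024
  • Amount:
    CNY 0
  • Award type:
    Provincial/municipal project
Mechanism by which tRNA-derived small RNA upregulates the YBX1/CCL5 pathway in bortezomib-induced chronic pain
  • Award number:
    n/a
  • Approval year:
    2022
  • Amount:
    CNY 100,000
  • Award type:
    Provincial/municipal project
Small RNA regulation of the type I-F CRISPR-Cas adaptive immune response and its molecular mechanism
  • Award number:
    32000033
  • Approval year:
    2020
  • Amount:
    CNY 240,000
  • Award type:
    Young Scientists Fund
Mechanism by which small RNAs regulate the biocontrol function of Bacillus amyloliquefaciens FZB42
  • Award number:
    31972324
  • Approval year:
    2019
  • Amount:
    CNY 580,000
  • Award type:
    General Program
Mechanism by which Streptococcus mutans small RNAs link LuxS quorum sensing and biofilm formation
  • Award number:
    81900988
  • Approval year:
    2019
  • Amount:
    CNY 210,000
  • Award type:
    Young Scientists Fund
Function and mechanism of key gut bacterial small RNAs in the onset and progression of Crohn's disease
  • Award number:
    31870821
  • Approval year:
    2018
  • Amount:
    CNY 560,000
  • Award type:
    General Program
Molecular mechanism of pigeon milk secretion analyzed by small RNA sequencing
  • Award number:
    31802058
  • Approval year:
    2018
  • Amount:
    CNY 260,000
  • Award type:
    Young Scientists Fund
Pathogenic mechanism of rice grassy stunt virus regulated by small RNA-mediated DNA methylation
  • Award number:
    31772128
  • Approval year:
    2017
  • Amount:
    CNY 600,000
  • Award type:
    General Program
Immunoregulatory mechanism of acupuncture treatment for Hashimoto's thyroiditis based on small RNA-seq
  • Award number:
    81704176
  • Approval year:
    2017
  • Amount:
    CNY 200,000
  • Award type:
    Young Scientists Fund
Regulation of small RNA biogenesis by rice OsSGS3 and OsHEN1 and its modulation of disease resistance
  • Award number:
    91640114
  • Approval year:
    2016
  • Amount:
    CNY 850,000
  • Award type:
    Major Research Plan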

Similar overseas grants

SaTC: CORE: Small: NSF-DST: Understanding Network Structure and Communication for Supporting Information Authenticity
  • Award number:
    2343387
  • Fiscal year:
    2024
  • Amount:
    $440,000
  • Award type:
    Standard Grant
CAREER: Understanding the Dynamic Mechanical Adaptations of Bone Tissue at Small Length Scales
  • Award number:
    2339836
  • Fiscal year:
    2024
  • Amount:
    $440,000
  • Award type:
    Standard Grant
RI: Small: Understanding Hand Interaction In The Jumble of Internet Videos
  • Award number:
    2426592
  • Fiscal year:
    2024
  • Amount:
    $440,000
  • Award type:
    Standard Grant
Understanding prokaryotic small proteins from context
  • Award number:
    FT230100724
  • Fiscal year:
    2023
  • Amount:
    $440,000
  • Award type:
    ARC Future Fellowships
Collaborative Research: SaTC: CORE: Small: Understanding the Limitations of Wireless Network Security Designs Leveraging Wireless Properties: New Threats and Defenses in Practice
  • Award number:
    2316720
  • Fiscal year:
    2023
  • Amount:
    $440,000
  • Award type:
    Standard Grant
Collaborative Research: RI: Small: Motion Fields Understanding for Enhanced Long-Range Imaging
  • Award number:
    2232298
  • Fiscal year:
    2023
  • Amount:
    $440,000
  • Award type:
    Standard Grant
Collaborative Research: NSF-CSIRO: HCC: Small: Understanding Bias in AI Models for the Prediction of Infectious Disease Spread
  • Award number:
    2302969
  • Fiscal year:
    2023
  • Amount:
    $440,000
  • Award type:
    Standard Grant
Collaborative Research: HCC: Small: Understanding Online-to-Offline Sexual Violence through Data Donation from Users
  • Award number:
    2401775
  • Fiscal year:
    2023
  • Amount:
    $440,000
  • Award type:
    Standard Grant
Collaborative Research: SaTC: CORE: Small: Understanding and Taming Deterministic Model Bit Flip attacks in Deep Neural Networks
  • Award number:
    2342618
  • Fiscal year:
    2023
  • Amount:
    $440,000
  • Award type:
    Standard Grant
AF: Small: Understanding Expansion Phenomena: Graphical, Hypergraphical, Geometric, and Quantum
  • Award number:
    2326685
  • Fiscal year:
    2023
  • Amount:
    $440,000
  • Award type:
    Standard Grant