CAREER: Hardware Error Resilient Virtualization Infrastructure


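Basic information

  • Award number:
    1350766
  • Principal investigator:
  • Amount:
    $318,700
  • Host institution:
  • Institution country:
    United States
  • Award type:
    Continuing Grant
  • Fiscal year:
    2014
  • Funding country:
    United States
  • Period:
    2014-01-15 to 2019-12-31
  • Status:
    Completed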

Project abstract

Cloud data centers have become a cyber-infrastructure vital to our society and economy, yet the virtualization infrastructure faces many reliability challenges. In a cloud, the virtualization infrastructure presents an abstraction layer on top of commodity hardware components and manages the execution of guest virtual machines (VMs). However, commodity computer systems are susceptible to hardware errors and are expected to experience high error rates in the near future. Notably, a hardware error in a virtualized system can potentially result in a system crash, breaking the assumption of VM fault isolation. This reliability risk is further compounded by the massive scale of data centers hosting hundreds of thousands of servers, as well as the pursuit of aggressive server consolidation that aims to host hundreds of VMs on each server for high resource utilization. Traditional VM fault-tolerant solutions are inadequate to address this challenge.

This research proposes a hardware error resilient virtualization infrastructure that provides high-performance virtualization during error-free execution while offering strong protection against hardware errors. This research will advance our understanding of hardware error behaviors in virtualized environments and develop virtualization-aware techniques that provide robust error detection, recovery, and protection. The proposed hardware error resilient virtualization infrastructure will offer a low-cost full-system solution by taking advantage of the characteristics of virtualized systems and providing resource management mechanisms for balanced performance and reliability.

The proposed virtualization infrastructure, if successful, will bring substantial benefits to cloud providers in delivering reliable services to millions of users. Education will also be an integral part of the project. This project will develop educational materials for the undergraduate and graduate curriculum, recruit and mentor under-represented students, and carry out a number of unique outreach activities that benefit K-12, undergraduate, and graduate students.

Project outcomes

Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)

Other publications by H. Howie Huang

SynCSE: syntax graph-based contrastive learning of sentence embeddings
  • DOI:
    10.1016/j.eswa.2025.128047
  • Published:
    2025-08-25
  • Journal:
  • Impact factor:
    7.500
  • Authors:
    Yejin Kim;Dongsuk Oh;H. Howie Huang
  • Corresponding author:
    H. Howie Huang
Coordinated link sharing on Facebook
  • DOI:
    10.1038/s41598-025-00233-w
  • Published:
    2025-05-05
  • Journal:
  • Impact factor:
    3.900
  • Authors:
    Yunkang Yang;Ramesh Paudel;Jordan McShan;Matthew Hindman;H. Howie Huang;David Broniatowski
  • Corresponding author:
    David Broniatowski
A control-theoretic approach to automated local policy enforcement in computational grids
  • DOI:
    10.1016/j.future.2010.02.012
  • Published:
    2010-06-01
  • Journal:
  • Impact factor:
  • Authors:
    H. Howie Huang
  • Corresponding author:
    H. Howie Huang
Achieving high job execution reliability using underutilized resources in a computational economy
Maui: Black-Box Edge Privacy Attack on Graph Neural Networks

Other grants by H. Howie Huang

SHF: Small: Towards High-Performance Machine Learning on Graphs
  • Award number:
    2127207
  • Fiscal year:
    2021
  • Funding amount:
    $318,700
  • Award type:
    Standard Grant
Aspiring Computer Systems Research (CSR) PIs Workshop
  • Award number:
    1828838
  • Fiscal year:
    2018
  • Funding amount:
    $318,700
  • Award type:
    Standard Grant
CSR: Small: IO-Efficient Computer System for Graph Analytics
  • Award number:
    1717774
  • Fiscal year:
    2017
  • Funding amount:
    $318,700
  • Award type:
    Standard Grant
SHF: Small: Accelerating Graph Traversal on GPUs
  • Award number:
    1618706
  • Fiscal year:
    2016
  • Funding amount:
    $318,700
  • Award type:
    Standard Grant
NSF CISE CAREER Proposal Writing Workshop 2015
  • Award number:
    1520809
  • Fiscal year:
    2015
  • Funding amount:
    $318,700
  • Award type:
    Standard Grant
CDI Type-II: Collaborative Research: From Ion Channels to Blood Flow and Heart Sounds: A New Paradigm in Cyber-Enabled Multiphysical Analysis of Heart Function
  • Award number:
    1124813
  • Fiscal year:
    2011
  • Funding amount:
    $318,700
  • Award type:
    Standard Grant
Collaborative: Balanced Scalable Architectures for Data-Intensive Supercomputing
  • Award number:
    0937875
  • Fiscal year:
    2009
  • Funding amount:
    $318,700
  • Award type:
    Standard Grant

Similar international grants

SWIFT-SAT: Unlimited Radio Interferometry: A Hardware-Algorithm Co-Design Approach to RAS-Satellite Coexistence
  • Award number:
    2332534
  • Fiscal year:
    2024
  • Funding amount:
    $318,700
  • Award type:
    Standard Grant
CRII: SaTC: Reliable Hardware Architectures Against Side-Channel Attacks for Post-Quantum Cryptographic Algorithms
  • Award number:
    2348261
  • Fiscal year:
    2024
  • Funding amount:
    $318,700
  • Award type:
    Standard Grant
Reversible Computing and Reservoir Computing with Magnetic Skyrmions for Energy-Efficient Boolean Logic and Artificial Intelligence Hardware
  • Award number:
    2343607
  • Fiscal year:
    2024
  • Funding amount:
    $318,700
  • Award type:
    Standard Grant
Collaborative Research: SHF: Medium: Differentiable Hardware Synthesis
  • Award number:
    2403134
  • Fiscal year:
    2024
  • Funding amount:
    $318,700
  • Award type:
    Standard Grant
CAREER: Data-Driven Hardware and Software Techniques to Enable Sustainable Data Center Services
  • Award number:
    2340042
  • Fiscal year:
    2024
  • Funding amount:
    $318,700
  • Award type:
    Continuing Grant
SHF: Small: Taming Huge Page Problems for Memory Bulk Operations Using a Hardware/Software Co-Design Approach
  • Award number:
    2400014
  • Fiscal year:
    2024
  • Funding amount:
    $318,700
  • Award type:
    Standard Grant
Collaborative Research: Reversible Computing and Reservoir Computing with Magnetic Skyrmions for Energy-Efficient Boolean Logic and Artificial Intelligence Hardware
  • Award number:
    2343606
  • Fiscal year:
    2024
  • Funding amount:
    $318,700
  • Award type:
    Standard Grant
SHF: Small: QED - A New Approach to Scalable Verification of Hardware Memory Consistency
  • Award number:
    2332891
  • Fiscal year:
    2024
  • Funding amount:
    $318,700
  • Award type:
    Standard Grant
SHF: Small: Hardware-Software Co-design for Privacy Protection on Deep Learning-based Recommendation Systems
  • Award number:
    2334628
  • Fiscal year:
    2024
  • Funding amount:
    $318,700
  • Award type:
    Standard Grant
CAREER: Toward Power Delivery Network-aware Hardware Security
  • Award number:
    2338069
  • Fiscal year:
    2024
  • Funding amount:
    $318,700
  • Award type:
    Continuing Grant