CAREER: DrCloud: Drill-Ready Cloud Computing
职业:DrCloud:可练习的云计算
基本信息
- 批准号:1350499
- 负责人:
- 金额:$ 27.99万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Continuing Grant
- 财政年份:2014
- 资助国家:美国
- 起止时间:2014-05-01 至 2020-04-30
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Cloud computing is pervasive, but cloud service outages still take place. This proposal addresses how to ensure that failure recovery will work robustly in many deployment scenarios. To address this important question, this project proposes drill-ready cloud computing (DrCloud), a new dependability paradigm that advocates cloud systems to routinely perform "failure drills" in real deployments (i.e., deliberately schedule real failures rather than waiting for unexpected real failures to happen). This practice can unearth in-production recovery issues and prevent real outages. This project will create five building blocks of drill-ready cloud computing: methodology, safety, efficiency, usability, and generality. Specifically, these five sub-projects will substantiate a new research methodology via a formal study of hundreds of in-production recovery issues, devise mechanisms that guarantee safety (no data loss and performance disruptions) analogous to a proper fire drill preparation, develop techniques that maximize resource and monetary efficiency of drill execution, design a specification language and its runtime that simplifies drill usability, and finally boost drill generality beyond failure drills (e.g., supporting software upgrade and configuration change drills). The DrCloud project will enrich decades of research and literature in fault-tolerant computing. The project will also bring many direct benefits to the society; users from many areas increasingly use large-scale storage and computation services, depending on high availability and predictability that drill-ready cloud computing will facilitate. The project will also involve state-of-the-art scale-out cloud systems (Hadoop, Cassandra, HBase, etc.). Adding drill-readiness to these systems will provide prototypes of next-generation reliable systems.
云计算无处不在,但云服务中断仍然发生。 该建议解决了如何确保故障恢复在许多部署场景中稳健地工作。 为了解决这一重要问题,该项目提出了可钻云计算(DrCloud),这是一种新的可靠性范例,提倡云系统在真实的部署中定期执行“故障钻”(即,故意安排真实的故障而不是等待意外的真实的故障发生)。 这种做法可以发现生产中的恢复问题,并防止真实的停机。该项目将创建可钻云计算的五个构建块:方法论、安全性、效率、可用性和通用性。具体而言,这五个子项目将通过对数百个生产恢复问题的正式研究,(没有数据丢失和性能中断)类似于适当的消防演习准备,开发使演习执行的资源和货币效率最大化的技术,设计简化演习可用性的规范语言及其运行时,并且最终提高了钻孔的通用性超过了故障钻孔(例如,支持软件升级和配置更改演练)。DrCloud项目将丰富容错计算领域数十年的研究和文献。 该项目还将为社会带来许多直接利益;来自许多领域的用户越来越多地使用大规模存储和计算服务,这取决于可钻式云计算将促进的高可用性和可预测性。 该项目还将涉及最先进的横向扩展云系统(Hadoop,Cassandra,HBase等)。 在这些系统中增加演练准备将提供下一代可靠系统的原型。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Haryadi Gunawi其他文献
Haryadi Gunawi的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Haryadi Gunawi', 18)}}的其他基金
Collaborative Research: PPoSS: LARGE: ScaleStuds: Foundations for Correctness Checkability and Performance Predictability of Systems at Scale
合作研究:PPoSS:大型:ScaleStuds:大规模系统正确性可检查性和性能可预测性的基础
- 批准号:
2119184 - 财政年份:2021
- 资助金额:
$ 27.99万 - 项目类别:
Continuing Grant
PPoSS: Planning: CP2: Towards Systems Correctness Checkability and Performance Predictability at Scale
PPoSS:规划:CP2:实现大规模系统正确性可检查性和性能可预测性
- 批准号:
2028427 - 财政年份:2020
- 资助金额:
$ 27.99万 - 项目类别:
Standard Grant
USENIX FAST 2017 NSF Student Travel Support
USENIX FAST 2017 NSF 学生旅行支持
- 批准号:
1727380 - 财政年份:2017
- 资助金额:
$ 27.99万 - 项目类别:
Standard Grant
CSR: Medium:Combating Distributed Concurrency Bugs in Cloud Systems
CSR:中:对抗云系统中的分布式并发错误
- 批准号:
1563956 - 财政年份:2016
- 资助金额:
$ 27.99万 - 项目类别:
Continuing Grant
CSR: Small: BreezeFS: File System Transformation for Cloud and Multistore Era
CSR:小型:BreezeFS:云和多存储时代的文件系统转型
- 批准号:
1526304 - 财政年份:2015
- 资助金额:
$ 27.99万 - 项目类别:
Standard Grant
XPS:CLCCA:LigHTS: Lagging-Hardware Tolerant Systems" in the system.
系统中的“XPS:CLCCA:LigHTS:滞后硬件容忍系统”。
- 批准号:
1336580 - 财政年份:2013
- 资助金额:
$ 27.99万 - 项目类别:
Standard Grant
DC: Small: Collaborative Research: DARE: Declarative and Scalable Recovery
DC:小型:协作研究:DARE:声明式和可扩展的恢复
- 批准号:
1321958 - 财政年份:2012
- 资助金额:
$ 27.99万 - 项目类别:
Standard Grant
DC: Small: Collaborative Research: DARE: Declarative and Scalable Recovery
DC:小型:协作研究:DARE:声明式和可扩展的恢复
- 批准号:
1016924 - 财政年份:2010
- 资助金额:
$ 27.99万 - 项目类别:
Standard Grant