DC: Medium: Tackling and Understanding Intermediate Data in Cloud Applications as a First-Class Citizen

DC:中:作为一等公民处理和理解云应用程序中的中间数据

基本信息

  • 批准号:
    0964471
  • 负责人:
  • 金额:
    $ 60万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2010
  • 资助国家:
    美国
  • 起止时间:
    2010-07-01 至 2015-06-30
  • 项目状态:
    已结题

项目摘要

Cloud computing infrastructures involve thousands of servers, petabytes of storage, and hundreds of users running various applications that involve gigabytes to terabytes of data. This project focuses on intermediate data that is generated during the execution of parallelized dataflow programs in clouds. Such cloud intermediate data brings forth several unique characteristics: they are massive-scale, distributed, subjected to computational barriers, and prolong job run-times when subjected to server failures. Further, the size of intermediate data in a cloud application is often comparable to or larger than input or output data size, and it can thus range in terabytes. Thus, in spite of extensive existing work on traditional storage problems, there is a critical need for new algorithms and systems that target cloud intermediate data. This project is the first to treat cloud intermediate data as a first-class citizen. The project will involve new algorithm design and analysis, original systems building and implementation, deployment in real world testbeds, and performance of measurement studies. Concretely, this project will build a new system that explicitly manages intermediate data in cloud dataflow programs in order to improve their fault-tolerance, and design and realize barrier relaxation strategies to improve performance of cloud programs. We will implement using open software, deploy, and experimentally evaluate our systems atop the NSF infrastructure called the Cloud Computing Testbed (CCT) that is hosted at the University of Illinois. Finally, we will perform measurement studies of workload characteristics of cloud intermediate data. A fuller understanding of intermediate data in clouds can spawn research in managing cloud infrastructures, improve run-time performance of cloud applications, and lead to new cloud programming paradigms. Our contributions will directly improve the performance and fault-tolerance of applications that are run on the community infrastructure CCT, and positively impact design and deployment of existing and emerging industry clouds. Our results will be published and released in open software and datasets.
云计算基础设施涉及数千台服务器、pb级存储和数百个运行各种应用程序的用户,这些应用程序涉及千兆字节到tb级的数据。这个项目关注的是在云中并行数据流程序执行过程中产生的中间数据。这种云中间数据带来了几个独特的特征:它们是大规模的、分布式的、受到计算障碍的影响,并且在发生服务器故障时延长作业运行时间。此外,云应用程序中中间数据的大小通常与输入或输出数据大小相当或更大,因此可以以tb为单位。因此,尽管在传统存储问题上有大量的现有工作,但迫切需要针对云中间数据的新算法和系统。该项目是第一个将云中间数据视为一等公民的项目。该项目将涉及新的算法设计和分析,原始系统的构建和实现,在现实世界的测试平台中的部署,以及测量研究的性能。具体而言,本项目将构建一个新的系统,明确管理云数据流程序中的中间数据,以提高其容错性,并设计和实现屏障放松策略,以提高云程序的性能。我们将使用开放软件实现、部署和实验评估我们的系统,这些系统位于美国国家科学基金会(NSF)的基础设施之上,该基础设施被称为云计算测试平台(CCT),托管在伊利诺伊大学。最后,我们将对云中间数据的工作负载特征进行测量研究。更全面地理解云中的中间数据可以催生管理云基础设施的研究,提高云应用程序的运行时性能,并产生新的云编程范例。我们的贡献将直接提高在社区基础设施CCT上运行的应用程序的性能和容错性,并对现有和新兴行业云的设计和部署产生积极影响。我们的研究结果将在开放软件和数据集中发表和发布。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Indranil Gupta其他文献

ContagAlert: Using Contagion Theory for Adaptive, Distributed Alert Propagation
ContagAlert:使用传染理论进行自适应分布式警报传播
The design of novel distributed protocols from differential equations
  • DOI:
    10.1007/s00446-007-0024-2
  • 发表时间:
    2007-05-15
  • 期刊:
  • 影响因子:
    2.100
  • 作者:
    Indranil Gupta;Mahvesh Nagda;Christo Frank Devaraj
  • 通讯作者:
    Christo Frank Devaraj
Kaizen: Building a Performant Blockchain System Verified for Consensus and Integrity
Kaizen:构建经过共识和完整性验证的高性能区块链系统
Efficient on-demand operations in dynamic distributed infrastructures
动态分布式基础设施中的高效按需操作
Moara: Flexible and Scalable Group-Based Querying System
Moara:灵活且可扩展的基于组的查询系统

Indranil Gupta的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Indranil Gupta', 18)}}的其他基金

CNS Core: Small: GoT -- Groups of Things Abstractions for Distributed IoT
CNS 核心:小型:GoT——分布式物联网的物联网抽象组
  • 批准号:
    1908888
  • 财政年份:
    2019
  • 资助金额:
    $ 60万
  • 项目类别:
    Standard Grant
CSR: Medium: Availability-Consistency Tradeoffs in Key-Value and NoSQL Storage Systems
CSR:中:键值和 NoSQL 存储系统的可用性一致性权衡
  • 批准号:
    1409416
  • 财政年份:
    2014
  • 资助金额:
    $ 60万
  • 项目类别:
    Standard Grant
CSR: Small: Online Global Reconfigurations in Key-Value and NoSQL Cloud Storage Systems
CSR:小型:键值和 NoSQL 云存储系统中的在线全局重新配置
  • 批准号:
    1319527
  • 财政年份:
    2013
  • 资助金额:
    $ 60万
  • 项目类别:
    Standard Grant
CAREER: Systematic Design of Distributed Protocols - from Methodologies and Toolkits to Systems
职业:分布式协议的系统设计 - 从方法论和工具包到系统
  • 批准号:
    0448246
  • 财政年份:
    2005
  • 资助金额:
    $ 60万
  • 项目类别:
    Standard Grant

相似海外基金

Collaborative Research: CyberTraining: Implementation: Medium: Training Users, Developers, and Instructors at the Chemistry/Physics/Materials Science Interface
协作研究:网络培训:实施:媒介:在化学/物理/材料科学界面培训用户、开发人员和讲师
  • 批准号:
    2321102
  • 财政年份:
    2024
  • 资助金额:
    $ 60万
  • 项目类别:
    Standard Grant
RII Track-4:@NASA: Bluer and Hotter: From Ultraviolet to X-ray Diagnostics of the Circumgalactic Medium
RII Track-4:@NASA:更蓝更热:从紫外到 X 射线对环绕银河系介质的诊断
  • 批准号:
    2327438
  • 财政年份:
    2024
  • 资助金额:
    $ 60万
  • 项目类别:
    Standard Grant
Collaborative Research: Topological Defects and Dynamic Motion of Symmetry-breaking Tadpole Particles in Liquid Crystal Medium
合作研究:液晶介质中对称破缺蝌蚪粒子的拓扑缺陷与动态运动
  • 批准号:
    2344489
  • 财政年份:
    2024
  • 资助金额:
    $ 60万
  • 项目类别:
    Standard Grant
Collaborative Research: AF: Medium: The Communication Cost of Distributed Computation
合作研究:AF:媒介:分布式计算的通信成本
  • 批准号:
    2402836
  • 财政年份:
    2024
  • 资助金额:
    $ 60万
  • 项目类别:
    Continuing Grant
Collaborative Research: AF: Medium: Foundations of Oblivious Reconfigurable Networks
合作研究:AF:媒介:遗忘可重构网络的基础
  • 批准号:
    2402851
  • 财政年份:
    2024
  • 资助金额:
    $ 60万
  • 项目类别:
    Continuing Grant
Collaborative Research: CIF: Medium: Snapshot Computational Imaging with Metaoptics
合作研究:CIF:Medium:Metaoptics 快照计算成像
  • 批准号:
    2403122
  • 财政年份:
    2024
  • 资助金额:
    $ 60万
  • 项目类别:
    Standard Grant
Collaborative Research: SHF: Medium: Differentiable Hardware Synthesis
合作研究:SHF:媒介:可微分硬件合成
  • 批准号:
    2403134
  • 财政年份:
    2024
  • 资助金额:
    $ 60万
  • 项目类别:
    Standard Grant
Collaborative Research: SHF: Medium: Enabling Graphics Processing Unit Performance Simulation for Large-Scale Workloads with Lightweight Simulation Methods
合作研究:SHF:中:通过轻量级仿真方法实现大规模工作负载的图形处理单元性能仿真
  • 批准号:
    2402804
  • 财政年份:
    2024
  • 资助金额:
    $ 60万
  • 项目类别:
    Standard Grant
Collaborative Research: CIF-Medium: Privacy-preserving Machine Learning on Graphs
合作研究:CIF-Medium:图上的隐私保护机器学习
  • 批准号:
    2402815
  • 财政年份:
    2024
  • 资助金额:
    $ 60万
  • 项目类别:
    Standard Grant
Collaborative Research: SHF: Medium: Tiny Chiplets for Big AI: A Reconfigurable-On-Package System
合作研究:SHF:中:用于大人工智能的微型芯片:可重新配置的封装系统
  • 批准号:
    2403408
  • 财政年份:
    2024
  • 资助金额:
    $ 60万
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了