Eager: Collaborative Research: DiRecMR: Reconciling the Dichotomy of MapReduce for Efficient Speculation and Resilience

Eager:协作研究:DiRecMR:调和 MapReduce 的二分法以实现高效推测和弹性

基本信息

  • 批准号:
    1744336
  • 负责人:
  • 金额:
    $ 8万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2017
  • 资助国家:
    美国
  • 起止时间:
    2017-08-01 至 2018-12-31
  • 项目状态:
    已结题

项目摘要

MapReduce systems have great capabilities in processing large amounts of data and have become a research target for governmental, academic and industrial organizations. However, their task management and fault handling policies do not recognize a tacit dichotomy that exists between its inherent two phases (map and reduce). This results in a number of critical issues, such as resource underutilization, prolonged task execution, myopic speculation, and failure amplifications. This project adopts a transformative combination of theoretical analysis, simulation and modeling, and systems design and implementation approaches in order to reconcile the dichotomy of MapReduce. The techniques from this project are potentially impactful to all organizations that deploy MapReduce systems and support Big Data applications from business analytics, social networks, and scientific computing research.Instead of empirical analysis of system behaviors to pinpoint resource management and task scheduling abnormalities, this project takes a different perspective on MapReduce efficiency and resilience, and formulates a Markov chain for the transition of Hadoop MapReduce containers, and a fork-join model for the queueing of map and reduce tasks. These formulations facilitate a theoretical analysis of the dichotomy of MapReduce and help shed light on its impact to asymptotic behaviors of large-scale workloads. This project aims to blend simulation and real system development together, and addresses the myopic speculation caused by dichotomy, liberates the scope of task speculation, and ensures task resilience without failure amplifications. These techniques are developed to enhance MapReduce platforms such as YARN and Spark. Besides the target on MapReduce systems, the research from this project addresses a general issue in distributed analytics environments.
MapReduce系统具有处理大量数据的强大能力,已成为政府、学术和工业组织的研究目标。然而,他们的任务管理和故障处理策略并没有认识到其固有的两个阶段(map和reduce)之间存在的隐性二分法。 这导致了一些关键问题,如资源利用不足,延长任务执行,短视的投机,和失败放大。该项目采用了理论分析,模拟和建模,系统设计和实现方法的变革性组合,以调和MapReduce的二分法。该项目的技术对所有部署MapReduce系统并支持商业分析、社交网络和科学计算研究等大数据应用的组织都具有潜在的影响力。该项目从不同的角度看待MapReduce的效率和弹性,而不是通过实证分析系统行为来查明资源管理和任务调度异常,提出了Hadoop MapReduce容器迁移的马尔可夫链模型和映射与归约任务调度的fork-join模型。这些公式有助于对MapReduce的二分法进行理论分析,并有助于揭示其对大规模工作负载的渐近行为的影响。该项目旨在将仿真和真实的系统开发融合在一起,解决二分法导致的短视推测,解放任务推测的范围,并确保任务弹性而不放大故障。这些技术的开发是为了增强MapReduce平台,如YARN和Spark。除了MapReduce系统的目标之外,该项目的研究还解决了分布式分析环境中的一个普遍问题。

项目成果

期刊论文数量(9)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
SVAGC: Garbage Collection with a Scalable Virtual Address Swapping Technique
SHMEMGraph: Efficient and Balanced Graph Processing Using One-Sided Communication
Efficient User-Level Storage Disaggregation for Deep Learning
Accurate classification of depression through optimized machine learning models on high-dimensional noisy data
  • DOI:
    10.1016/j.bspc.2021.103237
  • 发表时间:
    2022-01
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Xingang Fang;Julia Klawohn;Alexander De Sabatino;Harsh Kundnani;Jon Ryan;Weikuan Yu;G. Hajcak
  • 通讯作者:
    Xingang Fang;Julia Klawohn;Alexander De Sabatino;Harsh Kundnani;Jon Ryan;Weikuan Yu;G. Hajcak
Exploration of memory hybridization for RDD caching in Spark
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Weikuan Yu其他文献

Performance Evaluation of FPGA-Based Biological Applications
基于 FPGA 的生物应用的性能评估
  • DOI:
  • 发表时间:
    2007
  • 期刊:
  • 影响因子:
    0
  • 作者:
    O. Storaasli;Weikuan Yu;D. Strenski;James Maltby
  • 通讯作者:
    James Maltby
Ad Hoc File Systems for High-Performance Computing
用于高性能计算的临时文件系统
  • DOI:
    10.1007/s11390-020-9801-1
  • 发表时间:
    2020
  • 期刊:
  • 影响因子:
    1.9
  • 作者:
    A. Brinkmann;K. Mohror;Weikuan Yu;P. Carns;Toni Cortes;S. Klasky;Alberto Miranda;F. Pfreundt;R. Ross;Marc
  • 通讯作者:
    Marc
JVM-Bypass for Efficient Hadoop Shuffling
用于高效 Hadoop Shuffle 的 JVM 旁路
Performance evaluation and tuning of BioPig for genomic analysis
BioPig 用于基因组分析的性能评估和调整
Understanding I/O Behavior in Scientific Workflows on High Performance Computing Systems
了解高性能计算系统上科学工作流程中的 I/O 行为
  • DOI:
  • 发表时间:
    2019
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Fahim Chowdhury;Francesco Di;A. Moody;Elsa Gonsiorowski;K. Mohror;Weikuan Yu
  • 通讯作者:
    Weikuan Yu

Weikuan Yu的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Weikuan Yu', 18)}}的其他基金

Collaborative Research: OAC Core: CropDL - Scheduling and Checkpoint/Restart Support for Deep Learning Applications on HPC Clusters
合作研究:OAC 核心:CropDL - HPC 集群上深度学习应用的调度和检查点/重启支持
  • 批准号:
    2403089
  • 财政年份:
    2024
  • 资助金额:
    $ 8万
  • 项目类别:
    Standard Grant
SaTC: CORE: Small: Realizing Enhanced Authentication in the Mobile Era
SaTC:核心:小:实现移动时代的增强认证
  • 批准号:
    2131143
  • 财政年份:
    2021
  • 资助金额:
    $ 8万
  • 项目类别:
    Standard Grant
IRES Track-1: I/O Research for Data-Intensive Analytics and Deep Learning
IRES Track-1:数据密集型分析和深度学习的 I/O 研究
  • 批准号:
    1952302
  • 财政年份:
    2020
  • 资助金额:
    $ 8万
  • 项目类别:
    Standard Grant
SHF: Medium: Collaborative Research: ECC: Ephemeral Coherence Cohort for I/O Containerization and Disaggregation
SHF:媒介:协作研究:ECC:I/O 容器化和分解的临时一致性队列
  • 批准号:
    1763547
  • 财政年份:
    2018
  • 资助金额:
    $ 8万
  • 项目类别:
    Continuing Grant
CRI: II-New: A Software Defined Infrastructure for Cross-Layer Research on Reconfigurable Architecture and Systems
CRI:II-New:用于可重构架构和系统跨层研究的软件定义基础设施
  • 批准号:
    1822737
  • 财政年份:
    2018
  • 资助金额:
    $ 8万
  • 项目类别:
    Standard Grant
CSR: Small: XooMR: Cross-Layer and Cross-Phase Cooperation for Fair and Efficient MapReduce
CSR:小:XooMR:跨层跨阶段合作实现公平高效的 MapReduce
  • 批准号:
    1564647
  • 财政年份:
    2015
  • 资助金额:
    $ 8万
  • 项目类别:
    Standard Grant
EAGER: Tadoop: A Dual-Purpose Framework Taming the Bipolarity of Storage and Communication for High-Performance Computing and Data Analytics
EAGER:Tadoop:一个双用途框架,克服存储和通信的两极性,实现高性能计算和数据分析
  • 批准号:
    1561041
  • 财政年份:
    2015
  • 资助金额:
    $ 8万
  • 项目类别:
    Standard Grant
EAGER: Tadoop: A Dual-Purpose Framework Taming the Bipolarity of Storage and Communication for High-Performance Computing and Data Analytics
EAGER:Tadoop:一个双用途框架,克服存储和通信的两极性,实现高性能计算和数据分析
  • 批准号:
    1432892
  • 财政年份:
    2014
  • 资助金额:
    $ 8万
  • 项目类别:
    Standard Grant
CSR: Small: XooMR: Cross-Layer and Cross-Phase Cooperation for Fair and Efficient MapReduce
CSR:小:XooMR:跨层跨阶段合作实现公平高效的 MapReduce
  • 批准号:
    1320016
  • 财政年份:
    2013
  • 资助金额:
    $ 8万
  • 项目类别:
    Standard Grant
II-New: A Compute and Storage Cluster for Multidisciplinary Research on Computer Systems and Scientific Simulations
II-New:用于计算机系统和科学模拟多学科研究的计算和​​存储集群
  • 批准号:
    1059376
  • 财政年份:
    2011
  • 资助金额:
    $ 8万
  • 项目类别:
    Standard Grant

相似海外基金

Collaborative Research: EAGER: IMPRESS-U: Groundwater Resilience Assessment through iNtegrated Data Exploration for Ukraine (GRANDE-U)
合作研究:EAGER:IMPRESS-U:通过乌克兰综合数据探索进行地下水恢复力评估 (GRANDE-U)
  • 批准号:
    2409395
  • 财政年份:
    2024
  • 资助金额:
    $ 8万
  • 项目类别:
    Standard Grant
EAGER/Collaborative Research: An LLM-Powered Framework for G-Code Comprehension and Retrieval
EAGER/协作研究:LLM 支持的 G 代码理解和检索框架
  • 批准号:
    2347624
  • 财政年份:
    2024
  • 资助金额:
    $ 8万
  • 项目类别:
    Standard Grant
EAGER/Collaborative Research: Revealing the Physical Mechanisms Underlying the Extraordinary Stability of Flying Insects
EAGER/合作研究:揭示飞行昆虫非凡稳定性的物理机制
  • 批准号:
    2344215
  • 财政年份:
    2024
  • 资助金额:
    $ 8万
  • 项目类别:
    Standard Grant
Collaborative Research: EAGER: Designing Nanomaterials to Reveal the Mechanism of Single Nanoparticle Photoemission Intermittency
合作研究:EAGER:设计纳米材料揭示单纳米粒子光电发射间歇性机制
  • 批准号:
    2345581
  • 财政年份:
    2024
  • 资助金额:
    $ 8万
  • 项目类别:
    Standard Grant
Collaborative Research: EAGER: Designing Nanomaterials to Reveal the Mechanism of Single Nanoparticle Photoemission Intermittency
合作研究:EAGER:设计纳米材料揭示单纳米粒子光电发射间歇性机制
  • 批准号:
    2345582
  • 财政年份:
    2024
  • 资助金额:
    $ 8万
  • 项目类别:
    Standard Grant
Collaborative Research: EAGER: Designing Nanomaterials to Reveal the Mechanism of Single Nanoparticle Photoemission Intermittency
合作研究:EAGER:设计纳米材料揭示单纳米粒子光电发射间歇性机制
  • 批准号:
    2345583
  • 财政年份:
    2024
  • 资助金额:
    $ 8万
  • 项目类别:
    Standard Grant
Collaborative Research: EAGER: The next crisis for coral reefs is how to study vanishing coral species; AUVs equipped with AI may be the only tool for the job
合作研究:EAGER:珊瑚礁的下一个危机是如何研究正在消失的珊瑚物种;
  • 批准号:
    2333604
  • 财政年份:
    2024
  • 资助金额:
    $ 8万
  • 项目类别:
    Standard Grant
Collaborative Research: EAGER: Energy for persistent sensing of carbon dioxide under near shore waves.
合作研究:EAGER:近岸波浪下持续感知二氧化碳的能量。
  • 批准号:
    2339062
  • 财政年份:
    2024
  • 资助金额:
    $ 8万
  • 项目类别:
    Standard Grant
Collaborative Research: EAGER: The next crisis for coral reefs is how to study vanishing coral species; AUVs equipped with AI may be the only tool for the job
合作研究:EAGER:珊瑚礁的下一个危机是如何研究正在消失的珊瑚物种;
  • 批准号:
    2333603
  • 财政年份:
    2024
  • 资助金额:
    $ 8万
  • 项目类别:
    Standard Grant
EAGER/Collaborative Research: An LLM-Powered Framework for G-Code Comprehension and Retrieval
EAGER/协作研究:LLM 支持的 G 代码理解和检索框架
  • 批准号:
    2347623
  • 财政年份:
    2024
  • 资助金额:
    $ 8万
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了