CIF: Small: Leveraging Coding Techniques for Distributed Computing
CIF:小型:利用编码技术进行分布式计算
基本信息
- 批准号:1910840
- 负责人:
- 金额:$ 49.23万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2019
- 资助国家:美国
- 起止时间:2019-10-01 至 2024-09-30
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Clusters of computer processors that process huge amounts of data at specialized locations called data centers are ubiquitous in both industry and academia. The usage of distributed clusters is a necessity rather than a luxury since many modern datasets are too large to be stored in the memory or disk of a single computer. However, using such clusters to obtain answers quickly and efficiently presents many new challenges. These include dealing with issues such as slow or failed processors (ie., worker nodes) and taking into account the time for these worker nodes to communicate among themselves for collaboratively executing a job. Such issues are critical, as it is well-recognized that for such large scale systems, worker node failures are the norm rather than the exception. The project will investigate classes of methods for the robust and efficient operation of large-scale distributed computing clusters. Furthermore, the project will train graduate and undergraduate students in data analytics and in using industry standard techniques for working with these clusters.The overarching goal of this project is to leverage coding-theoretic ideas to make distributed computation robust to stragglers (slow or failed worker nodes) and reduce the communication overhead of distributed computing paradigms such as MapReduce and Spark. While there has been some recent work on the topic of straggler mitigation for distributed matrix computations, the majority of prior work proceeds by treating stragglers exclusively as node failures. This project will investigate rigorous techniques for leveraging slow (but not failed) stragglers. In particular, the sequential nature of computation within a worker node will be taken into account when designing codes for our systems. The second part of the project will deal with issues around the numerical stability of recovery within distributed matrix computation. Several well-known erasure codes that have been proposed for this problem perform rather poorly on this metric. The project will design classes of codes that are useful in straggler mitigation and analyze them through the lens of numerical stability. The last part of the project will address the reduction of shuffle phase traffic in MapReduce-like systems that are used for executing jobs over distributed clusters. Prior work in this area proposes techniques that are information-theoretically optimal (under an appropriate model). A major assumption of prior work is that jobs can be split into arbitrarily small parts. However, in practical systems, this assumption severely limits the actual gain in the overall job execution time. This project will study a large class of techniques that reduce shuffle phase traffic and the overall job execution time by leveraging the properties of suitably defined linear block codes.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
在称为数据中心的专门位置处理大量数据的计算机处理器集群在工业界和学术界都无处不在。分布式集群的使用是必要的,而不是奢侈品,因为许多现代数据集太大,无法存储在单个计算机的内存或磁盘中。 然而,使用这样的集群来快速有效地获得答案提出了许多新的挑战。其中包括处理处理速度慢或出现故障的处理器(即,工作者节点),并考虑这些工作者节点在它们之间通信以协作地执行作业的时间。这样的问题是至关重要的,因为众所周知,对于这样的大规模系统,工作节点故障是正常的而不是例外。该项目将研究大型分布式计算集群的鲁棒和有效操作的方法。此外,该项目还将培训研究生和本科生数据分析和使用行业标准技术来处理这些集群。该项目的总体目标是利用编码理论思想使分布式计算对落后者(缓慢或失败的工作节点)具有鲁棒性,并减少分布式计算范例(如MapReduce和Spark)的通信开销。虽然最近已经有一些关于分散矩阵计算的离散缓解的主题的工作,但大多数先前的工作都是将离散者完全视为节点故障。这个项目将研究利用缓慢(但不是失败)的落伍者的严格技术。特别是,在为我们的系统设计代码时,将考虑工作节点内计算的顺序性质。该项目的第二部分将处理分布式矩阵计算中恢复的数值稳定性问题。几个著名的擦除码,已经提出了这个问题,执行这个指标相当差。该项目将设计的代码类,是有用的,在落伍者缓解和分析他们通过数值稳定性的透镜。该项目的最后一部分将解决在MapReduce类系统中减少shuffle阶段流量的问题,这些系统用于在分布式集群上执行作业。在此领域的先前工作提出了信息理论上最优的技术(在适当的模型下)。先前工作的一个主要假设是,作业可以分成任意小的部分。然而,在实际系统中,这种假设严重限制了总体作业执行时间的实际收益。该项目将研究一大类技术,通过利用适当定义的线性块码的特性来减少洗牌阶段的流量和总体作业执行时间。该奖项反映了NSF的法定使命,并通过使用基金会的智力价值和更广泛的影响审查标准进行评估,被认为值得支持。
项目成果
期刊论文数量(16)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Asynchronous Coded Caching With Uncoded Prefetching
- DOI:10.1109/tnet.2020.3003907
- 发表时间:2019-07
- 期刊:
- 影响因子:0
- 作者:H. Ghasemi;A. Ramamoorthy
- 通讯作者:H. Ghasemi;A. Ramamoorthy
Coded matrix computation with gradient coding
使用梯度编码的编码矩阵计算
- DOI:10.1109/isit54713.2023.10206996
- 发表时间:2023
- 期刊:
- 影响因子:0
- 作者:Son, Kyungrak;Ramamoorthy, Aditya
- 通讯作者:Ramamoorthy, Aditya
An Integrated Method to Deal with Partial Stragglers and Sparse Matrices in Distributed Computations
分布式计算中处理部分散乱矩阵和稀疏矩阵的综合方法
- DOI:10.1109/isit50566.2022.9834346
- 发表时间:2022
- 期刊:
- 影响因子:0
- 作者:Das, Anindya Bijoy;Ramamoorthy, Aditya
- 通讯作者:Ramamoorthy, Aditya
Distributed Matrix Computations with Low-weight Encodings
使用低权重编码的分布式矩阵计算
- DOI:10.1109/isit54713.2023.10206445
- 发表时间:2023
- 期刊:
- 影响因子:0
- 作者:Das, Anindya Bijoy;Ramamoorthy, Aditya;Love, David J.;Brinton, Christopher G.
- 通讯作者:Brinton, Christopher G.
A Unified Treatment of Partial Stragglers and Sparse Matrices in Coded Matrix Computation
编码矩阵计算中部分散乱矩阵和稀疏矩阵的统一处理
- DOI:10.1109/itw48936.2021.9611400
- 发表时间:2021
- 期刊:
- 影响因子:0
- 作者:Das, Anindya Bijoy;Ramamoorthy, Aditya
- 通讯作者:Ramamoorthy, Aditya
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Aditya Ramamoorthy其他文献
Minimum Cost Distributed Source Coding Over a Network
网络上的最低成本分布式源编码
- DOI:
10.1109/tit.2010.2090196 - 发表时间:
2007 - 期刊:
- 影响因子:2.5
- 作者:
Aditya Ramamoorthy - 通讯作者:
Aditya Ramamoorthy
Overlay protection against link failures using network coding
使用网络编码针对链路故障提供重叠保护
- DOI:
10.1109/ciss.2008.4558582 - 发表时间:
2008 - 期刊:
- 影响因子:0
- 作者:
A. Kamal;Aditya Ramamoorthy - 通讯作者:
Aditya Ramamoorthy
Degrees of freedom region for an interference network with general message demands
具有一般消息需求的干扰网络的自由度区域
- DOI:
10.1109/isit.2011.6034148 - 发表时间:
2011 - 期刊:
- 影响因子:0
- 作者:
Lei Ke;Aditya Ramamoorthy;Zhengdao Wang;H. Yin - 通讯作者:
H. Yin
Federated Over-Air Robust Subspace Tracking from Missing Data
针对缺失数据的联合空中稳健子空间跟踪
- DOI:
- 发表时间:
2022 - 期刊:
- 影响因子:0
- 作者:
Praneeth Narayanamurthy;Namrata Vaswani;Aditya Ramamoorthy - 通讯作者:
Aditya Ramamoorthy
Communicating the sum of sources over a network
- DOI:
10.1109/isit.2008.4595267 - 发表时间:
2008-04 - 期刊:
- 影响因子:0
- 作者:
Aditya Ramamoorthy - 通讯作者:
Aditya Ramamoorthy
Aditya Ramamoorthy的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Aditya Ramamoorthy', 18)}}的其他基金
CIF:Small:Towards practical coded caching
CIF:小:走向实用的编码缓存
- 批准号:
1718470 - 财政年份:2017
- 资助金额:
$ 49.23万 - 项目类别:
Standard Grant
CIF: Small: Distributed Storage Systems from Combinatorial Designs
CIF:小型:组合设计的分布式存储系统
- 批准号:
1320416 - 财政年份:2013
- 资助金额:
$ 49.23万 - 项目类别:
Standard Grant
CAREER: Joint Topographic Imaging and Materials Characterization using Atomic Force Microscopy - a Systems Approach
职业:使用原子力显微镜进行联合形貌成像和材料表征 - 一种系统方法
- 批准号:
1149860 - 财政年份:2012
- 资助金额:
$ 49.23万 - 项目类别:
Continuing Grant
CIF: Small: Collaborative Research: Signal processing for enabling high speed probe based nanoimaging
CIF:小型:协作研究:用于实现基于高速探针的纳米成像的信号处理
- 批准号:
1116322 - 财政年份:2011
- 资助金额:
$ 49.23万 - 项目类别:
Standard Grant
CIF: Small: An Algebraic Approach to Distributed Source Coding
CIF:小:分布式源编码的代数方法
- 批准号:
1018148 - 财政年份:2010
- 资助金额:
$ 49.23万 - 项目类别:
Standard Grant
Collaborative Research: Dynamic Mode High Density Probe Based Data Storage
协作研究:基于动态模式高密度探针的数据存储
- 批准号:
0802019 - 财政年份:2008
- 资助金额:
$ 49.23万 - 项目类别:
Continuing Grant
相似国自然基金
昼夜节律性small RNA在血斑形成时间推断中的法医学应用研究
- 批准号:
- 批准年份:2024
- 资助金额:0.0 万元
- 项目类别:省市级项目
tRNA-derived small RNA上调YBX1/CCL5通路参与硼替佐米诱导慢性疼痛的机制研究
- 批准号:n/a
- 批准年份:2022
- 资助金额:10.0 万元
- 项目类别:省市级项目
Small RNA调控I-F型CRISPR-Cas适应性免疫性的应答及分子机制
- 批准号:32000033
- 批准年份:2020
- 资助金额:24.0 万元
- 项目类别:青年科学基金项目
Small RNAs调控解淀粉芽胞杆菌FZB42生防功能的机制研究
- 批准号:31972324
- 批准年份:2019
- 资助金额:58.0 万元
- 项目类别:面上项目
变异链球菌small RNAs连接LuxS密度感应与生物膜形成的机制研究
- 批准号:81900988
- 批准年份:2019
- 资助金额:21.0 万元
- 项目类别:青年科学基金项目
基于small RNA 测序技术解析鸽分泌鸽乳的分子机制
- 批准号:31802058
- 批准年份:2018
- 资助金额:26.0 万元
- 项目类别:青年科学基金项目
肠道细菌关键small RNAs在克罗恩病发生发展中的功能和作用机制
- 批准号:31870821
- 批准年份:2018
- 资助金额:56.0 万元
- 项目类别:面上项目
Small RNA介导的DNA甲基化调控的水稻草矮病毒致病机制
- 批准号:31772128
- 批准年份:2017
- 资助金额:60.0 万元
- 项目类别:面上项目
基于small RNA-seq的针灸治疗桥本甲状腺炎的免疫调控机制研究
- 批准号:81704176
- 批准年份:2017
- 资助金额:20.0 万元
- 项目类别:青年科学基金项目
水稻OsSGS3与OsHEN1调控small RNAs合成及其对抗病性的调节
- 批准号:91640114
- 批准年份:2016
- 资助金额:85.0 万元
- 项目类别:重大研究计划
相似海外基金
CSR: Small: Leveraging Physical Side-Channels for Good
CSR:小:利用物理侧通道做好事
- 批准号:
2312089 - 财政年份:2024
- 资助金额:
$ 49.23万 - 项目类别:
Standard Grant
CNS Core: Small: Network Wide Sensing by Leveraging Cellular Communication Networks
CNS 核心:小型:利用蜂窝通信网络进行全网络传感
- 批准号:
2343469 - 财政年份:2024
- 资助金额:
$ 49.23万 - 项目类别:
Standard Grant
Collaborative Research: SaTC: CORE: Small: Understanding the Limitations of Wireless Network Security Designs Leveraging Wireless Properties: New Threats and Defenses in Practice
协作研究:SaTC:核心:小型:了解利用无线特性的无线网络安全设计的局限性:实践中的新威胁和防御
- 批准号:
2316720 - 财政年份:2023
- 资助金额:
$ 49.23万 - 项目类别:
Standard Grant
Small Things First: Leveraging Implementation Science to Increase Access to Infant Directed Speech for ALL Infants in Neonatal Intensive Care Units
小事优先:利用实施科学增加新生儿重症监护病房所有婴儿获得婴儿定向语音的机会
- 批准号:
10570336 - 财政年份:2023
- 资助金额:
$ 49.23万 - 项目类别:
Collaborative Research: SaTC: CORE: Small: Understanding the Limitations of Wireless Network Security Designs Leveraging Wireless Properties: New Threats and Defenses in Practice
协作研究:SaTC:核心:小型:了解利用无线特性的无线网络安全设计的局限性:实践中的新威胁和防御
- 批准号:
2316719 - 财政年份:2023
- 资助金额:
$ 49.23万 - 项目类别:
Standard Grant
SaTC: CORE: Small: Regulating and Leveraging Types for Security
SaTC:核心:小型:监管和利用安全类型
- 批准号:
2247434 - 财政年份:2023
- 资助金额:
$ 49.23万 - 项目类别:
Continuing Grant
Leveraging Technology: Providing a Comprehensive, Active Learning, Online Support Network for STEM Students Attending a Small, Rural, and Remote Community College
利用技术:为就读小型、农村和偏远社区学院的 STEM 学生提供全面、主动学习的在线支持网络
- 批准号:
2130277 - 财政年份:2022
- 资助金额:
$ 49.23万 - 项目类别:
Standard Grant
The impact of small impoundments and connectivity loss on fishdistribution and abundance in river ecosystems: leveraging cutting edge geospatial model
小蓄水池和连通性丧失对河流生态系统鱼类分布和丰度的影响:利用尖端地理空间模型
- 批准号:
2746458 - 财政年份:2022
- 资助金额:
$ 49.23万 - 项目类别:
Studentship
SaTC: CORE: Small: Collaborative: Leveraging community oversight to enhance collective efficacy for privacy and security
SaTC:核心:小型:协作:利用社区监督来提高隐私和安全的集体效力
- 批准号:
2326901 - 财政年份:2022
- 资助金额:
$ 49.23万 - 项目类别:
Standard Grant
CNS Core: Small: Leveraging Hardware Counters to Improve the Performance and Energy Efficiency of Mobile Apps
CNS 核心:小型:利用硬件计数器提高移动应用程序的性能和能源效率
- 批准号:
2149533 - 财政年份:2022
- 资助金额:
$ 49.23万 - 项目类别:
Standard Grant