CAREER: Deadline-Ordered Multicast: A Network Primitive for Cloud-Native Distributed Systems
职业:按截止日期排序的多播:云原生分布式系统的网络原语
基本信息
- 批准号:2340748
- 负责人:
- 金额:$ 80.12万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Continuing Grant
- 财政年份:2024
- 资助国家:美国
- 起止时间:2024-09-01 至 2029-08-31
- 项目状态:未结题
- 来源:
- 关键词:
项目摘要
Computer systems spread out over multiple machines---also known as distributed systems---power most of today's digital world and the vast majority of web applications. Such digital systems include storage systems and search engine as well as web services. Spreading out digital systems across machines has multiple benefits including (1) the ability to survive machine failures by moving computation to a different machine and (2) the ability to leverage the capabilities of multiple machines to finish computations faster. To realize these benefits, the machines within a distributed system must coordinate amongst themselves to ensure the result is identical to an idealised illusion of executing the computation on a single machine. This project will make coordination significantly faster by developing a new abstraction, deadline-ordered multicast (DOM), which combines: (1) Multicast: The ability to simultaneously transmit a message from a sender to multiple receivers and (2) Synchronized clocks: Technology to ensure that, at any instant, clocks on various machines display the same value of time and that these clocks progress in lock step with each other. The project will demonstrate how DOM can be used to accelerate widely used computer systems, such as distributed databases, coordination services, and blockchains. The project will also train both undergraduate and graduate students using a variety of approaches to understand the benefits of synchronized clocks when thinking about distributed systems. Distributed systems are at the heart of computing today and include widely used systems such as distributed databases, fault-tolerant key-value stores, and distributed ledgers. At the core of these systems are distributed protocols such as crash-fault-tolerant consensus, concurrency control, and Byzantine-fault-tolerant consensus. Much effort has been expended on improving the performance of distributed protocols over the years including recent efforts that leverage richer network services to improve distributed protocols. Such services include switch multicast, programmable switches, programmable network-interface cards, in-network priority queues, and control over routing. However, many distributed systems today are deployed by cloud tenants, who have no access to such rich network services, making it impossible for them to benefit from the improved protocol performance enabled by such rich network services.This project will develop a new network primitive called deadline-ordered multicast (DOM) that will make it easier to construct high-performance distributed protocols on the public cloud. DOM leverages 2 key techniques: (1) the recent availability of synchronized clocks as a service, which now permit tightly synchronized clocks in the public cloud and (2) multicast, which struggled to find traction in the Internet, but is ideal for the one-sender-multi-receiver communication at the heart of distributed systems. DOM delivers a multicast message from a sender to multiple receivers at or after a message's deadline and delivers multiple messages in deadline order. DOM provides a consistent order (the order of deadlines) in which different receivers process a set of messages, accelerating several distributed protocols in the process. This proposal will develop the DOM abstraction, design an optimized DOM service, and prototype several applications demonstrating DOM's value.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
分布在多台机器上的计算机系统-也称为分布式系统-为当今的大多数数字世界和绝大多数Web应用程序提供动力。 这样的数字系统包括存储系统和搜索引擎以及web服务。将数字系统扩展到机器上有多种好处,包括:(1)通过将计算转移到不同的机器上来承受机器故障的能力,以及(2)利用多台机器的能力来更快地完成计算的能力。为了实现这些好处,分布式系统中的机器必须相互协调,以确保结果与在单个机器上执行计算的理想化幻觉相同。该项目将通过开发一种新的抽象,截止日期有序多播(DOM),使协调速度大大加快,它结合了:(1)多播:同时将消息从发送者传输到多个接收者的能力和(2)同步时钟:确保在任何时刻,不同机器上的时钟显示相同的时间值,并且这些时钟彼此同步的技术。 该项目将演示如何使用DOM来加速广泛使用的计算机系统,例如分布式数据库,协调服务和区块链。该项目还将培训本科生和研究生使用各种方法来了解同步时钟的好处时,考虑分布式系统。分布式系统是当今计算的核心,包括广泛使用的系统,如分布式数据库,容错键值存储和分布式账本。这些系统的核心是分布式协议,如崩溃容错共识,并发控制和拜占庭容错共识。多年来,人们在提高分布式协议的性能方面付出了很多努力,包括最近利用更丰富的网络服务来改进分布式协议的努力。这些服务包括交换机多播、可编程交换机、可编程网络接口卡、网内优先级队列和路由控制。然而,目前很多分布式系统都是由云租户部署的,他们无法访问这些丰富的网络服务,因此无法从这些丰富的网络服务带来的协议性能提升中受益。本项目将开发一种新的网络原语,称为deadline-ordered multicast(DOM),它可以更容易地在公有云上构建高性能的分布式协议。DOM利用了两个关键技术:(1)最近同步时钟作为服务的可用性,现在允许公共云中的紧密同步时钟,以及(2)多播,这在互联网中很难找到牵引力,但对于分布式系统核心的一个发送者-多个接收者通信来说是理想的。DOM在消息的截止日期或截止日期之后将多播消息从发送方传递到多个接收方,并按截止日期顺序传递多个消息。DOM提供了一个一致的顺序(截止日期的顺序),不同的接收者按照这个顺序处理一组消息,从而加速了这个过程中的几个分布式协议。该提案将开发DOM抽象,设计优化的DOM服务,并对几个展示DOM价值的应用程序进行原型设计。该奖项反映了NSF的法定使命,并通过使用基金会的知识价值和更广泛的影响审查标准进行评估,被认为值得支持。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Anirudh Sivaraman Kaushalram其他文献
Designing fast and programmable routers
- DOI:
- 发表时间:
2017 - 期刊:
- 影响因子:0
- 作者:
Anirudh Sivaraman Kaushalram - 通讯作者:
Anirudh Sivaraman Kaushalram
MobiTest : an evaluation infrastructure for mobile distributed applications
MobiTest:移动分布式应用程序的评估基础架构
- DOI:
- 发表时间:
2012 - 期刊:
- 影响因子:0
- 作者:
Anirudh Sivaraman Kaushalram - 通讯作者:
Anirudh Sivaraman Kaushalram
Anirudh Sivaraman Kaushalram的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Anirudh Sivaraman Kaushalram', 18)}}的其他基金
CNS Core: Small: Autogenerating fast packet-processing code using program synthesis
CNS 核心:小型:使用程序合成自动生成快速数据包处理代码
- 批准号:
2008048 - 财政年份:2020
- 资助金额:
$ 80.12万 - 项目类别:
Standard Grant
NSF Workshop on Programmable Networks
NSF 可编程网络研讨会
- 批准号:
1833483 - 财政年份:2018
- 资助金额:
$ 80.12万 - 项目类别:
Standard Grant
相似海外基金
Development of Hard Real-Time Recognition Technologies With Deadline-driven AI Accelerators
利用截止日期驱动的人工智能加速器开发硬实时识别技术
- 批准号:
23H03477 - 财政年份:2023
- 资助金额:
$ 80.12万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
Submitted thesis 03/02/2021 - now has corrections - resubmission deadline Nov-2022
提交论文 03/02/2021 - 现在已更正 - 重新提交截止日期 2022 年 11 月
- 批准号:
2227769 - 财政年份:2016
- 资助金额:
$ 80.12万 - 项目类别:
Studentship
Development of Geographical Information Sharing Method with a Time Deadline between Sections in Local Government and Regional Communities under Disaster
地方政府部门和受灾地区社区之间限时地理信息共享方法的开发
- 批准号:
20300093 - 财政年份:2008
- 资助金额:
$ 80.12万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
Doctoral Dissertation Research: Working on Deadline: A Study of the Careers of Reporters and Editors at U. S. Daily Newspapers
博士论文研究:在截止日期前工作:美国日报记者和编辑职业研究
- 批准号:
0726712 - 财政年份:2007
- 资助金额:
$ 80.12万 - 项目类别:
Standard Grant
The Deadline Effect and Related Bargaining Phenomena: Some Experimental Studies
最后期限效应和相关的讨价还价现象:一些实验研究
- 批准号:
8815566 - 财政年份:1988
- 资助金额:
$ 80.12万 - 项目类别:
Standard Grant