Collaborative Research: FMitF: Track I: Game Theoretic Updates for Network and Cloud Functions
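Basic information
- Award number: 2052696
- Principal investigator:
- Amount: $295,000
- Host institution:
- Host institution country: United States
- Project type: Standard Grant
- Fiscal year: 2020
- Funding country: United States
- Project period: 2020-10-01 to 2024-09-30
- Project status: Completed
- Source:
- Keywords:
Project abstract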
Updates are common in cloud-computing networks, and they occur for many reasons. Some network updates are planned while others are unplanned and automated. Since network updates can take seconds or minutes to complete, and cloud-computing networks must be "always on", updates must be efficient and transparent. Researchers have proposed various abstractions for network updating that leverage advances in formal methods to synthesize update plans and protocols, ensuring that the system remains well-behaved during an ongoing update. However, despite several high-profile cases of network updates gone wrong, operators continue to use relatively naive approaches. We investigate key shortcomings of prior work on update abstractions that limit their utility and widespread use in practice, and develop a new abstraction that addresses the heterogeneity, scale, and dynamic nature of real-world updates. The project's novelties are (1) a new game-theoretic foundation for network updates, (2) algorithms for synthesizing update controllers that are robust to failures and changing conditions during the update, (3) algorithms for explaining update failures, (4) a language design that allows synthesized controllers to be safely modified, and (5) implementations and evaluations of these mechanisms for virtual network functions and serverless-computing platforms. The project provides network operators with tools that make updates to networked systems easier, safer, and more reliable, and develops a framework that makes datacenter computing more reliable and secure.
Some specific key shortcomings of previous work on network updates are the following. (1) They assume that the network behaves predictably during the update. However, at scale, network demands and concurrent updates can cause unpredictable or even adversarial behavior in response to the update. (2) They have limited explanatory power when an update plan cannot be found or cannot be completed. (3) They make it hard for operators to choose between alternative update plans. This project consists of a comprehensive research plan to address these shortcomings. The key technical innovation is a formulation of updates as the search for a winning strategy in a two-player game, between the operator (or control plane) and the network. This formulation allows a uniform modeling of key elements, including hardware and software failures, variations in demand, and the addition and removal of network elements. To produce updates that are robust to changing conditions and failures, this work uses program-synthesis techniques to automatically generate an update controller that corresponds to a winning strategy in the game. To help operators when fatal errors occur, the project develops algorithms that exploit this game-theoretic formulation to explain the root cause of update failures and present alternatives. Finally, to give operators more control over updates, the investigators develop approaches for synthesizing update controllers that are interpretable and modifiable. The game-theoretic formulation is applicable to several kinds of networked systems, and the project will instantiate and evaluate our tools for platforms that implement virtual network functions and serverless functions.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
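The abstract's central idea, treating an update as a two-player game between the operator and the network, can be illustrated with a small sketch. The code below is not the project's algorithm; it is a textbook fixed-point solver for a turn-based safety game on a finite graph, and every state name in the toy example (`old`, `a_pending`, `blackhole`, and so on) is a hypothetical placeholder. It assumes each state is owned by exactly one player and that the operator wins by never entering an unsafe state.

```python
from typing import Dict, List, Set, Tuple

OPERATOR, NETWORK = "operator", "network"


def solve_safety_game(
    owner: Dict[str, str],
    edges: Dict[str, List[str]],
    unsafe: Set[str],
) -> Tuple[Set[str], Dict[str, str]]:
    """Return the operator's winning states and one safe move per operator state."""
    # Grow the set of states from which the network can force an unsafe state.
    losing = set(unsafe)
    changed = True
    while changed:
        changed = False
        for state, successors in edges.items():
            if state in losing:
                continue
            if owner[state] == NETWORK:
                # The network needs only one move into the losing set.
                forced = any(s in losing for s in successors)
            else:
                # The operator loses only if every move is losing
                # (an operator state with no moves counts as losing).
                forced = all(s in losing for s in successors)
            if forced:
                losing.add(state)
                changed = True
    winning = set(edges) - losing
    # A winning strategy picks any successor that stays out of the losing set.
    strategy = {
        s: next(t for t in edges[s] if t not in losing)
        for s in winning
        if owner[s] == OPERATOR
    }
    return winning, strategy


# Hypothetical toy update: switch A or switch B can be updated first.
owner = {
    "old": OPERATOR, "a_pending": NETWORK, "b_pending": NETWORK,
    "a_done": OPERATOR, "a_failed": OPERATOR, "new": OPERATOR,
    "blackhole": NETWORK,
}
edges = {
    "old": ["a_pending", "b_pending"],
    "a_pending": ["a_done", "a_failed"],  # network decides whether the step fails
    "b_pending": ["new", "blackhole"],    # network may drop traffic here
    "a_done": ["new"],
    "a_failed": ["old"],                  # operator rolls back
    "new": ["new"],
    "blackhole": [],
}
winning, strategy = solve_safety_game(owner, edges, unsafe={"blackhole"})
print(sorted(winning))
print(strategy["old"])
```

In this toy instance the solver steers the operator toward updating A first, because updating B first lets the network reach the unsafe state. The sketch checks safety only: it does not guarantee that the update eventually completes, which would require a richer winning condition than the one assumed here.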
Project outcomes
Journal articles (3)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Breaking the computation and communication abstraction barrier in distributed machine learning workloads
- DOI: 10.1145/3503222.3507778
- Publication date: 2021-05
- Journal:
- Impact factor: 0
- Authors: Abhinav Jangda; Jun Huang; Guodong Liu; Amir Hossein Nodehi Sabet; Saeed Maleki; Youshan Miao; M. Musuvathi-M.-Mu
- Corresponding author: Abhinav Jangda; Jun Huang; Guodong Liu; Amir Hossein Nodehi Sabet; Saeed Maleki; Youshan Miao; M. Musuvathi-M.-Mu
MultiPL-E: A Scalable and Polyglot Approach to Benchmarking Neural Code Generation
- DOI: 10.1109/tse.2023.3267446
- Publication date: 2023-07
- Journal:
- Impact factor: 7.4
- Authors: Federico Cassano; John Gouwar; Daniel Nguyen; S. Nguyen; Luna Phipps-Costin; Donald Pinckney; Ming-Ho Yee-M
- Corresponding author: Federico Cassano; John Gouwar; Daniel Nguyen; S. Nguyen; Luna Phipps-Costin; Donald Pinckney; Ming-Ho Yee-M
TacTok: semantics-aware proof synthesis
- DOI: 10.1145/3428299
- Publication date: 2020-11
- Journal:
- Impact factor: 0
- Authors: Yuriy Brun
- Corresponding author: Yuriy Brun
Other publications by Arjun Guha
The Fragile X Mental Retardation Protein protects the lung from xenobiotic stress by facilitating the Integrated Stress Response
- DOI:
- Publication date: 2021
- Journal:
- Impact factor: 0
- Authors: D. Basu; Rital Bhavsar; Imtiyaz Gulami; Sai Manoz Lingamallu; Ravi S Muddashetty; Chandrakanth Veeranna; S. Chattarji; R. Thimmulappa; A. Bhattacharya; Arjun Guha
- Corresponding author: Arjun Guha
Semantics and Types for Objects with First-Class Member Names
- DOI:
- Publication date: 2012
- Journal:
- Impact factor: 0
- Authors: J. Politz; Arjun Guha; S. Krishnamurthi
- Corresponding author: S. Krishnamurthi
The Sweep: Essential Examples for In-Flow Peer Review
- DOI: 10.1145/2839509.2844626
- Publication date: 2016
- Journal:
- Impact factor: 0
- Authors: J. Politz; Joseph M. Collard; Arjun Guha; Kathi Fisler; S. Krishnamurthi
- Corresponding author: S. Krishnamurthi
Fluid Object Types
- DOI:
- Publication date: 2011
- Journal:
- Impact factor: 0
- Authors: Arjun Guha; J. Politz; S. Krishnamurthi
- Corresponding author: S. Krishnamurthi
Fission: Secure Dynamic Code-Splitting for JavaScript
- DOI:
- Publication date: 2017
- Journal:
- Impact factor: 0
- Authors: Arjun Guha; Jean; Rachit Nigam; J. Tangen; Rian Shambaugh
- Corresponding author: Rian Shambaugh
Other grants by Arjun Guha
Collaborative Research: FW-HTF-RM: AI-Assisted Programming: Equipping Social and Natural Scientists for the Future of Research
- Award number: 2326173
- Fiscal year: 2023
- Award amount: $295,000
- Project type: Standard Grant
SHF:Small:A Language-based Approach to Faster and Safer Serverless Computing
- Award number: 2102288
- Fiscal year: 2020
- Award amount: $295,000
- Project type: Standard Grant
Collaborative Research: SHF: Small: Interactive Synthesis and Repair For Robot Programs
- Award number: 2102291
- Fiscal year: 2020
- Award amount: $295,000
- Project type: Standard Grant
SHF:Small:A Language-based Approach to Faster and Safer Serverless Computing
- Award number: 2007066
- Fiscal year: 2020
- Award amount: $295,000
- Project type: Standard Grant
Collaborative Research: SHF: Small: Interactive Synthesis and Repair For Robot Programs
- Award number: 2006995
- Fiscal year: 2020
- Award amount: $295,000
- Project type: Standard Grant
Collaborative Research: FMitF: Track I: Game Theoretic Updates for Network and Cloud Functions
- Award number: 2018393
- Fiscal year: 2020
- Award amount: $295,000
- Project type: Standard Grant
NeTS: Large: Collaborative Research:Programmable Inter-Domain Observation and Control
- Award number: 1413985
- Fiscal year: 2014
- Award amount: $295,000
- Project type: Continuing Grant
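Similar NSFC grants
Research on Quantum Field Theory without a Lagrangian Description
- Award number: 24ZR1403900
- Approval year: 2024
- Award amount: CNY 0
- Project type: Provincial/municipal project
Cell Research
- Award number: 31224802
- Approval year: 2012
- Award amount: CNY 240,000
- Project type: Special fund project
Cell Research
- Award number: 31024804
- Approval year: 2010
- Award amount: CNY 240,000
- Project type: Special fund project
Cell Research (细胞研究)
- Award number: 30824808
- Approval year: 2008
- Award amount: CNY 240,000
- Project type: Special fund project
Research on the Rapid Growth Mechanism of KDP Crystal
- Award number: 10774081
- Approval year: 2007
- Award amount: CNY 450,000
- Project type: General Program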
Similar overseas grants
FMitF: Collaborative Research: RedLeaf: Verified Operating Systems in Rust
- Award number: 2313411
- Fiscal year: 2023
- Award amount: $295,000
- Project type: Standard Grant
Collaborative Research: FMitF: Track I: DeepSmith: Scheduling with Quality Guarantees for Efficient DNN Model Execution
- Award number: 2349461
- Fiscal year: 2023
- Award amount: $295,000
- Project type: Standard Grant
Collaborative Research: FMitF: Track I: Game Theoretic Updates for Network and Cloud Functions
- Award number: 2318970
- Fiscal year: 2023
- Award amount: $295,000
- Project type: Standard Grant
Collaborative Research: FMitF: Track I: Knitting Semantics
- Award number: 2319182
- Fiscal year: 2023
- Award amount: $295,000
- Project type: Standard Grant
Collaborative Research: FMitF: Track I: Towards Verified Robustness and Safety in Power System-Informed Neural Networks
- Award number: 2319242
- Fiscal year: 2023
- Award amount: $295,000
- Project type: Standard Grant
Collaborative Research: FMitF: Track I: Towards Verified Robustness and Safety in Power System-Informed Neural Networks
- Award number: 2319243
- Fiscal year: 2023
- Award amount: $295,000
- Project type: Standard Grant
Collaborative Research: FMitF: Track I: Synthesis and Verification of In-Memory Computing Systems using Formal Methods
- Award number: 2319400
- Fiscal year: 2023
- Award amount: $295,000
- Project type: Standard Grant
Collaborative Research: FMitF: Track I: Synthesis and Verification of In-Memory Computing Systems using Formal Methods
- Award number: 2319399
- Fiscal year: 2023
- Award amount: $295,000
- Project type: Standard Grant
Collaborative Research: FMitF: Track I: Simplifying End-to-End Verification of High-Performance Distributed Systems
- Award number: 2318954
- Fiscal year: 2023
- Award amount: $295,000
- Project type: Standard Grant
Collaborative Research: FMitF: Track I: The Phlox framework for verifying a high-performance distributed database
- Award number: 2319167
- Fiscal year: 2023
- Award amount: $295,000
- Project type: Standard Grant