Collaborative Research: FMitF: Track I: Composable Verification of Crash-Safe Distributed Systems with Grove
合作研究:FMitF:第一轨:使用 Grove 对崩溃安全分布式系统进行可组合验证
基本信息
- 批准号:2123864
- 负责人:
- 金额:$ 50万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2021
- 资助国家:美国
- 起止时间:2021-10-01 至 2025-09-30
- 项目状态:未结题
- 来源:
- 关键词:
项目摘要
Distributed systems play a crucial role in computer systems infrastructure. Nevertheless, developing reliable distributed systems is challenging due to the need to contend with concurrency across machines, concurrency within each machine, unreliable networks that can delay or drop messages, and partial failures if one or more machines crash and reboot while others continue running. As a result, distributed systems are error-prone and subtle bugs can lead to significant outages. Traditional testing approaches are insufficient to eliminate all such bugs. This project's novelty is a new approach to formal verification of distributed systems that allows verifying components in a modular fashion. It allows for verification of distributed systems in the presence of crashes. This project's impact is intended to include improving the reliability and correctness of distributed systems and avoid costly outages. In addition, new lab assignments for systems-verification classes are being developed, focused on distributed systems.The technical approach addresses two specific challenges: reasoning about crash recovery in distributed systems, as well as composing distributed systems from smaller components. Crash recovery is challenging because individual nodes can crash and reboot. Once a node starts running again, it might no longer be consistent with the rest of the system that did not crash. This means the node may have lost all of its memory contents on crash but may have kept some state durably on disk. The second challenge lies in composing specifications and proofs of distributed systems (such as a key-value store) that are built out of smaller components (such as a configuration service, a lock service, or the implementation of an individual node). Scaling verification of distributed systems requires the proof to reflect this modularity. For example, reasoning about an application that uses a lock service should not require reasoning about the network messages sent by the lock service itself. It should be done purely using the specifications for the lock service client stubs. This project tackles these challenges using concurrent separation logic, which provides a natural approach for composing proofs about multiple components, as well as abstracting away implementation details with a pre/post-condition specification. This project extends earlier work with techniques for distributed system reasoning, including new kinds of per-node invariants (which might need to be repaired on crash) as opposed to global invariants (which must hold even if some nodes have crashed). In addition, the project provides techniques for reasoning about exactly-once semantics of Remote Procedure Calls (RPC) on top of unreliable computer networks and locks that span multiple machines.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
分布式系统在计算机系统基础设施中起着至关重要的作用。然而,开发可靠的分布式系统是具有挑战性的,因为需要处理机器之间的并发性、每台机器内部的并发性、可能延迟或丢弃消息的不可靠网络,以及如果一台或多台机器崩溃并重新启动而其他机器继续运行时出现部分故障。因此,分布式系统容易出错,细微的错误可能导致严重的中断。传统的测试方法不足以消除所有这样的错误。这个项目的新颖之处在于,它提供了一种对分布式系统进行正式验证的新方法,允许以模块化的方式验证组件。它允许在出现崩溃时对分布式系统进行验证。这个项目的影响包括提高分布式系统的可靠性和正确性,避免代价高昂的中断。此外,系统验证课程的新实验作业正在开发中,重点放在分布式系统上。该技术方法解决了两个特定的挑战:对分布式系统中的崩溃恢复进行推理,以及用较小的组件组合分布式系统。崩溃恢复具有挑战性,因为单个节点可能崩溃并重新启动。一旦某个节点重新开始运行,它可能不再与系统中没有崩溃的其余部分保持一致。这意味着节点可能在崩溃时丢失了所有内存内容,但可能在磁盘上持久地保留了一些状态。第二个挑战在于组合分布式系统(例如键值存储)的规范和证明,这些系统由较小的组件(例如配置服务、锁服务或单个节点的实现)构建而成。分布式系统的扩展验证需要证明来反映这种模块化。例如,对使用锁服务的应用程序的推理不应该需要对锁服务本身发送的网络消息进行推理。它应该完全使用锁服务客户机存根的规范来完成。该项目使用并发分离逻辑解决了这些挑战,并发分离逻辑为组合多个组件的证明提供了一种自然的方法,并使用前置/后用条件规范抽象了实现细节。该项目使用分布式系统推理技术扩展了早期的工作,包括与全局不变量(即使某些节点崩溃也必须保持)相对的新类型的每节点不变量(可能需要在崩溃时修复)。此外,该项目还提供了在不可靠的计算机网络和跨多台机器的锁之上对远程过程调用(RPC)的精确一次语义进行推理的技术。该奖项反映了美国国家科学基金会的法定使命,并通过使用基金会的知识价值和更广泛的影响审查标准进行评估,被认为值得支持。
项目成果
期刊论文数量(1)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Verifying vMVCC, a high-performance transaction library using multi-version concurrency control
验证使用多版本并发控制的高性能事务库vMVCC
- DOI:
- 发表时间:2023
- 期刊:
- 影响因子:0
- 作者:Chang, Yun-Sheng;Jung, Ralf;Sharma, Upamanyu;Tassarotti, Joseph;Kaashoek, M. Frans;Zeldovich, Nickolai
- 通讯作者:Zeldovich, Nickolai
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Nickolai Zeldovich其他文献
Aardvark: An Asynchronous Authenticated Dictionary with Applications to Account-based Cryptocurrencies
Aardvark:异步认证字典,适用于基于账户的加密货币
- DOI:
- 发表时间:
2022 - 期刊:
- 影响因子:0
- 作者:
Derek Leung;Y. Gilad;S. Gorbunov;Leonid Reyzin;Nickolai Zeldovich - 通讯作者:
Nickolai Zeldovich
A Trigger-Based Middleware Cache for ORMs
基于触发器的 ORM 中间件缓存
- DOI:
- 发表时间:
2011 - 期刊:
- 影响因子:0
- 作者:
Priya Gupta;Nickolai Zeldovich;S. Madden - 通讯作者:
S. Madden
Optimizing unit test execution in large software programs using dependency analysis
使用依赖性分析优化大型软件程序中的单元测试执行
- DOI:
- 发表时间:
2013 - 期刊:
- 影响因子:0
- 作者:
Taesoo Kim;Ramesh Chandra;Nickolai Zeldovich - 通讯作者:
Nickolai Zeldovich
Guidelines for Using the CryptDB System Securely
安全使用 CryptDB 系统的指南
- DOI:
- 发表时间:
2015 - 期刊:
- 影响因子:0
- 作者:
Raluca A. Popa;Nickolai Zeldovich;H. Balakrishnan - 通讯作者:
H. Balakrishnan
Yodel: Strong Metadata Security for Real-Time Voice Calls
Yodel:实时语音通话的强大元数据安全性
- DOI:
- 发表时间:
2019 - 期刊:
- 影响因子:0
- 作者:
Y. Gilad;Nickolai Zeldovich - 通讯作者:
Nickolai Zeldovich
Nickolai Zeldovich的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Nickolai Zeldovich', 18)}}的其他基金
Collaborative Research: FMitF: Track I: The Phlox framework for verifying a high-performance distributed database
合作研究:FMitF:第一轨:用于验证高性能分布式数据库的 Phlox 框架
- 批准号:
2319167 - 财政年份:2023
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
SaTC: CORE: Medium: Verifying Hardware Security Modules with Information-Preserving Refinement
SaTC:核心:中:通过信息保留改进验证硬件安全模块
- 批准号:
2225441 - 财政年份:2022
- 资助金额:
$ 50万 - 项目类别:
Continuing Grant
SaTC: CORE: Small: verifying security for data non-interference
SaTC:核心:小:验证数据互不干扰的安全性
- 批准号:
1812522 - 财政年份:2018
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
FMitF: Verifying Concurrent System Software with Cspec
FMITF:使用 Cspec 验证并发系统软件
- 批准号:
1836712 - 财政年份:2018
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
CAREER: System-Wide Intrusion Recovery Using Selective Re-execution
职业:使用选择性重新执行进行系统范围的入侵恢复
- 批准号:
1053143 - 财政年份:2011
- 资助金额:
$ 50万 - 项目类别:
Continuing Grant
相似国自然基金
Research on Quantum Field Theory without a Lagrangian Description
- 批准号:24ZR1403900
- 批准年份:2024
- 资助金额:0.0 万元
- 项目类别:省市级项目
Cell Research
- 批准号:31224802
- 批准年份:2012
- 资助金额:24.0 万元
- 项目类别:专项基金项目
Cell Research
- 批准号:31024804
- 批准年份:2010
- 资助金额:24.0 万元
- 项目类别:专项基金项目
Cell Research (细胞研究)
- 批准号:30824808
- 批准年份:2008
- 资助金额:24.0 万元
- 项目类别:专项基金项目
Research on the Rapid Growth Mechanism of KDP Crystal
- 批准号:10774081
- 批准年份:2007
- 资助金额:45.0 万元
- 项目类别:面上项目
相似海外基金
FMitF: Collaborative Research: RedLeaf: Verified Operating Systems in Rust
FMITF:协作研究:RedLeaf:经过验证的 Rust 操作系统
- 批准号:
2313411 - 财政年份:2023
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
Collaborative Research: FMitF: Track I: Game Theoretic Updates for Network and Cloud Functions
合作研究:FMitF:第一轨:网络和云功能的博弈论更新
- 批准号:
2318970 - 财政年份:2023
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
Collaborative Research: FMitF: Track I: Knitting Semantics
合作研究:FMitF:第一轨:针织语义
- 批准号:
2319182 - 财政年份:2023
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
Collaborative Research: FMitF: Track I: Towards Verified Robustness and Safety in Power System-Informed Neural Networks
合作研究:FMitF:第一轨:实现电力系统通知神经网络的鲁棒性和安全性验证
- 批准号:
2319242 - 财政年份:2023
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
Collaborative Research: FMitF: Track I: DeepSmith: Scheduling with Quality Guarantees for Efficient DNN Model Execution
合作研究:FMitF:第一轨:DeepSmith:为高效 DNN 模型执行提供质量保证的调度
- 批准号:
2349461 - 财政年份:2023
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
Collaborative Research: FMitF: Track I: Towards Verified Robustness and Safety in Power System-Informed Neural Networks
合作研究:FMitF:第一轨:实现电力系统通知神经网络的鲁棒性和安全性验证
- 批准号:
2319243 - 财政年份:2023
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
Collaborative Research: FMitF: Track I: Synthesis and Verification of In-Memory Computing Systems using Formal Methods
合作研究:FMitF:第一轨:使用形式方法合成和验证内存计算系统
- 批准号:
2319400 - 财政年份:2023
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
Collaborative Research: FMitF: Track I: Synthesis and Verification of In-Memory Computing Systems using Formal Methods
合作研究:FMitF:第一轨:使用形式方法合成和验证内存计算系统
- 批准号:
2319399 - 财政年份:2023
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
Collaborative Research: FMitF: Track I: A Formal Verification and Implementation Stack for Programmable Logic Controllers
合作研究:FMitF:第一轨:可编程逻辑控制器的形式验证和实现堆栈
- 批准号:
2425711 - 财政年份:2023
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
Collaborative Research: FMitF: Track I: Simplifying End-to-End Verification of High-Performance Distributed Systems
合作研究:FMitF:第一轨:简化高性能分布式系统的端到端验证
- 批准号:
2318954 - 财政年份:2023
- 资助金额:
$ 50万 - 项目类别:
Standard Grant