XPS: FULL: Collaborative Research: Rethinking Architecture Support for Memory Consistency

XPS:完整:协作研究:重新思考对内存一致性的架构支持

基本信息

  • 批准号:
    1629196
  • 负责人:
  • 金额:
    $ 48.11万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2016
  • 资助国家:
    美国
  • 起止时间:
    2016-09-01 至 2021-08-31
  • 项目状态:
    已结题

项目摘要

Despite decades of progress, writing correct parallel software to realize the value of modern parallel computer hardware remains extremely difficult. A key problem is that today's computer systems do not give all programs clear behavioral guarantees; "ill-synchronized" code, in which parallel computations are incompletely or incorrectly coordinated, has ill-defined, often destructive behavior. This problem is a key theoretical and practical flaw in nearly all parallel computer systems. This proposal addresses this challenge, by proposing a new class of parallel computer architectures with strong behavioral guarantees, even for ill-synchronized code. The key idea is to make systems safely terminate ill-synchronized program executions before they can cause problems. To avoid degrading availability, the project includes mechanisms to avoid terminating program executions when possible, by falling back to more permissive, yet safe and predictable behavioral guarantees, and by resolving potential errors caused by ill-synchronized code. The intellectual merits of the project are that it provides crucial behavioral guarantees even to ill-synchronized parallel code. The project eliminates outdated hardware models that not only provide inadequate behavioral guarantees, but are also complex, and power-hungry. The project is the first in this domain to directly address availability and correctness together. The project's broader significance and importance are that it will improve the reliability of all parallel systems, which affects all aspects of life: medicine, energy, transportation, health, defense, and business. The stronger guarantees provided by this project avoid costly, dangerous failures and decrease the cost of application development, even in mature languages. The project will generate results relevant to industry and will influence academia through publication. The project will directly influence secondary and higher education in computing, fostering a diverse, future STEM workforce.To provide strong behavioral guarantees to all code -- even if incorrectly synchronized -- the proposed architectures provide region-atomic memory consistency guarantees for coarse-grained code regions. In these architectures, a program's execution is either a serialization of code regions, or it terminates with an exception that indicates an error could have left memory inconsistent. The architectures provide this strong memory consistency model to all program executions, departing from mainstream approaches to coherence and consistency that favor weaker guarantees without a clear benefit in complexity or performance. In systems executing ill-synchronized code, frequent exceptions may too often terminate program executions, degrading availability. The proposed architectures avoid degrading availability by tolerating consistency violations with a well-defined snapshot isolation semantics that avoids exceptions, but does not guarantee serializability of code regions. The architectures further address availability by resolving exceptions, leveraging commutativity of code to avoid unnecessary exceptions for commutative operations, as well as using dynamic symbolic analysis to resolve exceptions by combining symbolic memory updates.
尽管有了几十年的进步,编写正确的并行软件来实现现代并行计算机硬件的价值仍然是极其困难的。一个关键的问题是,今天的计算机系统并没有给所有的程序明确的行为保证;“不同步”代码,其中并行计算不完全或不正确地协调,具有不明确的,通常是破坏性的行为。这个问题是几乎所有并行计算机系统的一个关键理论和实际缺陷。本提案解决了这一挑战,提出了一种新的并行计算机体系结构,具有强大的行为保证,即使对于非同步代码也是如此。其关键思想是使系统在错误同步的程序执行引起问题之前安全地终止它们。为了避免降低可用性,该项目包括一些机制,通过退回到更宽松、更安全、更可预测的行为保证,以及解决由不同步代码引起的潜在错误,在可能的情况下避免终止程序执行。这个项目在智力上的优点是,它提供了关键的行为保证,甚至对不同步的并行代码也是如此。该项目消除了过时的硬件模型,这些模型不仅提供了不充分的行为保证,而且还很复杂,耗电量大。该项目是该领域中第一个直接解决可用性和正确性问题的项目。该项目更广泛的意义和重要性在于,它将提高所有并行系统的可靠性,这些系统影响到生活的各个方面:医药、能源、交通、卫生、国防和商业。该项目提供的更强的保证避免了代价高昂、危险的失败,并降低了应用程序开发的成本,即使是在成熟的语言中也是如此。该项目将产生与工业界相关的成果,并将通过出版物影响学术界。该项目将直接影响计算机领域的中学和高等教育,培养多元化的未来STEM劳动力。为了给所有代码提供强有力的行为保证——即使不正确地同步——所建议的体系结构为粗粒度代码区域提供了区域原子内存一致性保证。在这些体系结构中,程序的执行要么是代码区域的序列化,要么以一个异常结束,该异常表明错误可能导致内存不一致。这些体系结构为所有程序执行提供了这种强内存一致性模型,与主流的一致性和一致性方法不同,后者倾向于较弱的保证,但在复杂性或性能方面没有明显的好处。在执行不同步代码的系统中,频繁的异常可能经常终止程序执行,从而降低可用性。所建议的体系结构通过使用定义良好的快照隔离语义容忍一致性违规,从而避免降低可用性,该语义避免了异常,但不保证代码区域的可序列化性。这些体系结构通过解决异常进一步解决可用性问题,利用代码的交换性来避免交换操作的不必要异常,以及使用动态符号分析通过组合符号内存更新来解决异常。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Brandon Lucia其他文献

SOFRITAS: Serializable Ordering-Free Regions for Increasing Thread Atomicity Scalably
SOFRITAS:可序列化的无序区域,用于可扩展地增加线程原子性
EagleEye: Nanosatellite constellation design for high-coverage, high-resolution sensing
EagleEye:用于高覆盖范围、高分辨率传感的纳米卫星星座设计
Peacenik: Architecture Support for Not Failing under Fail-Stop Memory Consistency
Peacenik:在故障停止内存一致性下不失败的架构支持
Alternative approaches to hypertension
高血压的替代方法
  • DOI:
  • 发表时间:
    2019
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Brandon Lucia;Joseph Devietti;Tom Bergan;L. Ceze;D. Grossman
  • 通讯作者:
    D. Grossman
Computational Nanosatellite Constellations
计算纳米卫星星座

Brandon Lucia的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Brandon Lucia', 18)}}的其他基金

CPS: Frontier: Software-Defined Nanosatellite Constellations: The Foundation of Future Space-Based Cyber-physical Systems
CPS:前沿:软件定义的纳米卫星星座:未来天基网络物理系统的基础
  • 批准号:
    2111751
  • 财政年份:
    2022
  • 资助金额:
    $ 48.11万
  • 项目类别:
    Continuing Grant
Workshop Proposal: Redefining the Future of Computer Architecture from First Principles
研讨会提案:从第一原理重新定义计算机架构的未来
  • 批准号:
    2220657
  • 财政年份:
    2022
  • 资助金额:
    $ 48.11万
  • 项目类别:
    Standard Grant
SHF: Small: Practical and Formal Foundations for Intermittent Computer Systems
SHF:小型:间歇计算机系统的实用和正式基础
  • 批准号:
    2007998
  • 财政年份:
    2020
  • 资助金额:
    $ 48.11万
  • 项目类别:
    Standard Grant
CAREER: System Support for Capable, Reliable Intermittently-Powered Computer Systems
职业:为功能强大、可靠的间歇供电计算机系统提供系统支持
  • 批准号:
    1751029
  • 财政年份:
    2018
  • 资助金额:
    $ 48.11万
  • 项目类别:
    Continuing Grant
CSR: SHF: Small: Programming Language, Runtime System, and Architecture Support for Reliability in Intermittent, Energy-Harvesting Computing Devices
CSR:SHF:小型:间歇性能量收集计算设备可靠性的编程语言、运行时系统和架构支持
  • 批准号:
    1526342
  • 财政年份:
    2015
  • 资助金额:
    $ 48.11万
  • 项目类别:
    Standard Grant

相似国自然基金

钴基Full-Heusler合金的掺杂效应和薄膜噪声特性研究
  • 批准号:
    51871067
  • 批准年份:
    2018
  • 资助金额:
    60.0 万元
  • 项目类别:
    面上项目

相似海外基金

XPS: FULL: Collaborative Research: Enabling Scalable Cloud And Edge-device Integration Using Cross-layer Parallelism
XPS:完整:协作研究:使用跨层并行性实现可扩展的云和边缘设备集成
  • 批准号:
    1903880
  • 财政年份:
    2018
  • 资助金额:
    $ 48.11万
  • 项目类别:
    Standard Grant
XPS: FULL: Collaborative Research: Parallel and Distributed Circuit Programming for Structured Prediction
XPS:完整:协作研究:用于结构化预测的并行和分布式电路编程
  • 批准号:
    1818643
  • 财政年份:
    2017
  • 资助金额:
    $ 48.11万
  • 项目类别:
    Standard Grant
XPS: FULL: Collaborative Research: Maximizing the Performance Potential and Reliability of Flash-based Solid State Devices for Future Storage Systems
XPS:完整:协作研究:最大限度地提高未来存储系统基于闪存的固态设备的性能潜力和可靠性
  • 批准号:
    1629291
  • 财政年份:
    2016
  • 资助金额:
    $ 48.11万
  • 项目类别:
    Standard Grant
XPS: FULL: Collaborative Research: Rethinking Architecture Support for Memory Consistency
XPS:完整:协作研究:重新思考对内存一致性的架构支持
  • 批准号:
    1629126
  • 财政年份:
    2016
  • 资助金额:
    $ 48.11万
  • 项目类别:
    Standard Grant
XPS: FULL: Collaborative Research: Parallel and Distributed Circuit Programming for Structured Prediction
XPS:完整:协作研究:用于结构化预测的并行和分布式电路编程
  • 批准号:
    1629459
  • 财政年份:
    2016
  • 资助金额:
    $ 48.11万
  • 项目类别:
    Standard Grant
XPS: FULL: Collaborative Research: Enabling Scalable Cloud And Edge-device Integration Using Cross-layer Parallelism
XPS:完整:协作研究:使用跨层并行性实现可扩展的云和边缘设备集成
  • 批准号:
    1629347
  • 财政年份:
    2016
  • 资助金额:
    $ 48.11万
  • 项目类别:
    Standard Grant
XPS: FULL: Collaborative Research: PARAGRAPH: Parallel, Scalable Graph Analytics
XPS:完整:协作研究:段落:并行、可扩展图形分析
  • 批准号:
    1629548
  • 财政年份:
    2016
  • 资助金额:
    $ 48.11万
  • 项目类别:
    Standard Grant
XPS: FULL: Collaborative Research: PARAGRAPH: Parallel, Scalable Graph Analytics
XPS:完整:协作研究:段落:并行、可扩展图形分析
  • 批准号:
    1629657
  • 财政年份:
    2016
  • 资助金额:
    $ 48.11万
  • 项目类别:
    Standard Grant
XPS: FULL: Collaborative Research: Maximizing the Performance Potential and Reliability of Flash-based Solid State Devices for Future Storage Systems
XPS:完整:协作研究:最大限度地提高未来存储系统基于闪存的固态设备的性能潜力和可靠性
  • 批准号:
    1629218
  • 财政年份:
    2016
  • 资助金额:
    $ 48.11万
  • 项目类别:
    Standard Grant
XPS: FULL: Collaborative Research: Maximizing the Performance Potential and Reliability of Flash-based Solid State Devices for Future Storage Systems
XPS:完整:协作研究:最大限度地提高未来存储系统基于闪存的固态设备的性能潜力和可靠性
  • 批准号:
    1629403
  • 财政年份:
    2016
  • 资助金额:
    $ 48.11万
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了