Collaborative Research: PPoSS: Large: A Full-Stack Architecture for Sparse Computation

协作研究:PPoSS:大型:稀疏计算的全栈架构

基本信息

  • 批准号:
    2216971
  • 负责人:
  • 金额:
    $ 54.98万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2022
  • 资助国家:
    美国
  • 起止时间:
    2022-10-01 至 2027-09-30
  • 项目状态:
    未结题

项目摘要

Computer systems have been designed and optimized primarily for dense computations, i.e., those that process regularly structured data. But current systems are ill-suited to sparse computations, i.e., those that process unstructured data. Sparse computations are very common because many relations and interactions are sparse. For example, most people are not friends and most neurons are not directly connected. Sparse computations take advantage of this sparsity by encoding and processing only meaningful relations, such as storing only the non-zero elements of a matrix. These applications are crucial in many domains, like deep learning, data analytics, and scientific computing, but their irregular structure makes them inefficient and hard to scale in currentsystems, wasting billions of dollars yearly. This project aims to redesign the computing stack to provide first-class support for sparse computations. The project's novelties include a full system stack that spans programming languages, compilers, and specialized hardware architectures and large-scale computer systems. The project's impacts include making future parallel systems much more versatile, scalable, energy efficient and easier to program.This project takes a coordinated approach across the system stack to unlock the performance and scalability of sparse computations, because they pose challenges that cannot be addressed at a single layer. For example, sparse computations have a rich space of choices in algorithm, data representation, and schedule, which current languages and compilers cannot capture or optimize properly. The right choice of algorithm and data representation are often unknown in advance and may change at run-time, thwarting the rigid division between current compilers and schedulers. Irregular, data-dependent control and memory accesses stymie compiler analysis, hinder parallelization, make poor use of hardware, and introduce numerous side channels that thwart security. Finally, their data-intensive nature is a poor match to the processors and accelerators pervasive in current clusters and datacenters, which optimize for compute operations rather than to minimize data movement. To tackle these challenges, this project will develop a full system stack spanning domain-specific languages, a tightly integrated compiler and scheduler, and specialized hardware architectures and high-performance, multi-node computer systems and networks. This stack is built around a unifying abstraction, anovel sparse intermediate representation that (1) encodes semantic information on key sparse data structures and their iterations, (2) enables optimizing compiler transformations and dynamic scheduling decisions, and (3) can be easily compiled to parallel architectures, including graphics processing units (GPUs), general-purpose processors, our proposed specialized architecture, and their combination. The full stack will be designed with security at the forefront, leveraging novel cross-layer techniques to achieve secure high performance. This system will be rigorously evaluated using a broad set of sparse applications and at a wide range of system scales, including large-scale clusters with hundreds of GPUs or tens of specialized processors. By innovating across the full software and hardware stack, these techniques will achieve performance, scalability, and efficiency gains that single-layer approaches cannot provide.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
计算机系统主要针对密集计算(即处理规则结构化数据的计算)进行设计和优化。但当前的系统不适合稀疏计算,即处理非结构化数据的系统。稀疏计算非常常见,因为许多关系和交互都是稀疏的。例如,大多数人都不是朋友,大多数神经元也没有直接连接。稀疏计算通过仅编码和处理有意义的关系来利用这种稀疏性,例如仅存储矩阵的非零元素。这些应用程序在深度学习、数据分析和科学计算等许多领域至关重要,但它们的不规则结构使其效率低下,并且难以在当前系统中扩展,每年浪费数十亿美元。该项目旨在重新设计计算堆栈,为稀疏计算提供一流的支持。该项目的新颖之处包括涵盖编程语言、编译器、专用硬件架构和大型计算机系统的完整系统堆栈。该项目的影响包括使未来的并行系统更加通用、可扩展、节能且更易于编程。该项目采用跨系统堆栈的协调方法来解锁稀疏计算的性能和可扩展性,因为它们带来了单层无法解决的挑战。例如,稀疏计算在算法、数据表示和调度方面有丰富的选择空间,当前的语言和编译器无法正确捕获或优化。算法和数据表示的正确选择通常是事先未知的,并且可能在运行时发生变化,从而阻碍了当前编译器和调度程序之间的严格划分。不规则的、数据相关的控制和内存访问会阻碍编译器分析、阻碍并行化、硬件利用率低下,并引入大量阻碍安全的侧通道。最后,它们的数据密集型性质与当前集群和数据中心中普遍存在的处理器和加速器不匹配,后者针对计算操作进行优化,而不是最大限度地减少数据移动。为了应对这些挑战,该项目将开发涵盖特定领域语言的完整系统堆栈、紧密集成的编译器和调度程序、专用硬件架构以及高性能、多节点计算机系统和网络。该堆栈围绕统一的抽象、新颖的稀疏中间表示构建,该表示(1)对关键稀疏数据结构及其迭代的语义信息进行编码,(2)能够优化编译器转换和动态调度决策,以及(3)可以轻松编译为并行架构,包括图形处理单元(GPU)、通用处理器、我们提出的专用架构及其组合。整个堆栈的设计将以安全为先,利用新颖的跨层技术来实现安全的高性能。该系统将使用广泛的稀疏应用程序和广泛的系统规模进行严格评估,包括具有数百个 GPU 或数十个专用处理器的大型集群。通过在整个软件和硬件堆栈上进行创新,这些技术将实现单层方法无法提供的性能、可扩展性和效率提升。该奖项反映了 NSF 的法定使命,并通过使用基金会的智力价值和更广泛的影响审查标准进行评估,被认为值得支持。

项目成果

期刊论文数量(4)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Provably Good Randomized Strategies for Data Placement in Distributed Key-Value Stores
An Efficient Scheduler for Task-Parallel Interactive Applications
Responsive Parallelism with Synchronization
响应式并行与同步
  • DOI:
    10.1145/3591249
  • 发表时间:
    2023
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Muller, Stefan K.;Singer, Kyle;Keeney, Devyn Terra;Neth, Andrew;Agrawal, Kunal;Lee, I-Ting Angelina;Acar, Umut A.
  • 通讯作者:
    Acar, Umut A.
Responsive parallelism with futures and state
与 future 和 state 的响应式并行
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Kunal Agrawal其他文献

Brief Announcement: Green Paging and Parallel Paging
简短公告:绿色分页和并行分页
  • DOI:
  • 发表时间:
    2020
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Kunal Agrawal;William Kuszmaul;Michele Scquizzato
  • 通讯作者:
    Michele Scquizzato
Intractability Issues in Mixed-Criticality Scheduling
混合关键调度中的棘手问题
  • DOI:
    10.4230/lipics.ecrts.2018.11
  • 发表时间:
    2018
  • 期刊:
  • 影响因子:
    3.7
  • 作者:
    Kunal Agrawal;Sanjoy Baruah
  • 通讯作者:
    Sanjoy Baruah
Distributed Load Balancing in the Face of Reappearance Dependencies
面对再现依赖的分布式负载均衡
Number : WUCSE-2013-25 2013 Parallel Real-Time Scheduling of DAGs
编号:WUCSE-2013-25 2013 DAG 并行实时调度
  • DOI:
  • 发表时间:
    2015
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Abusayeed Saifullah;D. Ferry;Jing Li;Kunal Agrawal;Chenyang Lu
  • 通讯作者:
    Chenyang Lu
Analysis of classic algorithms on GPUs
GPU上经典算法分析

Kunal Agrawal的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Kunal Agrawal', 18)}}的其他基金

Collaborative Research: AF: Medium: Adventures in Flatland: Algorithms for Modern Memories
合作研究:AF:媒介:平地历险记:现代记忆算法
  • 批准号:
    2106699
  • 财政年份:
    2021
  • 资助金额:
    $ 54.98万
  • 项目类别:
    Continuing Grant
Collaborative Research: SHF: Medium: Responsive Parallelism for Interactive Applications: Theory and Practice
协作研究:SHF:媒介:交互式应用程序的响应式并行性:理论与实践
  • 批准号:
    2107280
  • 财政年份:
    2021
  • 资助金额:
    $ 54.98万
  • 项目类别:
    Continuing Grant
SPX: Collaborative Research: Eat your Wheaties: Multi-Grain Compilers for Parallel Builds at Every Scale
SPX:协作研究:吃你的小麦:用于各种规模并行构建的多粒度编译器
  • 批准号:
    1725647
  • 财政年份:
    2017
  • 资助金额:
    $ 54.98万
  • 项目类别:
    Standard Grant
XPS: FULL: FP: Collaborative Research: Taming parallelism: optimally exploiting high-throughput parallel architectures
XPS:完整:FP:协作研究:驯服并行性:最佳地利用高吞吐量并行架构
  • 批准号:
    1439062
  • 财政年份:
    2014
  • 资助金额:
    $ 54.98万
  • 项目类别:
    Standard Grant
XPS: FP: Real-Time Scheduling of Parallel Tasks
XPS:FP:并行任务的实时调度
  • 批准号:
    1337218
  • 财政年份:
    2013
  • 资助金额:
    $ 54.98万
  • 项目类别:
    Standard Grant
CAREER: Provably Good Concurrency Platforms for Streaming Applications
职业:经过验证的流应用程序良好并发平台
  • 批准号:
    1150036
  • 财政年份:
    2012
  • 资助金额:
    $ 54.98万
  • 项目类别:
    Continuing Grant
AF: SMALL: Collaborative Research: Data Structures for Parallel Algorithms
AF:小:协作研究:并行算法的数据结构
  • 批准号:
    1218017
  • 财政年份:
    2012
  • 资助金额:
    $ 54.98万
  • 项目类别:
    Standard Grant

相似国自然基金

Research on Quantum Field Theory without a Lagrangian Description
  • 批准号:
    24ZR1403900
  • 批准年份:
    2024
  • 资助金额:
    0.0 万元
  • 项目类别:
    省市级项目
Cell Research
  • 批准号:
    31224802
  • 批准年份:
    2012
  • 资助金额:
    24.0 万元
  • 项目类别:
    专项基金项目
Cell Research
  • 批准号:
    31024804
  • 批准年份:
    2010
  • 资助金额:
    24.0 万元
  • 项目类别:
    专项基金项目
Cell Research (细胞研究)
  • 批准号:
    30824808
  • 批准年份:
    2008
  • 资助金额:
    24.0 万元
  • 项目类别:
    专项基金项目
Research on the Rapid Growth Mechanism of KDP Crystal
  • 批准号:
    10774081
  • 批准年份:
    2007
  • 资助金额:
    45.0 万元
  • 项目类别:
    面上项目

相似海外基金

Collaborative Research: PPoSS: Large: A Full-stack Approach to Declarative Analytics at Scale
协作研究:PPoSS:大型:大规模声明性分析的全栈方法
  • 批准号:
    2316161
  • 财政年份:
    2023
  • 资助金额:
    $ 54.98万
  • 项目类别:
    Continuing Grant
Collaborative Research: PPoSS: LARGE: Research into the Use and iNtegration of Data Movement Accelerators (RUN-DMX)
协作研究:PPoSS:大型:数据移动加速器 (RUN-DMX) 的使用和集成研究
  • 批准号:
    2316176
  • 财政年份:
    2023
  • 资助金额:
    $ 54.98万
  • 项目类别:
    Continuing Grant
Collaborative Research: PPoSS: Large: A Full-stack Approach to Declarative Analytics at Scale
协作研究:PPoSS:大型:大规模声明性分析的全栈方法
  • 批准号:
    2316158
  • 财政年份:
    2023
  • 资助金额:
    $ 54.98万
  • 项目类别:
    Continuing Grant
Collaborative Research: PPoSS: LARGE: Cross-layer Coordination and Optimization for Scalable and Sparse Tensor Networks (CROSS)
合作研究:PPoSS:LARGE:可扩展和稀疏张量网络的跨层协调和优化(CROSS)
  • 批准号:
    2316201
  • 财政年份:
    2023
  • 资助金额:
    $ 54.98万
  • 项目类别:
    Standard Grant
Collaborative Research: PPoSS: LARGE: Cross-layer Coordination and Optimization for Scalable and Sparse Tensor Networks (CROSS)
合作研究:PPoSS:LARGE:可扩展和稀疏张量网络的跨层协调和优化(CROSS)
  • 批准号:
    2316203
  • 财政年份:
    2023
  • 资助金额:
    $ 54.98万
  • 项目类别:
    Continuing Grant
Collaborative Research: PPoSS: LARGE: Research into the Use and iNtegration of Data Movement Accelerators (RUN-DMX)
协作研究:PPoSS:大型:数据移动加速器 (RUN-DMX) 的使用和集成研究
  • 批准号:
    2316177
  • 财政年份:
    2023
  • 资助金额:
    $ 54.98万
  • 项目类别:
    Continuing Grant
Collaborative Research: PPoSS: LARGE: Cross-layer Coordination and Optimization for Scalable and Sparse Tensor Networks (CROSS)
合作研究:PPoSS:LARGE:可扩展和稀疏张量网络的跨层协调和优化(CROSS)
  • 批准号:
    2316202
  • 财政年份:
    2023
  • 资助金额:
    $ 54.98万
  • 项目类别:
    Standard Grant
Collaborative Research: PPoSS: LARGE: General-Purpose Scalable Technologies for Fundamental Graph Problems
合作研究:PPoSS:大型:解决基本图问题的通用可扩展技术
  • 批准号:
    2316235
  • 财政年份:
    2023
  • 资助金额:
    $ 54.98万
  • 项目类别:
    Continuing Grant
Collaborative Research: PPoSS: LARGE: Principles and Infrastructure of Extreme Scale Edge Learning for Computational Screening and Surveillance for Health Care
合作研究:PPoSS:大型:用于医疗保健计算筛查和监视的超大规模边缘学习的原理和基础设施
  • 批准号:
    2406572
  • 财政年份:
    2023
  • 资助金额:
    $ 54.98万
  • 项目类别:
    Continuing Grant
Collaborative Research: PPoSS: Large: A Full-stack Approach to Declarative Analytics at Scale
协作研究:PPoSS:大型:大规模声明性分析的全栈方法
  • 批准号:
    2316159
  • 财政年份:
    2023
  • 资助金额:
    $ 54.98万
  • 项目类别:
    Continuing Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了