SPX: Collaborative Research: Intelligent Communication Fabrics to Facilitate Extreme Scale Computing

SPX:协作研究:促进超大规模计算的智能通信结构

基本信息

  • 批准号:
    1918987
  • 负责人:
  • 金额:
    $ 45.01万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2019
  • 资助国家:
    美国
  • 起止时间:
    2019-10-01 至 2024-02-29
  • 项目状态:
    已结题

项目摘要

Advances in High-Performance Computing over the last several decades have enabled various important applications throughout science and engineering. Semiconductor technology increasingly faces fundamental physical limits. New approaches to hardware/software co-design are now achieving higher performance at extreme scales. This project will explore two new approaches: configurability and integration. Configurability enables hardware to map better to applications. Integration enables system components that have generally been single function to gain additional functionality. An example is a network for the transport of data. Such a network might also operate on that data as it is being transported. Integration enables compute everywhere in the architecture and network. Configurability and integration will lead to much more efficient use of computing resources. Such resources include high-performance computers. Another focus of this project will be in the use of Field Programmable Gate Arrays (FPGAs). FPGAs are a widely used type of integrated circuit. In FPGAs the hardware can be configured to match the application. FPGAs will allow the deployment of the research products into production systems. They will also allow evaluation of changes in system design. This work will advance extreme-scale computing applications. Such applications include medicine and new drug discovery. This project will also include training of graduate and undergraduate students. It will focus on students belonging to groups underrepresented in STEM disciplinesThe central theme of this project is that emerging centrality of hardware configurability and integration means that at least two old systems abstractions must be broken. First, this project will demonstrate that computation and communication should no longer comprise separate silos. Second, it will demonstrate that applications should no longer be mapped to fixed hardware. In their place, new hardware-software abstractions must necessarily be built around the idea of the application-centric system. This project will identify new computing modes -- computation distribution/offload and configurable hardware -- that will be application aware with tight mapping of applications to computer systems. This improved mapping, in turn, will require deploying a number of mechanisms, starting with improved compilers and high-quality application libraries but extending to lighter-weight yet intelligent middleware (for instance, improved Message Passing Interface middleware), automated application modification, dynamic autotuning, and machine learning assisting in all of these components. The project will consider all of these concerns in due course.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
在过去的几十年里,高性能计算的进步使科学和工程中的各种重要应用成为可能。半导体技术日益面临基本的物理限制。硬件/软件协同设计的新方法现在在极端规模下实现了更高的性能。这个项目将探索两种新的方法:可配置性和集成。可配置性使硬件能够更好地映射到应用程序。集成使通常具有单一功能的系统组件能够获得额外的功能。一个例子是用于数据传输的网络。这样的网络还可以在传输数据时对数据进行操作。集成使计算在体系结构和网络中无处不在。可配置性和集成将导致更有效地使用计算资源。这些资源包括高性能计算机。该项目的另一个重点将是现场可编程门阵列(fpga)的使用。fpga是一种应用广泛的集成电路。在fpga中,硬件可以配置为与应用相匹配。fpga将允许将研究产品部署到生产系统中。它们还将允许对系统设计中的变化进行评估。这项工作将推进极端规模计算的应用。这些应用包括医药和新药发现。该项目还将包括研究生和本科生的培训。该项目的中心主题是硬件可配置性和集成的中心地位,这意味着必须打破至少两个旧的系统抽象。首先,这个项目将证明计算和通信不应再由各自的竖井组成。其次,它将演示应用程序不应该再映射到固定的硬件。在它们的位置上,必须围绕以应用程序为中心的系统的思想构建新的软硬件抽象。该项目将确定新的计算模式——计算分配/卸载和可配置硬件——这将是应用程序感知的,应用程序与计算机系统的紧密映射。这种改进的映射反过来又需要部署许多机制,从改进的编译器和高质量的应用程序库开始,扩展到轻量级的智能中间件(例如,改进的消息传递接口中间件)、自动应用程序修改、动态自动调优,以及在所有这些组件中辅助的机器学习。该项目将在适当的时候考虑所有这些问题。该奖项反映了美国国家科学基金会的法定使命,并通过使用基金会的知识价值和更广泛的影响审查标准进行评估,被认为值得支持。

项目成果

期刊论文数量(4)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Reconfigurable switches for high performance and flexible MPI collectives
  • DOI:
    10.1002/cpe.6769
  • 发表时间:
    2021-12
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Pouya Haghi;Anqi Guo;Qingqing Xiong;Chen Yang;Tong Geng;Justin T. Broaddus;Ryan J. Marshall;Derek Schafer;A. Skjellum;Martin C. Herbordt
  • 通讯作者:
    Pouya Haghi;Anqi Guo;Qingqing Xiong;Chen Yang;Tong Geng;Justin T. Broaddus;Ryan J. Marshall;Derek Schafer;A. Skjellum;Martin C. Herbordt
Implementation and evaluation of MPI 4.0 partitioned communication libraries
MPI 4.0分区通信库的实现和评估
  • DOI:
    10.1016/j.parco.2021.102827
  • 发表时间:
    2021
  • 期刊:
  • 影响因子:
    1.4
  • 作者:
    Dosanjh, Matthew G.F.;Worley, Andrew;Schafer, Derek;Soundararajan, Prema;Ghafoor, Sheikh;Skjellum, Anthony;Bangalore, Purushotham V.;Grant, Ryan E.
  • 通讯作者:
    Grant, Ryan E.
Understanding the use of message passing interface in exascale proxy applications
了解消息传递接口在百亿亿级代理应用程序中的使用
  • DOI:
    10.1002/cpe.5901
  • 发表时间:
    2020
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Sultana, Nawrin;Rüfenacht, Martin;Skjellum, Anthony;Bangalore, Purushotham;Laguna, Ignacio;Mohror, Kathryn
  • 通讯作者:
    Mohror, Kathryn
Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives
分区点对点通信原语的便携式实现的设计
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Anthony Skjellum其他文献

Understanding GPU Triggering APIs for MPI+X Communication
了解用于 MPI X 通信的 GPU 触发 API
  • DOI:
  • 发表时间:
    2024
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Patrick G. Bridges;Anthony Skjellum;E. Suggs;Derek Schafer;P. Bangalore
  • 通讯作者:
    P. Bangalore
MitM attacks on intellectual property and integrity of additive manufacturing systems: A security analysis
针对增材制造系统的知识产权和完整性的中间人攻击:安全分析
  • DOI:
    10.1016/j.cose.2024.103810
  • 发表时间:
    2024-05-01
  • 期刊:
  • 影响因子:
    5.400
  • 作者:
    Hamza Alkofahi;Heba Alawneh;Anthony Skjellum
  • 通讯作者:
    Anthony Skjellum

Anthony Skjellum的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Anthony Skjellum', 18)}}的其他基金

SPX: Collaborative Research: Intelligent Communication Fabrics to Facilitate Extreme Scale Computing
SPX:协作研究:促进超大规模计算的智能通信结构
  • 批准号:
    2412182
  • 财政年份:
    2023
  • 资助金额:
    $ 45.01万
  • 项目类别:
    Standard Grant
Collaborative Research: EAGER: Real-time Strategies and Synchronized Time Distribution Mechanisms for Enhanced Exascale Performance-Portability and Predictability
合作研究:EAGER:实时策略和同步时间分配机制,以增强百亿亿次性能-可移植性和可预测性
  • 批准号:
    2405142
  • 财政年份:
    2023
  • 资助金额:
    $ 45.01万
  • 项目类别:
    Standard Grant
Beginnings: Creating and Sustaining a Diverse Community of Expertise in Quantum Information Science (EQUIS) Across the Southeastern United States
起点:在美国东南部创建并维持一个多元化的量子信息科学 (EQUIS) 专业社区
  • 批准号:
    2414461
  • 财政年份:
    2023
  • 资助金额:
    $ 45.01万
  • 项目类别:
    Cooperative Agreement
Collaborative Research: EAGER: Real-time Strategies and Synchronized Time Distribution Mechanisms for Enhanced Exascale Performance-Portability and Predictability
合作研究:EAGER:实时策略和同步时间分配机制,以增强百亿亿次性能-可移植性和可预测性
  • 批准号:
    2151020
  • 财政年份:
    2022
  • 资助金额:
    $ 45.01万
  • 项目类别:
    Standard Grant
CC* Networking Infrastructure: Advancing High-speed Networking at UTC for Research and Education
CC* 网络基础设施:推进 UTC 的研究和教育高速网络
  • 批准号:
    1925598
  • 财政年份:
    2019
  • 资助金额:
    $ 45.01万
  • 项目类别:
    Standard Grant
Collaborative Research: Software Engineering Workforce Development in High Performance Computing for Digital Twins
协作研究:数字孪生高性能计算中的软件工程劳动力开发
  • 批准号:
    1935628
  • 财政年份:
    2019
  • 资助金额:
    $ 45.01万
  • 项目类别:
    Standard Grant
CC* Compute: A Cost-Effective, 2,048 Core InfiniBand Cluster at UTC for Campus Research and Education
CC* 计算:UTC 的具有成本效益的 2,048 核心 InfiniBand 集群,用于校园研究和教育
  • 批准号:
    1925603
  • 财政年份:
    2019
  • 资助金额:
    $ 45.01万
  • 项目类别:
    Standard Grant
Collaborative Research: CICI: Regional: SouthEast SciEntific Cybersecurity for University Research (SouthEast SECURE)
合作研究:CICI:区域:东南大学研究科学网络安全 (SouthEast SECURE)
  • 批准号:
    1812404
  • 财政年份:
    2017
  • 资助金额:
    $ 45.01万
  • 项目类别:
    Standard Grant
SHF: Medium: Collaborative Research: Next-Generation Message Passing for Parallel Programming: Resiliency, Time-to-Solution, Performance-Portability, Scalability, and QoS
SHF:中:协作研究:并行编程的下一代消息传递:弹性、解决时间、性能可移植性、可扩展性和 QoS
  • 批准号:
    1822191
  • 财政年份:
    2017
  • 资助金额:
    $ 45.01万
  • 项目类别:
    Continuing Grant
SHF: Small: Collaborative Research: Coupling Computation and Communication in FPGA-Enhanced Clouds and Clusters
SHF:小型:协作研究:FPGA 增强型云和集群中的耦合计算和通信
  • 批准号:
    1821431
  • 财政年份:
    2017
  • 资助金额:
    $ 45.01万
  • 项目类别:
    Standard Grant

相似海外基金

SPX: Collaborative Research: Automated Synthesis of Extreme-Scale Computing Systems Using Non-Volatile Memory
SPX:协作研究:使用非易失性存储器自动合成超大规模计算系统
  • 批准号:
    2408925
  • 财政年份:
    2023
  • 资助金额:
    $ 45.01万
  • 项目类别:
    Standard Grant
SPX: Collaborative Research: Scalable Neural Network Paradigms to Address Variability in Emerging Device based Platforms for Large Scale Neuromorphic Computing
SPX:协作研究:可扩展神经网络范式,以解决基于新兴设备的大规模神经形态计算平台的可变性
  • 批准号:
    2401544
  • 财政年份:
    2023
  • 资助金额:
    $ 45.01万
  • 项目类别:
    Standard Grant
SPX: Collaborative Research: Intelligent Communication Fabrics to Facilitate Extreme Scale Computing
SPX:协作研究:促进超大规模计算的智能通信结构
  • 批准号:
    2412182
  • 财政年份:
    2023
  • 资助金额:
    $ 45.01万
  • 项目类别:
    Standard Grant
SPX: Collaborative Research: Cross-stack Memory Optimizations for Boosting I/O Performance of Deep Learning HPC Applications
SPX:协作研究:用于提升深度学习 HPC 应用程序 I/O 性能的跨堆栈内存优化
  • 批准号:
    2318628
  • 财政年份:
    2022
  • 资助金额:
    $ 45.01万
  • 项目类别:
    Standard Grant
SPX: Collaborative Research: NG4S: A Next-generation Geo-distributed Scalable Stateful Stream Processing System
SPX:合作研究:NG4S:下一代地理分布式可扩展状态流处理系统
  • 批准号:
    2202859
  • 财政年份:
    2022
  • 资助金额:
    $ 45.01万
  • 项目类别:
    Standard Grant
SPX: Collaborative Research: FASTLEAP: FPGA based compact Deep Learning Platform
SPX:协作研究:FASTLEAP:基于 FPGA 的紧凑型深度学习平台
  • 批准号:
    2333009
  • 财政年份:
    2022
  • 资助金额:
    $ 45.01万
  • 项目类别:
    Standard Grant
SPX: Collaborative Research: Memory Fabric: Data Management for Large-scale Hybrid Memory Systems
SPX:协作研究:内存结构:大规模混合内存系统的数据管理
  • 批准号:
    2132049
  • 财政年份:
    2021
  • 资助金额:
    $ 45.01万
  • 项目类别:
    Standard Grant
SPX: Collaborative Research: Automated Synthesis of Extreme-Scale Computing Systems Using Non-Volatile Memory
SPX:协作研究:使用非易失性存储器自动合成超大规模计算系统
  • 批准号:
    2113307
  • 财政年份:
    2020
  • 资助金额:
    $ 45.01万
  • 项目类别:
    Standard Grant
SPX: Collaborative Research: FASTLEAP: FPGA based compact Deep Learning Platform
SPX:协作研究:FASTLEAP:基于 FPGA 的紧凑型深度学习平台
  • 批准号:
    1919117
  • 财政年份:
    2019
  • 资助金额:
    $ 45.01万
  • 项目类别:
    Standard Grant
SPX: Collaborative Research: Parallel Algorithm by Blocks - A Data-centric Compiler/runtime System for Productive Programming of Scalable Parallel Systems
SPX:协作研究:块并行算法 - 用于可扩展并行系统的高效编程的以数据为中心的编译器/运行时系统
  • 批准号:
    1919021
  • 财政年份:
    2019
  • 资助金额:
    $ 45.01万
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了