SHF: Small: Collaborative Research: Coupling Computation and Communication in FPGA-Enhanced Clouds and Clusters


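Basic information

  • Award number:
    1821431
  • Principal investigator:
  • Amount:
    $178,200
  • Host institution:
  • Host institution country:
    United States
  • Award type:
    Standard Grant
  • Fiscal year:
    2017
  • Funding country:
    United States
  • Project period:
    2017-10-01 to 2021-05-31
  • Project status:
    Completed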

Project abstract

The introduction of Field Programmable Gate Arrays (FPGAs) to accelerate clusters of servers in datacenters and clouds provides a great, immediate opportunity to leverage a new technology in high-end computing. With their flexible logic and native massive communication capability, FPGAs are ideal for high-performance computing in the post-Moore's Law world. Since the hardware adapts to the application, higher efficiency can be achieved, and since FPGAs are hybrid communication/computation processors, they can be interconnected directly chip-to-chip. Large-scale communication can consequently proceed with higher bandwidth, lower latency, and less processor impact. These features are crucial to enhancing performance beyond current levels. The proposed design allows for useful processing while data is in flight in the network, resulting in reduced software overhead in parallel middleware and reduced network congestion. The key tenets of the research are to achieve programmable, intelligent acceleration of applications while emphasizing overlap of communication and computation at low latency and substantially cutting software overhead.

The research project, FC5 (an FPGA framework for coupling communication and computation in clouds and clusters), has several thrusts. First, hardware support for FC5 is studied, along with methods of configurability in FC5 to reduce communication latency and support computing in the network. A second outcome is a prototype version of the Open MPI open-source implementation of the MPI-3.1 parallel middleware that utilizes FC5 to deliver features and performance enhancements involving data movement between and within servers, mathematical data reductions, and bulk data reorganizations. A third outcome is a set of proof-of-concept versions of multiple FC5 software models, including direct hardware access, a transparent MPI-in-OpenCL, and an API-based mechanism that exposes essential functionality. Finally, because FC5 is evolving rapidly with major new announcements expected imminently, continued refinement is essential. At least two model applications, Molecular Dynamics and Map-Reduce, will be used as test cases.

With the continued consolidation of computing services into the cloud, the potential broader impact is to increase both the scale and availability of parallel applications. The broad range of uses of cloud and cluster computing for commercial, government, and academic applications means that the acceleration offered will have a widespread impact across many sectors. Industry, where high-performance computing is gaining acceptance (e.g., for fast machine learning), is one commercial sector that stands to benefit from this project.

Project outcomes

Journal articles (2)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Implementation and evaluation of MPI 4.0 partitioned communication libraries
  • DOI:
    10.1016/j.parco.2021.102827
  • Publication date:
    2021
  • Journal:
  • Impact factor:
    1.4
  • Authors:
    Dosanjh, Matthew G.F.;Worley, Andrew;Schafer, Derek;Soundararajan, Prema;Ghafoor, Sheikh;Skjellum, Anthony;Bangalore, Purushotham V.;Grant, Ryan E.
  • Corresponding author:
    Grant, Ryan E.
Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives

Other publications by Anthony Skjellum

Understanding GPU Triggering APIs for MPI+X Communication
  • DOI:
  • Publication date:
    2024
  • Journal:
  • Impact factor:
    0
  • Authors:
    Patrick G. Bridges;Anthony Skjellum;E. Suggs;Derek Schafer;P. Bangalore
  • Corresponding author:
    P. Bangalore
MitM attacks on intellectual property and integrity of additive manufacturing systems: A security analysis
  • DOI:
    10.1016/j.cose.2024.103810
  • Publication date:
    2024-05-01
  • Journal:
  • Impact factor:
    5.400
  • Authors:
    Hamza Alkofahi;Heba Alawneh;Anthony Skjellum
  • Corresponding author:
    Anthony Skjellum

Other grants by Anthony Skjellum

SPX: Collaborative Research: Intelligent Communication Fabrics to Facilitate Extreme Scale Computing
  • Award number:
    2412182
  • Fiscal year:
    2023
  • Award amount:
    $178,200
  • Award type:
    Standard Grant
Collaborative Research: EAGER: Real-time Strategies and Synchronized Time Distribution Mechanisms for Enhanced Exascale Performance-Portability and Predictability
  • Award number:
    2405142
  • Fiscal year:
    2023
  • Award amount:
    $178,200
  • Award type:
    Standard Grant
Beginnings: Creating and Sustaining a Diverse Community of Expertise in Quantum Information Science (EQUIS) Across the Southeastern United States
  • Award number:
    2414461
  • Fiscal year:
    2023
  • Award amount:
    $178,200
  • Award type:
    Cooperative Agreement
Collaborative Research: EAGER: Real-time Strategies and Synchronized Time Distribution Mechanisms for Enhanced Exascale Performance-Portability and Predictability
  • Award number:
    2151020
  • Fiscal year:
    2022
  • Award amount:
    $178,200
  • Award type:
    Standard Grant
CC* Networking Infrastructure: Advancing High-speed Networking at UTC for Research and Education
  • Award number:
    1925598
  • Fiscal year:
    2019
  • Award amount:
    $178,200
  • Award type:
    Standard Grant
SPX: Collaborative Research: Intelligent Communication Fabrics to Facilitate Extreme Scale Computing
  • Award number:
    1918987
  • Fiscal year:
    2019
  • Award amount:
    $178,200
  • Award type:
    Standard Grant
Collaborative Research: Software Engineering Workforce Development in High Performance Computing for Digital Twins
  • Award number:
    1935628
  • Fiscal year:
    2019
  • Award amount:
    $178,200
  • Award type:
    Standard Grant
CC* Compute: A Cost-Effective, 2,048 Core InfiniBand Cluster at UTC for Campus Research and Education
  • Award number:
    1925603
  • Fiscal year:
    2019
  • Award amount:
    $178,200
  • Award type:
    Standard Grant
Collaborative Research: CICI: Regional: SouthEast SciEntific Cybersecurity for University Research (SouthEast SECURE)
  • Award number:
    1812404
  • Fiscal year:
    2017
  • Award amount:
    $178,200
  • Award type:
    Standard Grant
SHF: Medium: Collaborative Research: Next-Generation Message Passing for Parallel Programming: Resiliency, Time-to-Solution, Performance-Portability, Scalability, and QoS
  • Award number:
    1822191
  • Fiscal year:
    2017
  • Award amount:
    $178,200
  • Award type:
    Continuing Grant

Similar National Natural Science Foundation of China (NSFC) grants

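Forensic application of circadian-rhythm small RNAs in estimating bloodstain deposition time
  • Award number:
  • Year approved:
    2024
  • Award amount:
    CNY 0
  • Award type:
    Provincial/municipal project
Mechanism by which tRNA-derived small RNA upregulates the YBX1/CCL5 pathway in bortezomib-induced chronic pain
  • Award number:
    n/a
  • Year approved:
    2022
  • Award amount:
    CNY 100,000
  • Award type:
    Provincial/municipal project
Small RNA regulation of the type I-F CRISPR-Cas adaptive immune response and its molecular mechanism
  • Award number:
    32000033
  • Year approved:
    2020
  • Award amount:
    CNY 240,000
  • Award type:
    Young Scientists Fund
Mechanism of small RNA regulation of the biocontrol function of Bacillus amyloliquefaciens FZB42
  • Award number:
    31972324
  • Year approved:
    2019
  • Award amount:
    CNY 580,000
  • Award type:
    General Program
Mechanism by which Streptococcus mutans small RNAs link LuxS quorum sensing to biofilm formation
  • Award number:
    81900988
  • Year approved:
    2019
  • Award amount:
    CNY 210,000
  • Award type:
    Young Scientists Fund
Function and mechanism of key gut bacterial small RNAs in the onset and progression of Crohn's disease
  • Award number:
    31870821
  • Year approved:
    2018
  • Award amount:
    CNY 560,000
  • Award type:
    General Program
Molecular mechanism of pigeon milk secretion analyzed by small RNA sequencing
  • Award number:
    31802058
  • Year approved:
    2018
  • Award amount:
    CNY 260,000
  • Award type:
    Young Scientists Fund
Pathogenic mechanism of rice grassy stunt virus regulated by small RNA-mediated DNA methylation
  • Award number:
    31772128
  • Year approved:
    2017
  • Award amount:
    CNY 600,000
  • Award type:
    General Program
Immunoregulatory mechanism of acupuncture treatment for Hashimoto's thyroiditis based on small RNA-seq
  • Award number:
    81704176
  • Year approved:
    2017
  • Award amount:
    CNY 200,000
  • Award type:
    Young Scientists Fund
Regulation of small RNA biogenesis by rice OsSGS3 and OsHEN1 and its effect on disease resistance
  • Award number:
    91640114
  • Year approved:
    2016
  • Award amount:
    CNY 850,000
  • Award type:
    Major Research Plan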

Similar overseas grants

Collaborative Research: SHF: Small: LEGAS: Learning Evolving Graphs At Scale
  • Award number:
    2331302
  • Fiscal year:
    2024
  • Award amount:
    $178,200
  • Award type:
    Standard Grant
Collaborative Research: SHF: Small: LEGAS: Learning Evolving Graphs At Scale
  • Award number:
    2331301
  • Fiscal year:
    2024
  • Award amount:
    $178,200
  • Award type:
    Standard Grant
Collaborative Research: SHF: Small: Efficient and Scalable Privacy-Preserving Neural Network Inference based on Ciphertext-Ciphertext Fully Homomorphic Encryption
  • Award number:
    2412357
  • Fiscal year:
    2024
  • Award amount:
    $178,200
  • Award type:
    Standard Grant
Collaborative Research: SHF: Small: Quasi Weightless Neural Networks for Energy-Efficient Machine Learning on the Edge
  • Award number:
    2326895
  • Fiscal year:
    2023
  • Award amount:
    $178,200
  • Award type:
    Standard Grant
Collaborative Research: SHF: Small: Enabling Efficient 3D Perception: An Architecture-Algorithm Co-Design Approach
  • Award number:
    2334624
  • Fiscal year:
    2023
  • Award amount:
    $178,200
  • Award type:
    Standard Grant
Collaborative Research: SHF: Small: Technical Debt Management in Dynamic and Distributed Systems
  • Award number:
    2232720
  • Fiscal year:
    2023
  • Award amount:
    $178,200
  • Award type:
    Standard Grant
Collaborative Research: SHF: Small: Sub-millisecond Topological Feature Extractor for High-Rate Machine Learning
  • Award number:
    2234921
  • Fiscal year:
    2023
  • Award amount:
    $178,200
  • Award type:
    Standard Grant
Collaborative Research: SHF: Small: Reimagining Communication Bottlenecks in GNN Acceleration through Collaborative Locality Enhancement and Compression Co-Design
  • Award number:
    2326494
  • Fiscal year:
    2023
  • Award amount:
    $178,200
  • Award type:
    Standard Grant
Collaborative Research: SHF: Small: Quasi Weightless Neural Networks for Energy-Efficient Machine Learning on the Edge
  • Award number:
    2326894
  • Fiscal year:
    2023
  • Award amount:
    $178,200
  • Award type:
    Standard Grant
Collaborative Research: SHF: Small: Sub-millisecond Topological Feature Extractor for High-Rate Machine Learning
  • Award number:
    2234920
  • Fiscal year:
    2023
  • Award amount:
    $178,200
  • Award type:
    Standard Grant