SHF: Large: Collaborative Research: Next Generation Communication Mechanisms exploiting Heterogeneity, Hierarchy and Concurrency for Emerging HPC Systems

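Basic information

  • Award number:
    1565414
  • Principal investigator:
  • Amount:
    $1.1719 million
  • Host institution:
  • Host institution country:
    United States
  • Award type:
    Standard Grant
  • Fiscal year:
    2016
  • Funding country:
    United States
  • Duration:
    2016-08-15 to 2020-07-31
  • Status:
    Completed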

Project abstract

This award was partially supported by the CIF21 Software Reuse Venture, whose goals are to support pathways towards sustainable software elements through their reuse, and to emphasize the critical role of reusable software elements in a sustainable software cyberinfrastructure to support computational and data-enabled science and engineering.

Parallel programming based on MPI (Message Passing Interface) is being used with increased frequency in academia and government (defense and non-defense uses), as well as in emerging uses in scalable machine learning and big data analytics. The emergence of Dense Many-Core (DMC) architectures like Intel's Knights Landing (KNL) and accelerator/co-processor architectures like NVIDIA GPGPUs is enabling the design of systems with high compute density. This, coupled with the availability of Remote Direct Memory Access (RDMA)-enabled commodity networking technologies like InfiniBand, RoCE, and 10/40GigE with iWARP, is fueling the growth of multi-petaflop and exaflop systems. These DMC architectures have the following unique characteristics: deeper levels of hierarchical memory; revolutionary network interconnects; and heterogeneous compute power and data movement costs (with heterogeneity at chip-level and node-level). For these emerging systems, a combination of MPI and other programming models, known as MPI+X (where X can be PGAS, Tasks, OpenMP, OpenACC, or CUDA), is being targeted. The current generation communication protocols and mechanisms for MPI+X programming models cannot efficiently support the emerging DMC architectures. This leads to the following broad challenges: 1) How can high-performance and scalable communication mechanisms for next generation DMC architectures be designed to support MPI+X (including Task-based) programming models? and 2) How can the current and next generation applications be designed/co-designed with the proposed communication mechanisms?

A synergistic and comprehensive research plan, involving computer scientists from The Ohio State University (OSU) and Ohio Supercomputer Center (OSC) and computational scientists from the Texas Advanced Computing Center (TACC), San Diego Supercomputer Center (SDSC) and University of California San Diego (UCSD), is proposed to address the above broad challenges with innovative solutions. The research will be driven by a set of applications from established NSF computational science researchers running large scale simulations on Stampede and Comet and other systems at OSC and OSU. The proposed designs will be integrated into the widely-used MVAPICH2 library and made available for public use. Multiple graduate and undergraduate students will be trained under this project as future scientists and engineers in HPC. The established national-scale training and outreach programs at TACC, SDSC and OSC will be used to disseminate the results of this research to XSEDE users. Tutorials will be organized at XSEDE, SC and other conferences to share the research results and experience with the community.

Project outcomes

Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)

Other grants by Dhabaleswar Panda

CSR: Small: CONCERT: Designing Scalable Communication Runtimes with On-the-fly Compression for HPC and AI Applications on Heterogeneous Architectures
  • Award number:
    2312927
  • Fiscal year:
    2023
  • Amount:
    $1.1719 million
  • Award type:
    Standard Grant
Travel: Student Travel Support for MVAPICH User Group (MUG) 2023 Conference
  • Award number:
    2331223
  • Fiscal year:
    2023
  • Amount:
    $1.1719 million
  • Award type:
    Standard Grant
Collaborative Research: Frameworks: Performance Engineering Scientific Applications with MVAPICH and TAU using Emerging Communication Primitives
  • Award number:
    2311830
  • Fiscal year:
    2023
  • Amount:
    $1.1719 million
  • Award type:
    Standard Grant
Travel: Student Travel Support for MVAPICH User group (MUG) 2022 Conference
  • Award number:
    2231825
  • Fiscal year:
    2022
  • Amount:
    $1.1719 million
  • Award type:
    Standard Grant
AI Institute for Intelligent CyberInfrastructure with Computational Learning in the Environment (ICICLE)
  • Award number:
    2112606
  • Fiscal year:
    2021
  • Amount:
    $1.1719 million
  • Award type:
    Cooperative Agreement
MRI: RADiCAL: Reconfigurable Major Research Cyberinfrastructure for Advanced Computational Data Analytics and Machine Learning
  • Award number:
    2018627
  • Fiscal year:
    2020
  • Amount:
    $1.1719 million
  • Award type:
    Standard Grant
OAC Core: Small: Next-Generation Communication and I/O Middleware for HPC and Deep Learning with Smart NICs
  • Award number:
    2007991
  • Fiscal year:
    2020
  • Amount:
    $1.1719 million
  • Award type:
    Standard Grant
Student Travel Support for MVAPICH User Group (MUG) Meeting
  • Award number:
    1930003
  • Fiscal year:
    2019
  • Amount:
    $1.1719 million
  • Award type:
    Standard Grant
Collaborative Research: Frameworks: Designing Next-Generation MPI Libraries for Emerging Dense GPU Systems
  • Award number:
    1931537
  • Fiscal year:
    2019
  • Amount:
    $1.1719 million
  • Award type:
    Standard Grant
Student Travel Support for MVAPICH User Group (MUG) Meeting
  • Award number:
    1839739
  • Fiscal year:
    2018
  • Amount:
    $1.1719 million
  • Award type:
    Standard Grant

Similar NSFC (National Natural Science Foundation of China) grants

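Dissecting the molecular genetic network of LARGE6, a key regulator of grain number per panicle in rice
  • Award number:
  • Approval year:
    2022
  • Amount:
    CNY 300,000
  • Award type:
    Young Scientists Fund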
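Properties of topological quasiparticles in quantum spin liquids: quantum Monte Carlo and a new large-N theory
  • Award number:
  • Approval year:
    2020
  • Amount:
    CNY 620,000
  • Award type:
    General Program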
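Molecular mechanism by which the Large Grain gene regulates seed weight in Brassica napus
  • Award number:
    31972875
  • Approval year:
    2019
  • Amount:
    CNY 580,000
  • Award type:
    General Program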
Study of a retinal neovascularization model in Large PB/PB mice
  • Award number:
    30971650
  • Approval year:
    2009
  • Amount:
    CNY 80,000
  • Award type:
    General Program
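Posterior localization of the discs large gene in the Drosophila oocyte and its mechanism of action in body-axis polarity formation
  • Award number:
    30800648
  • Approval year:
    2008
  • Amount:
    CNY 200,000
  • Award type:
    Young Scientists Fund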
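Molecular regulation by the LARGE gene of α-DG glycosylation and expression in oral cancer cells
  • Award number:
    30772435
  • Approval year:
    2007
  • Amount:
    CNY 290,000
  • Award type:
    General Program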

Similar overseas grants

Collaborative Research: SHF: Medium: Enabling Graphics Processing Unit Performance Simulation for Large-Scale Workloads with Lightweight Simulation Methods
  • Award number:
    2402804
  • Fiscal year:
    2024
  • Amount:
    $1.1719 million
  • Award type:
    Standard Grant
Collaborative Research: SHF: Medium: Enabling GPU Performance Simulation for Large-Scale Workloads with Lightweight Simulation Methods
  • Award number:
    2402806
  • Fiscal year:
    2024
  • Amount:
    $1.1719 million
  • Award type:
    Standard Grant
Collaborative Research: SHF: Medium: Enabling GPU Performance Simulation for Large-Scale Workloads with Lightweight Simulation Methods
  • Award number:
    2402805
  • Fiscal year:
    2024
  • Amount:
    $1.1719 million
  • Award type:
    Standard Grant
SHF: Large: Collaborative Research: Molecular computing for the real world
  • Award number:
    1832985
  • Fiscal year:
    2018
  • Amount:
    $1.1719 million
  • Award type:
    Continuing Grant
SHF: Large: Collaborative Research: Next Generation Communication Mechanisms exploiting Heterogeneity, Hierarchy and Concurrency for Emerging HPC Systems
  • Award number:
    1565336
  • Fiscal year:
    2016
  • Amount:
    $1.1719 million
  • Award type:
    Standard Grant
SHF: Large: Collaborative Research: Exploiting the Naturalness of Software
  • Award number:
    1723215
  • Fiscal year:
    2016
  • Amount:
    $1.1719 million
  • Award type:
    Continuing Grant
SHF: Large: Collaborative Research: Next Generation Communication Mechanisms exploiting Heterogeneity, Hierarchy and Concurrency for Emerging HPC Systems
  • Award number:
    1565431
  • Fiscal year:
    2016
  • Amount:
    $1.1719 million
  • Award type:
    Standard Grant
SHF: Large: Collaborative Research: Molecular computing for the real world
  • Award number:
    1518715
  • Fiscal year:
    2015
  • Amount:
    $1.1719 million
  • Award type:
    Continuing Grant
SHF: Large: Collaborative Research: Molecular computing for the real world
  • Award number:
    1518833
  • Fiscal year:
    2015
  • Amount:
    $1.1719 million
  • Award type:
    Continuing Grant
SHF: Large: Collaborative Research: Molecular computing for the real world
  • Award number:
    1518723
  • Fiscal year:
    2015
  • Amount:
    $1.1719 million
  • Award type:
    Continuing Grant