SI2-SSI: Collaborative Research: A Software Infrastructure for MPI Performance Engineering: Integrating MVAPICH and TAU via the MPI Tools Interface

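Basic Information

  • Award number:
    1450471
  • Principal investigator:
  • Amount:
    $1.2 million
  • Host institution:
  • Host institution country:
    United States
  • Project type:
    Standard Grant
  • Fiscal year:
    2015
  • Funding country:
    United States
  • Project period:
    2015-09-01 to 2020-08-31
  • Project status:
    Completed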

Project Abstract

The Message Passing Interface (MPI) continues to dominate the supercomputing landscape as the primary parallel programming model of choice. A large variety of scientific applications in use today are based on MPI. On current and next-generation High-End Computing (HEC) systems, it is essential to understand the interaction between time-critical applications and the underlying MPI implementations in order to better optimize them for both scalability and performance. Current users of HEC systems develop their applications with high-performance MPI implementations, but analyze and fine-tune their behavior using standalone performance tools. Essentially, each software component views the other as a black box, with little sharing of information or access to capabilities that might be useful in optimization strategies. The lack of a standardized interface that allows interaction between the profiling tool and the MPI library has been a major impediment. The newly introduced MPI_T interface in the MPI-3 standard provides a simple mechanism that allows MPI implementers to expose variables representing configuration parameters or performance measurements from within the implementation for the benefit of tools, tuning frameworks, and other support libraries. However, few performance analysis and tuning tools take advantage of the MPI_T interface, and none do so to dynamically optimize at execution time. This research and development effort aims to build a software infrastructure for MPI performance engineering using the new MPI_T interface. With the adoption of MPI_T in the MPI standard, it is now possible to take positive steps to realize close interaction between and integration of MPI libraries and performance tools.
This research, undertaken by a team of computer scientists from OSU and UO representing the open-source MVAPICH and TAU projects, aims to create an open-source integrated software infrastructure built on the MPI_T interface, which defines the API for interaction and information interchange to enable fine-grained performance optimizations for HPC applications. The challenges addressed by the project include: 1) enhancing existing support for MPI_T in MVAPICH to expose a richer set of performance and control variables; 2) redesigning TAU to take advantage of the new MPI_T variables exposed by MVAPICH; 3) extending and enhancing TAU and MVAPICH with the ability to generate recommendations and performance engineering reports; 4) proposing fundamental design changes to make MPI libraries like MVAPICH "reconfigurable" at runtime; and 5) adding support to MVAPICH and TAU for interactive performance engineering sessions. The framework will be validated on a variety of HPC benchmarks and applications. The integrated middleware and tools will be made publicly available to the community. The research will have a significant impact on enabling optimizations of HPC applications that have previously been difficult to provide. As a result, it will contribute to deriving "best practice" guidelines for running on next-generation Multi-Petaflop and Exascale systems. The research directions and their solutions will be used in the curriculum of the PIs to train undergraduate and graduate students.
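
The idea of a tool reading a performance variable and adjusting a control variable at run time can be illustrated with a toy model. The sketch below is not the MPI_T API and not how MVAPICH or TAU are implemented: `ToyLibrary`, `tune`, and the variable names `eager_limit_bytes` and `rendezvous_sends` are hypothetical, chosen only to show the feedback loop. A real tool would go through the MPI_T calls defined by the MPI standard instead of Python dictionaries.

```python
from dataclasses import dataclass, field


@dataclass
class ToyLibrary:
    """Stand-in for a message-passing library (hypothetical, not real MPI)."""
    # Control variable: messages up to this size are sent eagerly.
    cvars: dict = field(default_factory=lambda: {"eager_limit_bytes": 1024})
    # Performance variable: number of sends that exceeded the eager limit.
    pvars: dict = field(default_factory=lambda: {"rendezvous_sends": 0})

    def send(self, nbytes: int) -> None:
        """Record a send; larger messages count as rendezvous sends."""
        if nbytes > self.cvars["eager_limit_bytes"]:
            self.pvars["rendezvous_sends"] += 1


def tune(lib: ToyLibrary, threshold: int, max_limit: int) -> bool:
    """Double the eager limit if too many sends fell back to rendezvous.

    Returns True if the control variable was changed.
    """
    if lib.pvars["rendezvous_sends"] <= threshold:
        return False
    old_limit = lib.cvars["eager_limit_bytes"]
    new_limit = min(old_limit * 2, max_limit)
    lib.cvars["eager_limit_bytes"] = new_limit
    lib.pvars["rendezvous_sends"] = 0  # start a fresh measurement window
    return new_limit != old_limit


lib = ToyLibrary()
for _ in range(3):          # three measurement windows
    for _ in range(10):     # workload: ten 3000-byte messages per window
        lib.send(3000)
    tune(lib, threshold=5, max_limit=65536)

print(lib.cvars["eager_limit_bytes"], lib.pvars["rendezvous_sends"])
```

In this toy run the limit is doubled after each of the first two windows (1024 to 2048 to 4096 bytes), after which the 3000-byte messages fit under the limit and the counter stays at zero. The doubling policy and the threshold are arbitrary choices for illustration.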

Project Outcomes

Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)

Other publications by Sameer Shende

Integrated parallel performance views

Other grants by Sameer Shende

Collaborative Research: Frameworks: Performance Engineering Scientific Applications with MVAPICH and TAU using Emerging Communication Primitives
  • Award number:
    2311831
  • Fiscal year:
    2023
  • Amount:
    $1.2 million
  • Project type:
    Standard Grant
CSSI: Elements: First Workshop on NSF and DOE High Performance Computing Tools
  • Award number:
    1939486
  • Fiscal year:
    2019
  • Amount:
    $1.2 million
  • Project type:
    Standard Grant

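Similar grants from the National Natural Science Foundation of China (NSFC)

Seismic performance of jacket-type offshore platforms considering SSI effects
  • Award number:
  • Approval year:
    2022
  • Amount:
    CNY 300,000
  • Project type:
    Young Scientists Fund
Response of inter-story isolated high-rise building structures under three-dimensional earthquakes considering SSI
  • Award number:
    52168072
  • Approval year:
    2021
  • Amount:
    CNY 350,000
  • Project type:
    Regional Science Fund
Dynamic characteristics of large storage tanks considering SSI effects and sloshing reduction using baffles
  • Award number:
    51978336
  • Approval year:
    2019
  • Amount:
    CNY 610,000
  • Project type:
    General Program
Seismic mechanism and performance evaluation methods for rocking wall-frame structures considering SSI effects
  • Award number:
    51978524
  • Approval year:
    2019
  • Amount:
    CNY 600,000
  • Project type:
    General Program
Performance-based seismic design method for RC girder bridges considering energy demand and SSI effects
  • Award number:
    50908014
  • Approval year:
    2009
  • Amount:
    CNY 200,000
  • Project type:
    Young Scientists Fund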

Similar overseas grants

Collaborative Research: SI2-SSI: Expanding Volunteer Computing
  • Award number:
    2039142
  • Fiscal year:
    2020
  • Amount:
    $1.2 million
  • Project type:
    Standard Grant
SI2-SSI: Collaborative Research: Einstein Toolkit Community Integration and Data Exploration
  • Award number:
    2114580
  • Fiscal year:
    2020
  • Amount:
    $1.2 million
  • Project type:
    Continuing Grant
Collaborative Research: SI2-SSI: Expanding Volunteer Computing
  • Award number:
    2001752
  • Fiscal year:
    2019
  • Amount:
    $1.2 million
  • Project type:
    Standard Grant
Collaborative Research: SI2-SSI: Expanding Volunteer Computing
  • Award number:
    1664022
  • Fiscal year:
    2017
  • Amount:
    $1.2 million
  • Project type:
    Standard Grant
Collaborative Research: SI2-SSI: Cyberinfrastructure for Advancing Hydrologic Knowledge through Collaborative Integration of Data Science, Modeling and Analysis
  • Award number:
    1664061
  • Fiscal year:
    2017
  • Amount:
    $1.2 million
  • Project type:
    Standard Grant
SI2-SSI: Collaborative Research: Scalable Infrastructure for Enabling Multiscale and Multiphysics Applications in Fluid Dynamics, Solid Mechanics, and Fluid-Structure Interaction
  • Award number:
    1836797
  • Fiscal year:
    2017
  • Amount:
    $1.2 million
  • Project type:
    Standard Grant
Collaborative Research: SI2-SSI: Open Source Support for Massively Parallel, Generic Finite Element Methods
  • Award number:
    1741870
  • Fiscal year:
    2017
  • Amount:
    $1.2 million
  • Project type:
    Standard Grant
Collaborative Research: SI2-SSI: Modules for Experiments in Stellar Astrophysics
  • Award number:
    1663684
  • Fiscal year:
    2017
  • Amount:
    $1.2 million
  • Project type:
    Standard Grant
Collaborative Research: SI2-SSI: Inquiry-Focused Volumetric Data Analysis Across Scientific Domains: Sustaining and Expanding the yt Community
  • Award number:
    1663893
  • Fiscal year:
    2017
  • Amount:
    $1.2 million
  • Project type:
    Standard Grant
Collaborative Research: SI2-SSI: Cyberinfrastructure for Advancing Hydrologic Knowledge through Collaborative Integration of Data Science, Modeling and Analysis
  • Award number:
    1664018
  • Fiscal year:
    2017
  • Amount:
    $1.2 million
  • Project type:
    Standard Grant