Collaborative Research: SI2-SSI: A Comprehensive Performance Tuning Framework for the MPI Stack

合作研究:SI2-SSI:MPI 堆栈的综合性能调优框架

基本信息

  • 批准号:
    1148424
  • 负责人:
  • 金额:
    $ 45万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2012
  • 资助国家:
    美国
  • 起止时间:
    2012-06-01 至 2016-05-31
  • 项目状态:
    已结题

项目摘要

The Message Passing Interface (MPI) is a very widely used parallel programming model on modern High-End Computing (HEC) systems. Many performance aspects of MPI libraries, such as latency, bandwidth, scalability, memory footprint, cache pollution, overlap of computation and communication etc. are highly dependent on system configuration and application requirements. Additionally, modern clusters are changing rapidly with the growth of multi-core processors and commodity networking technologies such as InfiniBand and 10GigE/iWARP. They are becoming diverse and heterogeneous with varying number of processor cores, processor speed, memory speed, multi-generation network adapters/switches, I/O interface technologies, and accelerators (GPGPUs), etc. Typically, any MPI library deals with the above kind of diversity in platforms and sensitivity of applications by employing various runtime parameters. These parameters are tuned during its release, or bysystem administrators, or by end-users. These default parameters may or may not be optimal for all system configurations and applications.The MPI library of a typical proprietary system goes through heavy performance tuning for a range of applications. Since commodity clusters provide greater flexibility in their configurations (processor, memory and network), it is very hard to achieve optimal tuning using released version of any MPI library, with its default settings. This leads to the following broad challenge: "Can a comprehensive performance tuning framework be designed for MPI library so that the next generation InfiniBand, 10GigE/iWARP and RoCE clusters and applications will be able to extract `bare-metal' performance and maximum scalability?" The investigators, involving computerscientists from The Ohio State University (OSU) and Ohio Supercomputer Center (OSC) as well as computational scientists from the Texas Advanced Computing Center (TACC) and San Diego Supercomputer Center (SDSC), University of California San Diego (UCSD), will be addressing the above challenge with innovative solutions.The investigators will specifically address the following challenges: 1) Can a set of static tools be designed to optimize performance of an MPI library during installation time? 2) Can a set of dynamic tools with low overhead be designed to optimize performance on a per-user and per-application basis during production runs? 3) How to incorporate the proposed performance tuning framework with the upcoming MPIT interface? 4) How to configure MPI libraries on a given system to deliver different optimizations to a set of driving applications? and 5) What kind of benefits (in terms of performance, scalability, memory efficiency and reduction in cache pollution) can be achieved by the proposed tuning framework? The research will be driven by a set of applications from established NSF computational science researchers running large scale simulations on the TACC Ranger and other systems at OSC, SDSC and OSU. The proposed designs will be integrated into the open-source MVAPICH2 library.
消息传递接口(MPI)是现代高端计算(HEC)系统上非常广泛使用的并行编程模型。MPI库的许多性能方面,如延迟、带宽、可扩展性、内存占用、缓存污染、计算和通信的重叠等,都高度依赖于系统配置和应用程序要求。此外,随着多核处理器和商用网络技术(如InfiniBand和10 GigE/iWARP)的发展,现代集群正在迅速变化。它们变得多样化和异构,具有不同数量的处理器内核、处理器速度、存储器速度、多代网络适配器/交换机、I/O接口技术和加速器(GPGPU)等。通常,任何MPI库都通过采用各种运行时参数来处理上述平台多样性和应用程序敏感性。这些参数在发布过程中,或者由系统管理员,或者由最终用户进行调整。 这些默认参数可能对所有系统配置和应用程序都是最佳的,也可能不是最佳的。典型专有系统的MPI库需要为一系列应用程序进行大量的性能调优。 由于商用集群在配置(处理器、内存和网络)方面提供了更大的灵活性,因此使用任何MPI库的发布版本及其默认设置都很难实现最佳调优。这导致了以下广泛的挑战:“能否为MPI库设计一个全面的性能调优框架,以便下一代InfiniBand、10 GigE/iWARP和RoCE集群和应用程序能够提取'裸金属'性能和最大的可扩展性?“调查人员将利用创新的解决方案应对上述挑战,其中包括来自俄亥俄州州立大学(OSU)和俄亥俄州超级计算机中心(OSC)的计算机科学家,以及来自德克萨斯州高级计算中心(TACC)和圣地亚哥超级计算机中心(SDSC)、加州圣地亚哥大学(UCSD)的计算科学家。调查人员将具体应对以下挑战:1)可以设计一组静态工具来优化MPI库在安装时的性能吗? 2)能否设计一组低开销的动态工具来在生产运行期间优化每个用户和每个应用程序的性能? 3)如何将建议的性能调优框架与即将到来的MPIT接口结合起来? 4)如何在给定的系统上配置MPI库,以便为一组驱动应用程序提供不同的优化? 以及5)所提出的调优框架可以实现什么样的益处(在性能、可伸缩性、内存效率和减少缓存污染方面)? 这项研究将由NSF计算科学研究人员在OSC,SDSC和OSU的TACC Ranger和其他系统上进行大规模模拟的一系列应用程序驱动。 拟议的设计将被集成到开源MVAPICH 2库中。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

William Barth其他文献

Outcomes of a second external cephalic version for management of malpresentation
  • DOI:
    10.1016/j.ajog.2022.11.812
  • 发表时间:
    2023-01-01
  • 期刊:
  • 影响因子:
  • 作者:
    Fowsia Warsame;Thomas P. Kishkovich;Mackenzie N. Naert;Mireya P. Taboada;Kaitlyn E. James;William Barth;Mark A. Clapp
  • 通讯作者:
    Mark A. Clapp
Validation of external cephalic version prediction model in population with low neuraxial anesthesia use
  • DOI:
    10.1016/j.ajog.2022.11.987
  • 发表时间:
    2023-01-01
  • 期刊:
  • 影响因子:
  • 作者:
    Thomas P. Kishkovich;Mackenzie N. Naert;Fowsia Warsame;Mireya P. Taboada;Kaitlyn E. James;William Barth;Mark A. Clapp
  • 通讯作者:
    Mark A. Clapp
Pregnancy Increases Parathyroid Hormone Related Protein (PTHrP) Secretion from Human Uterine Smooth Muscle Cells
妊娠增加人子宫平滑肌细胞甲状旁腺激素相关蛋白(PTHrP)的分泌
  • DOI:
    10.1203/00006450-199904020-00581
  • 发表时间:
    1999-04-01
  • 期刊:
  • 影响因子:
    3.100
  • 作者:
    Jeffrey S Shenberger;Jerome A Choate;Patricia S Dixon;Kenneth Helal;William Barth
  • 通讯作者:
    William Barth
1195: Severe newborn morbidity in the second stage of labor
  • DOI:
    10.1016/j.ajog.2019.11.1207
  • 发表时间:
    2020-01-01
  • 期刊:
  • 影响因子:
  • 作者:
    Sarah E. Little;Mark A. Clapp;Sarah Lassey;Radek Bukowski;William Barth;Julian N. Robinson
  • 通讯作者:
    Julian N. Robinson
Role of individual provider in success of external cephalic version
  • DOI:
    10.1016/j.ajog.2022.11.622
  • 发表时间:
    2023-01-01
  • 期刊:
  • 影响因子:
  • 作者:
    Mackenzie N. Naert;Thomas P. Kishkovich;Fowsia Warsame;Mireya P. Taboada;Kaitlyn E. James;William Barth;Mark A. Clapp
  • 通讯作者:
    Mark A. Clapp

William Barth的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('William Barth', 18)}}的其他基金

Collaborative Research: Frameworks: Designing Next-Generation MPI Libraries for Emerging Dense GPU Systems
协作研究:框架:为新兴密集 GPU 系统设计下一代 MPI 库
  • 批准号:
    1931354
  • 财政年份:
    2019
  • 资助金额:
    $ 45万
  • 项目类别:
    Standard Grant
SHF: Large: Collaborative Research: Next Generation Communication Mechanisms exploiting Heterogeneity, Hierarchy and Concurrency for Emerging HPC Systems
SHF:大型:协作研究:利用新兴 HPC 系统的异构性、层次结构和并发性的下一代通信机制
  • 批准号:
    1565431
  • 财政年份:
    2016
  • 资助金额:
    $ 45万
  • 项目类别:
    Standard Grant
Collaborative Research: Integrated HPC Systems Usage and Performance of Resources Monitoring and Modeling (SUPReMM)
协作研究:集成 HPC 系统资源使用和性能监控和建模 (SUPReMM)
  • 批准号:
    1203604
  • 财政年份:
    2012
  • 资助金额:
    $ 45万
  • 项目类别:
    Standard Grant
SHF:Large:Collaborative Research:Unified Runtime for Supporting Hybrid Programming Models on Heterogeneous Architecture
SHF:大型:协作研究:支持异构架构上混合编程模型的统一运行时
  • 批准号:
    1213057
  • 财政年份:
    2012
  • 资助金额:
    $ 45万
  • 项目类别:
    Standard Grant

相似国自然基金

Research on Quantum Field Theory without a Lagrangian Description
  • 批准号:
    24ZR1403900
  • 批准年份:
    2024
  • 资助金额:
    0.0 万元
  • 项目类别:
    省市级项目
Cell Research
  • 批准号:
    31224802
  • 批准年份:
    2012
  • 资助金额:
    24.0 万元
  • 项目类别:
    专项基金项目
Cell Research
  • 批准号:
    31024804
  • 批准年份:
    2010
  • 资助金额:
    24.0 万元
  • 项目类别:
    专项基金项目
Cell Research (细胞研究)
  • 批准号:
    30824808
  • 批准年份:
    2008
  • 资助金额:
    24.0 万元
  • 项目类别:
    专项基金项目
Research on the Rapid Growth Mechanism of KDP Crystal
  • 批准号:
    10774081
  • 批准年份:
    2007
  • 资助金额:
    45.0 万元
  • 项目类别:
    面上项目

相似海外基金

Collaborative Research: SI2-SSI: Expanding Volunteer Computing
合作研究:SI2-SSI:扩展志愿者计算
  • 批准号:
    2039142
  • 财政年份:
    2020
  • 资助金额:
    $ 45万
  • 项目类别:
    Standard Grant
SI2-SSI: Collaborative Research: Einstein Toolkit Community Integration and Data Exploration
SI2-SSI:协作研究:Einstein Toolkit 社区集成和数据探索
  • 批准号:
    2114580
  • 财政年份:
    2020
  • 资助金额:
    $ 45万
  • 项目类别:
    Continuing Grant
Collaborative Research: SI2-SSI: Expanding Volunteer Computing
合作研究:SI2-SSI:扩展志愿者计算
  • 批准号:
    2001752
  • 财政年份:
    2019
  • 资助金额:
    $ 45万
  • 项目类别:
    Standard Grant
Collaborative Research: NISC SI2-S2I2 Conceptualization of CFDSI: Model, Data, and Analysis Integration for End-to-End Support of Fluid Dynamics Discovery and Innovation
合作研究:NISC SI2-S2I2 CFDSI 概念化:模型、数据和分析集成,用于流体动力学发现和创新的端到端支持
  • 批准号:
    1743178
  • 财政年份:
    2018
  • 资助金额:
    $ 45万
  • 项目类别:
    Continuing Grant
Collaborative Research: NISC SI2-S2I2 Conceptualization of CFDSI: Model, Data, and Analysis Integration for End-to-End Support of Fluid Dynamics Discovery and Innovation
合作研究:NISC SI2-S2I2 CFDSI 概念化:模型、数据和分析集成,用于流体动力学发现和创新的端到端支持
  • 批准号:
    1743185
  • 财政年份:
    2018
  • 资助金额:
    $ 45万
  • 项目类别:
    Continuing Grant
Collaborative Research: NISC SI2-S2I2 Conceptualization of CFDSI: Model, Data, and Analysis Integration for End-to-End Support of Fluid Dynamics Discovery and Innovation
合作研究:NISC SI2-S2I2 CFDSI 概念化:模型、数据和分析集成,用于流体动力学发现和创新的端到端支持
  • 批准号:
    1743180
  • 财政年份:
    2018
  • 资助金额:
    $ 45万
  • 项目类别:
    Continuing Grant
Collaborative Research: NISC SI2-S2I2 Conceptualization of CFDSI: Model, Data, and Analysis Integration for End-to-End Support of Fluid Dynamics Discovery and Innovation
合作研究:NISC SI2-S2I2 CFDSI 概念化:模型、数据和分析集成,用于流体动力学发现和创新的端到端支持
  • 批准号:
    1743179
  • 财政年份:
    2018
  • 资助金额:
    $ 45万
  • 项目类别:
    Continuing Grant
Collaborative Research: NISC SI2-S2I2 Conceptualization of CFDSI: Model, Data, and Analysis Integration for End-to-End Support of Fluid Dynamics Discovery and Innovation
合作研究:NISC SI2-S2I2 CFDSI 概念化:模型、数据和分析集成,用于流体动力学发现和创新的端到端支持
  • 批准号:
    1743191
  • 财政年份:
    2018
  • 资助金额:
    $ 45万
  • 项目类别:
    Continuing Grant
Collaborative Research: SI2-SSE: WRENCH: A Simulation Workbench for Scientific Worflow Users, Developers, and Researchers
协作研究:SI2-SSE:WRENCH:面向科学 Worflow 用户、开发人员和研究人员的模拟工作台
  • 批准号:
    1642369
  • 财政年份:
    2017
  • 资助金额:
    $ 45万
  • 项目类别:
    Standard Grant
Collaborative Research: SI2-SSI: Open Source Support for Massively Parallel, Generic Finite Element Methods
合作研究:SI2-SSI:对大规模并行、通用有限元方法的开源支持
  • 批准号:
    1741870
  • 财政年份:
    2017
  • 资助金额:
    $ 45万
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了