Collaborative Research: SI2-SSI: A Comprehensive Performance Tuning Framework for the MPI Stack
合作研究:SI2-SSI:MPI 堆栈的综合性能调优框架
基本信息
- 批准号:1147926
- 负责人:
- 金额:$ 45.08万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2012
- 资助国家:美国
- 起止时间:2012-06-01 至 2016-05-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
The Message Passing Interface (MPI) is a very widely used parallel programming model on modern High-End Computing (HEC) systems. Many performance aspects of MPI libraries, such as latency, bandwidth, scalability, memory footprint, cache pollution, overlap of computation and communication etc. are highly dependent on system configuration and application requirements. Additionally, modern clusters are changing rapidly with the growth of multi-core processors and commodity networking technologies such as InfiniBand and 10GigE/iWARP. They are becoming diverse and heterogeneous with varying number of processor cores, processor speed, memory speed, multi-generation network adapters/switches, I/O interface technologies, and accelerators (GPGPUs), etc. Typically, any MPI library deals with the above kind of diversity in platforms and sensitivity of applications by employing various runtime parameters. These parameters are tuned during its release, or bysystem administrators, or by end-users. These default parameters may or may not be optimal for all system configurations and applications.The MPI library of a typical proprietary system goes through heavy performance tuning for a range of applications. Since commodity clusters provide greater flexibility in their configurations (processor, memory and network), it is very hard to achieve optimal tuning using released version of any MPI library, with its default settings. This leads to the following broad challenge: "Can a comprehensive performance tuning framework be designed for MPI library so that the next generation InfiniBand, 10GigE/iWARP and RoCE clusters and applications will be able to extract `bare-metal' performance and maximum scalability?" The investigators, involving computerscientists from The Ohio State University (OSU) and Ohio Supercomputer Center (OSC) as well as computational scientists from the Texas Advanced Computing Center (TACC) and San Diego Supercomputer Center (SDSC), University of California San Diego (UCSD), will be addressing the above challenge with innovative solutions.The investigators will specifically address the following challenges: 1) Can a set of static tools be designed to optimize performance of an MPI library during installation time? 2) Can a set of dynamic tools with low overhead be designed to optimize performance on a per-user and per-application basis during production runs? 3) How to incorporate the proposed performance tuning framework with the upcoming MPIT interface? 4) How to configure MPI libraries on a given system to deliver different optimizations to a set of driving applications? and 5) What kind of benefits (in terms of performance, scalability, memory efficiency and reduction in cache pollution) can be achieved by the proposed tuning framework? The research will be driven by a set of applications from established NSF computational science researchers running large scale simulations on the TACC Ranger and other systems at OSC, SDSC and OSU. The proposed designs will be integrated into the open-source MVAPICH2 library.
消息传递接口(MPI)是现代高端计算(HEC)系统上非常广泛使用的并行编程模型。MPI库的许多性能方面,如延迟、带宽、可扩展性、内存占用、缓存污染、计算和通信的重叠等,都高度依赖于系统配置和应用程序要求。此外,随着多核处理器和商用网络技术(如InfiniBand和10 GigE/iWARP)的发展,现代集群正在迅速变化。它们变得多样化和异构,具有不同数量的处理器内核、处理器速度、存储器速度、多代网络适配器/交换机、I/O接口技术和加速器(GPGPU)等。通常,任何MPI库都通过采用各种运行时参数来处理上述平台多样性和应用程序敏感性。这些参数在发布过程中,或者由系统管理员,或者由最终用户进行调整。 这些默认参数可能对所有系统配置和应用程序都是最佳的,也可能不是最佳的。典型专有系统的MPI库需要为一系列应用程序进行大量的性能调优。 由于商用集群在配置(处理器、内存和网络)方面提供了更大的灵活性,因此使用任何MPI库的发布版本及其默认设置都很难实现最佳调优。这导致了以下广泛的挑战:“能否为MPI库设计一个全面的性能调优框架,以便下一代InfiniBand、10 GigE/iWARP和RoCE集群和应用程序能够提取'裸金属'性能和最大的可扩展性?“调查人员将利用创新的解决方案应对上述挑战,其中包括来自俄亥俄州州立大学(OSU)和俄亥俄州超级计算机中心(OSC)的计算机科学家,以及来自德克萨斯州高级计算中心(TACC)和圣地亚哥超级计算机中心(SDSC)、加州圣地亚哥大学(UCSD)的计算科学家。调查人员将具体应对以下挑战:1)可以设计一组静态工具来优化MPI库在安装时的性能吗? 2)能否设计一组低开销的动态工具来在生产运行期间优化每个用户和每个应用程序的性能? 3)如何将提议的性能调优框架与即将推出的MPIT接口结合起来? 4)如何在给定的系统上配置MPI库,以便为一组驱动应用程序提供不同的优化? 以及5)所提出的调优框架可以实现什么样的益处(在性能、可伸缩性、内存效率和减少缓存污染方面)? 这项研究将由NSF计算科学研究人员在OSC,SDSC和OSU的TACC Ranger和其他系统上进行大规模模拟的一系列应用程序驱动。 拟议的设计将被集成到开源MVAPICH 2库中。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Amitava Majumdar其他文献
Cyberinfrastructure Usage Modalities on the TeraGrid
TeraGrid 上的网络基础设施使用方式
- DOI:
10.1109/ipdps.2011.239 - 发表时间:
2011 - 期刊:
- 影响因子:0
- 作者:
Daniel S. Katz;David L. Hart;C. Jordan;Amitava Majumdar;J. Navarro;Warren Smith;John Towns;Von Welch;Nancy Wilkins - 通讯作者:
Nancy Wilkins
Thermal stability, dielectric and conductivity characteristics of 9,10-anthracene-diol-anhydride polycondensates
- DOI:
10.1007/bf00395581 - 发表时间:
1990-11-01 - 期刊:
- 影响因子:4.000
- 作者:
Amitava Majumdar;Mukul Biswas - 通讯作者:
Mukul Biswas
Ground bounce considerations in DC parametric test generation using boundary scan
使用边界扫描生成直流参数测试时的地弹注意事项
- DOI:
10.1109/vtest.1998.670853 - 发表时间:
1998 - 期刊:
- 影响因子:0
- 作者:
Amitava Majumdar;M. Komoda;Tim Ayres - 通讯作者:
Tim Ayres
A parallel Monte Carlo code for planar and SPECT imaging: implementation, verification and applications in /sup 131/I SPECT
用于平面和 SPECT 成像的并行蒙特卡罗代码:/sup 131/I SPECT 中的实现、验证和应用
- DOI:
10.1109/nssmic.2000.949310 - 发表时间:
2000 - 期刊:
- 影响因子:0
- 作者:
Y. Dewaraja;Michael Ljungberg;Amitava Majumdar;Abhijit Bose;K. Koral - 通讯作者:
K. Koral
Neuromorphic computing at scale
大规模神经形态计算
- DOI:
10.1038/s41586-024-08253-8 - 发表时间:
2025-01-22 - 期刊:
- 影响因子:48.500
- 作者:
Dhireesha Kudithipudi;Catherine Schuman;Craig M. Vineyard;Tej Pandit;Cory Merkel;Rajkumar Kubendran;James B. Aimone;Garrick Orchard;Christian Mayr;Ryad Benosman;Joe Hays;Cliff Young;Chiara Bartolozzi;Amitava Majumdar;Suma George Cardwell;Melika Payvand;Sonia Buckley;Shruti Kulkarni;Hector A. Gonzalez;Gert Cauwenberghs;Chetan Singh Thakur;Anand Subramoney;Steve Furber - 通讯作者:
Steve Furber
Amitava Majumdar的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Amitava Majumdar', 18)}}的其他基金
Collaborative Research: Frameworks: hpcGPT: Enhancing Computing Center User Support with HPC-enriched Generative AI
协作研究:框架:hpcGPT:通过 HPC 丰富的生成式 AI 增强计算中心用户支持
- 批准号:
2411297 - 财政年份:2024
- 资助金额:
$ 45.08万 - 项目类别:
Standard Grant
Category II: Exploring Neural Network Processors for AI in Science and Engineering
第二类:探索科学与工程中人工智能的神经网络处理器
- 批准号:
2005369 - 财政年份:2020
- 资助金额:
$ 45.08万 - 项目类别:
Cooperative Agreement
Collaborative Research: CIBR: Building Capacity for Data-driven Neuroscience Research
合作研究:CIBR:数据驱动神经科学研究能力建设
- 批准号:
1935749 - 财政年份:2020
- 资助金额:
$ 45.08万 - 项目类别:
Standard Grant
Collaborative Research: Frameworks: Designing Next-Generation MPI Libraries for Emerging Dense GPU Systems
协作研究:框架:为新兴密集 GPU 系统设计下一代 MPI 库
- 批准号:
1931450 - 财政年份:2019
- 资助金额:
$ 45.08万 - 项目类别:
Standard Grant
Promoting International Collaboration on Developing Scalable, Portable & Efficient HPC Software for Modern HPC Platforms
促进开发可扩展、便携的国际合作
- 批准号:
1849519 - 财政年份:2018
- 资助金额:
$ 45.08万 - 项目类别:
Standard Grant
SHF: Large: Collaborative Research: Next Generation Communication Mechanisms exploiting Heterogeneity, Hierarchy and Concurrency for Emerging HPC Systems
SHF:大型:协作研究:利用新兴 HPC 系统的异构性、层次结构和并发性的下一代通信机制
- 批准号:
1565336 - 财政年份:2016
- 资助金额:
$ 45.08万 - 项目类别:
Standard Grant
Bilateral BBSRC-NSF/BIO: Collaborative Research: ABI Development: Seamless Integration of Neuroscience Models and Tools with HPC - Easy Path to Supercomputing for Neuroscience
双边 BBSRC-NSF/BIO:合作研究:ABI 开发:神经科学模型和工具与 HPC 的无缝集成 - 神经科学超级计算的简单途径
- 批准号:
1458840 - 财政年份:2015
- 资助金额:
$ 45.08万 - 项目类别:
Standard Grant
BIGDATA: F: DKM: Collaborative Research: Scalable Middleware for Managing and Processing Big Data on Next Generation HPC Systems
BIGDATA:F:DKM:协作研究:用于在下一代 HPC 系统上管理和处理大数据的可扩展中间件
- 批准号:
1447861 - 财政年份:2014
- 资助金额:
$ 45.08万 - 项目类别:
Standard Grant
SHF: Large: Collaborative Research: Unified Runtime for Supporting Hybrid Programming Models on Heterogeneous Architecture.
SHF:大型:协作研究:支持异构架构上的混合编程模型的统一运行时。
- 批准号:
1213056 - 财政年份:2012
- 资助金额:
$ 45.08万 - 项目类别:
Standard Grant
Collaborative Research: ABI Development: Building A Community Resource for Neuroscientists
合作研究:ABI 开发:为神经科学家建立社区资源
- 批准号:
1146949 - 财政年份:2012
- 资助金额:
$ 45.08万 - 项目类别:
Standard Grant
相似国自然基金
Research on Quantum Field Theory without a Lagrangian Description
- 批准号:24ZR1403900
- 批准年份:2024
- 资助金额:0.0 万元
- 项目类别:省市级项目
Cell Research
- 批准号:31224802
- 批准年份:2012
- 资助金额:24.0 万元
- 项目类别:专项基金项目
Cell Research
- 批准号:31024804
- 批准年份:2010
- 资助金额:24.0 万元
- 项目类别:专项基金项目
Cell Research (细胞研究)
- 批准号:30824808
- 批准年份:2008
- 资助金额:24.0 万元
- 项目类别:专项基金项目
Research on the Rapid Growth Mechanism of KDP Crystal
- 批准号:10774081
- 批准年份:2007
- 资助金额:45.0 万元
- 项目类别:面上项目
相似海外基金
Collaborative Research: SI2-SSI: Expanding Volunteer Computing
合作研究:SI2-SSI:扩展志愿者计算
- 批准号:
2039142 - 财政年份:2020
- 资助金额:
$ 45.08万 - 项目类别:
Standard Grant
SI2-SSI: Collaborative Research: Einstein Toolkit Community Integration and Data Exploration
SI2-SSI:协作研究:Einstein Toolkit 社区集成和数据探索
- 批准号:
2114580 - 财政年份:2020
- 资助金额:
$ 45.08万 - 项目类别:
Continuing Grant
Collaborative Research: SI2-SSI: Expanding Volunteer Computing
合作研究:SI2-SSI:扩展志愿者计算
- 批准号:
2001752 - 财政年份:2019
- 资助金额:
$ 45.08万 - 项目类别:
Standard Grant
Collaborative Research: NISC SI2-S2I2 Conceptualization of CFDSI: Model, Data, and Analysis Integration for End-to-End Support of Fluid Dynamics Discovery and Innovation
合作研究:NISC SI2-S2I2 CFDSI 概念化:模型、数据和分析集成,用于流体动力学发现和创新的端到端支持
- 批准号:
1743178 - 财政年份:2018
- 资助金额:
$ 45.08万 - 项目类别:
Continuing Grant
Collaborative Research: NISC SI2-S2I2 Conceptualization of CFDSI: Model, Data, and Analysis Integration for End-to-End Support of Fluid Dynamics Discovery and Innovation
合作研究:NISC SI2-S2I2 CFDSI 概念化:模型、数据和分析集成,用于流体动力学发现和创新的端到端支持
- 批准号:
1743185 - 财政年份:2018
- 资助金额:
$ 45.08万 - 项目类别:
Continuing Grant
Collaborative Research: NISC SI2-S2I2 Conceptualization of CFDSI: Model, Data, and Analysis Integration for End-to-End Support of Fluid Dynamics Discovery and Innovation
合作研究:NISC SI2-S2I2 CFDSI 概念化:模型、数据和分析集成,用于流体动力学发现和创新的端到端支持
- 批准号:
1743180 - 财政年份:2018
- 资助金额:
$ 45.08万 - 项目类别:
Continuing Grant
Collaborative Research: NISC SI2-S2I2 Conceptualization of CFDSI: Model, Data, and Analysis Integration for End-to-End Support of Fluid Dynamics Discovery and Innovation
合作研究:NISC SI2-S2I2 CFDSI 概念化:模型、数据和分析集成,用于流体动力学发现和创新的端到端支持
- 批准号:
1743179 - 财政年份:2018
- 资助金额:
$ 45.08万 - 项目类别:
Continuing Grant
Collaborative Research: NISC SI2-S2I2 Conceptualization of CFDSI: Model, Data, and Analysis Integration for End-to-End Support of Fluid Dynamics Discovery and Innovation
合作研究:NISC SI2-S2I2 CFDSI 概念化:模型、数据和分析集成,用于流体动力学发现和创新的端到端支持
- 批准号:
1743191 - 财政年份:2018
- 资助金额:
$ 45.08万 - 项目类别:
Continuing Grant
Collaborative Research: SI2-SSE: WRENCH: A Simulation Workbench for Scientific Worflow Users, Developers, and Researchers
协作研究:SI2-SSE:WRENCH:面向科学 Worflow 用户、开发人员和研究人员的模拟工作台
- 批准号:
1642369 - 财政年份:2017
- 资助金额:
$ 45.08万 - 项目类别:
Standard Grant
Collaborative Research: SI2-SSI: Open Source Support for Massively Parallel, Generic Finite Element Methods
合作研究:SI2-SSI:对大规模并行、通用有限元方法的开源支持
- 批准号:
1741870 - 财政年份:2017
- 资助金额:
$ 45.08万 - 项目类别:
Standard Grant