CSR-PSCE, SM: MPI-PPA: Improving Efficiency of Large-Scale Clusters Through Statistical Performance Prediction

CSR-PSCE、SM:MPI-PPA:通过统计性能预测提高大规模集群的效率

基本信息

  • 批准号:
    0936251
  • 负责人:
  • 金额:
    $ 30.5万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Continuing Grant
  • 财政年份:
    2009
  • 资助国家:
    美国
  • 起止时间:
    2009-01-01 至 2013-08-31
  • 项目状态:
    已结题

项目摘要

This project develops a system that improves parallel efficiency on large numbers of processors - up to tens or hundreds of thousands - without running a program at scale. This system is called MPI-PPA: MPI Performance Prediction and Advisement. MPI-PPA takes as input a scientific computing application along with the input variables, including the desired number of processors, p. With executions on fewer than p processors only - so that these executions will occur quickly - MPI-PPA will produce a list of program phases that are predicted to achieve poor scalability, allowing the programmer to quickly address and possibly re-implement these phases - as well as a prediction for the entire program run.MPI-PPA makes these predictions using statistical regression to develop a prediction function that can be used with any number of processors. MPI-PPA will not require significant program comprehension, an important aspect when considering that computational scientists are typically experts in their scientific domain and not in computer science. The approach of MPI-PPA involves heavy reliance on statistical techniques, so the work in this project will be interdisciplinary between computer science (the PI) and statistics (the co-PI). MPI-PPA will be validated by using benchmark suites such as NAS and ASCI codes, along with large-scale applications - such as Paradis and Raptor - that are of interest to national labs.The broader impact of this work is multifold. First, MPI-PPA will be beneficial for computational scientists as well as cluster administrators. Among the benefits will be a simple and fast performance tuning system, an increase in overall cluster efficiency, and a reduction in response times for individual applications. The technology developed in this project will be transferred, in the form of performance tuning and prediction software, and made available to the public through cooperation with Lawrence Livermore National Laboratory. Second, more interdisciplinary interaction between statistics and computer science will be fostered through the supervised statistical consulting center at the University of Georgia. Third, efforts will continue recruiting students from strong historically black colleges and universities in the area, such as Morehouse University.
这个项目开发了一个系统,可以在大量处理器上提高并行效率-多达数万或数十万-而不需要大规模运行程序。这个系统被称为MPI-PPA:MPI性能预测和建议。MPI-PPA将科学计算应用程序连同包括所需处理器数量在内的输入变量一起作为输入。MPI-PPA只在少于p个处理器上执行--以便这些执行将快速发生--MPI-PPA将产生一个程序阶段列表,这些阶段被预测为实现较差的可伸缩性,从而允许程序员快速处理并可能重新实现这些阶段-以及对整个程序运行的预测。MPI-PPA将不需要大量的程序理解,这是一个重要的方面,因为考虑到计算科学家通常是其科学领域的专家,而不是计算机科学。MPI-PPA的方法涉及到对统计技术的严重依赖,因此本项目的工作将是计算机科学(PI)和统计学(co-PI)之间的交叉学科。MPI-PPA将通过使用NAS和ASCI代码等基准套件以及国家实验室感兴趣的大规模应用程序(如Paradis和Raptor)进行验证。这项工作的更广泛影响是多方面的。首先,MPI-PPA对计算科学家和集群管理员都是有益的。其中的好处包括简单快速的性能调优系统、整体集群效率的提高以及单个应用程序的响应时间缩短。该项目开发的技术将以性能调整和预测软件的形式转让,并通过与劳伦斯·利弗莫尔国家实验室的合作向公众提供。其次,将通过佐治亚大学受监督的统计咨询中心促进统计学和计算机科学之间更多的跨学科互动。第三,将继续努力从该地区历史上实力雄厚的黑人学院和大学招生,如莫尔豪斯大学。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

David Lowenthal其他文献

COMO CONHECEMOS O PASSADO
科莫·科赫西莫斯·奥帕萨多
  • DOI:
  • 发表时间:
    1998
  • 期刊:
  • 影响因子:
    0
  • 作者:
    David Lowenthal;Tradução Lúcia Haddad;Revisão técnica Mariana Maluf
  • 通讯作者:
    Revisão técnica Mariana Maluf
Cardiac Response to Exercise in Health and Disease
健康和疾病中心脏对运动的反应
  • DOI:
    10.1055/s-2007-1006312
  • 发表时间:
    1993
  • 期刊:
  • 影响因子:
    0
  • 作者:
    David Lowenthal;Michael Pollock
  • 通讯作者:
    Michael Pollock
A case report of Tubulo-Interstitial Nephritis with Uveitis (TINU syndrome) and follow-up for one year
  • DOI:
    10.1023/a:1025657713078
  • 发表时间:
    2002-01-01
  • 期刊:
  • 影响因子:
    1.900
  • 作者:
    Chadi Alkhalil;Fawad A. Tanvir;Abdurahman Ahmed;David Lowenthal
  • 通讯作者:
    David Lowenthal
From harmony of the spheres to national anthem: Reflections on musical heritage
  • DOI:
    10.1007/s10708-006-0008-y
  • 发表时间:
    2006-02-01
  • 期刊:
  • 影响因子:
    1.900
  • 作者:
    David Lowenthal
  • 通讯作者:
    David Lowenthal
Social Origins of Dictatorship and Democracy: Lord and Peasant in the Making of the Modern World
独裁与民主的​​社会根源:现代世界形成中的地主与农民
  • DOI:
    10.2307/2575331
  • 发表时间:
    1967
  • 期刊:
  • 影响因子:
    0
  • 作者:
    David Lowenthal;Barrington. Moore
  • 通讯作者:
    Barrington. Moore

David Lowenthal的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('David Lowenthal', 18)}}的其他基金

Collaborative Research: SHF: Medium: Co-Optimizing Computation and Data Transformations for Sparse Tensors
协作研究:SHF:中:稀疏张量的协同优化计算和数据转换
  • 批准号:
    2106621
  • 财政年份:
    2022
  • 资助金额:
    $ 30.5万
  • 项目类别:
    Continuing Grant
Collaborative Research: OAC Core: Improving Utilization of High-Performance Computing Systems via Intelligent Co-scheduling
合作研究:OAC Core:通过智能协同调度提高高性能计算系统的利用率
  • 批准号:
    2103511
  • 财政年份:
    2021
  • 资助金额:
    $ 30.5万
  • 项目类别:
    Standard Grant
CSR: Rethinking System Software for Overprovisioned, High-Performance Computing Systems
CSR:重新思考用于过度配置的高性能计算系统的系统软件
  • 批准号:
    1526015
  • 财政年份:
    2015
  • 资助金额:
    $ 30.5万
  • 项目类别:
    Standard Grant
CSR: Small:Conductor: A Run-Time System for Exascale Computing
CSR:Small:Conductor:用于百亿亿次计算的运行时系统
  • 批准号:
    1216829
  • 财政年份:
    2012
  • 资助金额:
    $ 30.5万
  • 项目类别:
    Standard Grant
CSR-PSCE, SM: MPI-PPA: Improving Efficiency of Large-Scale Clusters Through Statistical Performance Prediction
CSR-PSCE、SM:MPI-PPA:通过统计性能预测提高大规模集群的效率
  • 批准号:
    0834356
  • 财政年份:
    2008
  • 资助金额:
    $ 30.5万
  • 项目类别:
    Continuing Grant
Collaborative Research: Efficient Detection and Alleviation of Scalability Problems
协作研究:有效检测和缓解可扩展性问题
  • 批准号:
    0429285
  • 财政年份:
    2004
  • 资助金额:
    $ 30.5万
  • 项目类别:
    Standard Grant
SOFTWARE: Heterogeneous Cluster MPI: A System for Out-Of-Core, Heterogeneous Data Distribution
软件:异构集群 MPI:核外异构数据分发系统
  • 批准号:
    0234285
  • 财政年份:
    2003
  • 资助金额:
    $ 30.5万
  • 项目类别:
    Continuing Grant
Instrumentation Grant for Research in Parallel and Distributed Computing
用于并行和分布式计算研究的仪器补助金
  • 批准号:
    9986032
  • 财政年份:
    2000
  • 资助金额:
    $ 30.5万
  • 项目类别:
    Standard Grant
Career: An Integrated Compiler/Run-Time System for Global Data Distribution
职业生涯:用于全球数据分发的集成编译器/运行时系统
  • 批准号:
    9733063
  • 财政年份:
    1998
  • 资助金额:
    $ 30.5万
  • 项目类别:
    Continuing Grant

相似海外基金

Collaborative Research: CSR-PSCE, SM: Adaptive Memory Management in Shared Environments
合作研究:CSR-PSCE、SM:共享环境中的自适应内存管理
  • 批准号:
    0834323
  • 财政年份:
    2008
  • 资助金额:
    $ 30.5万
  • 项目类别:
    Continuing Grant
CSR-PSCE,SM: Trade-offs Between Static Power, Performance and Reliability in Future Chip Multiprocessors
CSR-PSCE,SM:未来芯片多处理器静态功耗、性能和可靠性之间的权衡
  • 批准号:
    0834799
  • 财政年份:
    2008
  • 资助金额:
    $ 30.5万
  • 项目类别:
    Standard Grant
CSR-PSCE,SM: Recovery Aware Parallel Computing
CSR-PSCE,SM:恢复感知并行计算
  • 批准号:
    0834514
  • 财政年份:
    2008
  • 资助金额:
    $ 30.5万
  • 项目类别:
    Continuing Grant
CSR-PSCE,SM: A Holistic Design Approach to Reliability Using 3D Stacked
CSR-PSCE,SM:使用 3D 堆叠的可靠性整体设计方法
  • 批准号:
    0834798
  • 财政年份:
    2008
  • 资助金额:
    $ 30.5万
  • 项目类别:
    Standard Grant
CSR-PSCE, SM: Automatic Multithreaded and Transactional Memory Workload Synthesis for Efficient Multi-core Design Space Evaluation
CSR-PSCE、SM:自动多线程和事务性内存工作负载合成,用于高效的多核设计空间评估
  • 批准号:
    0834288
  • 财政年份:
    2008
  • 资助金额:
    $ 30.5万
  • 项目类别:
    Standard Grant
Collaborative Research: CSR-PSCE, SM: Memory Thermal Management for Multi-Core Systems
合作研究:CSR-PSCE、SM:多核系统的内存热管理
  • 批准号:
    0834475
  • 财政年份:
    2008
  • 资助金额:
    $ 30.5万
  • 项目类别:
    Standard Grant
CSR-PSCE, SM: Memory Management Innovations for Next-Generation SMP
CSR-PSCE、SM:下一代 SMP 的内存管理创新
  • 批准号:
    0834619
  • 财政年份:
    2008
  • 资助金额:
    $ 30.5万
  • 项目类别:
    Continuing Grant
CSR-PSCE,SM: Compiler-Directed System Optimization of a Highly-Parallel Fine-Grained Chip Multiprocessor
CSR-PSCE,SM:高度并行细粒度芯片多处理器的编译器导向系统优化
  • 批准号:
    0834373
  • 财政年份:
    2008
  • 资助金额:
    $ 30.5万
  • 项目类别:
    Continuing Grant
Collaborative Research: CSR-PSCE, SM: Memory Thermal Management for Multi-Core Systems
合作研究:CSR-PSCE、SM:多核系统的内存热管理
  • 批准号:
    0834469
  • 财政年份:
    2008
  • 资助金额:
    $ 30.5万
  • 项目类别:
    Standard Grant
CSR-PSCE, SM: Recording and Deterministically Replaying Shared-memory Multiprocessor Execution Efficiently
CSR-PSCE、SM:高效记录和确定性重放共享内存多处理器执行
  • 批准号:
    0834738
  • 财政年份:
    2008
  • 资助金额:
    $ 30.5万
  • 项目类别:
    Continuing Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了