CI-P: REFT - A Reconfigurable Execution Framework Testbed for data-driven and extreme scale computing

CI-P:REFT - 用于数据驱动和超大规模计算的可重新配置执行框架测试台

基本信息

  • 批准号:
    1205518
  • 负责人:
  • 金额:
    $ 9.88万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2012
  • 资助国家:
    美国
  • 起止时间:
    2012-06-01 至 2015-05-31
  • 项目状态:
    已结题

项目摘要

The field of supercomputing is experiencing a rapid change in system structure, programming models, and software environments in response to advances in application requirements and in underlying enabling technologies. Traditional parallel programming approaches have relied on static resource allocation and task scheduling through programming interfaces such as MPI and OpenMP. These methods are reaching their efficiency and scalability limits on the new emerging classes of systems, spurring the creation of innovative dynamic strategies and software tools, including advanced runtime system software and programming interfaces that use them. To accelerate adoption of these next-generation methods, a unique environment is being created and operated that provides a comprehensive ensemble of state-of-the-art runtime system software and programming interfaces. Taken from previous research and development projects, some at the host institution, Indiana University, and others from premiere research organizations across the nation, these execution systems are integrated in a single supported Reconfigurable Execution Framework Testbed (REFT) and made available to parallel application algorithm developers as well as researchers in advanced tools for parallel computing. The basic REFT hardware capabilities include a medium- scale heterogeneous Linux cluster with multi-core sockets, high-bandwidth interconnect, and mass storage; field-programmable gate arrays; and instrumentation for power measurement. ParalleX-based HPX-3, ETI SWARM, Berkeley GasNet, Rice University?s Habanero, Illinois? Charm++, Cray Chapel, IBM X-10, and UPC among other programming and execution models comprise the major components of this unique facility.Supercomputing is making a sharp corner turn in form, function, and methodologies. Unfortunately, few in the field are skilled in the use of the emerging execution and programming models that are becoming increasingly critical to effectively utilizing supercomputers to deliver quality science for extreme-scale applications?either those at the highest end of the performance spectrum (Petaflops currently and Exaflops at the end of the decade) or strong-scaled fixed-size problems. REFT serves the NSF computational science community by dramatically lowering the barrier to training, experimentation, and adoption of new dynamic execution methods and systems. It provides full documentation, on-line tutorials, in-house classes, and workshops for skill development and community building for the broad US HPC community to accelerate application, evaluation, and exploitation. As a repository for competing and complementary software environments it provides a single site for conducting comparative studies by end-users to establish best practices. As an NSF resource, it serves to expedite and further goals of computational science by enabling effective application of the next generation Petaflops-class computer systems of millions of cores and eventual Exascale systems with billion-way concurrency.
超级计算领域正在经历系统结构、编程模型和软件环境的快速变化,以响应应用需求和底层使能技术的进步。传统的并行编程方法依赖于静态资源分配和任务调度,通过编程接口,如MPI和OpenMP。这些方法在新出现的系统类别上达到了它们的效率和可扩展性限制,从而刺激了创新的动态策略和软件工具的创建,包括高级运行时系统软件和使用它们的编程接口。为了加速这些下一代方法的采用,正在创建和操作一个独特的环境,该环境提供了最先进的运行时系统软件和编程接口的全面集成。从以前的研究和开发项目,一些在主办机构,印第安纳州大学,和其他来自全国各地的首要研究组织,这些执行系统集成在一个单一的支持可重构执行框架测试床(REFT),并提供给并行应用程序算法开发人员以及研究人员在先进的并行计算工具。基本的REFT硬件功能包括具有多核插槽、高带宽互连和大容量存储的中等规模异构Linux集群;现场可编程门阵列;以及用于功率测量的仪器。基于ParalleX的HPX-3、ETI SWARM、伯克利GasNet、莱斯大学?伊利诺伊州的哈巴内罗?Charm++、Cray Chapel、IBM X-10和UPC以及其他编程和执行模型构成了这一独特设施的主要组件。超级计算在形式、功能和方法上正在发生急剧的转变。 不幸的是,在该领域很少有人熟练使用新兴的执行和编程模型,这些模型对于有效利用超级计算机为极端规模的应用提供高质量的科学变得越来越重要。无论是那些在性能频谱的最高端(目前的千万亿次浮点运算和十年结束时的Exaflops)或强规模的固定大小的问题。REFT服务于NSF计算科学社区,大大降低了培训,实验和采用新的动态执行方法和系统的障碍。它为广泛的美国HPC社区提供完整的文档、在线教程、内部课程和技能开发和社区建设研讨会,以加速应用、评估和开发。作为相互竞争和互补的软件环境的存放处,它为最终用户进行比较研究以确定最佳做法提供了一个单一的网站。作为NSF的资源,它有助于加速和进一步的计算科学的目标,使下一代千万亿次浮点级计算机系统的有效应用程序的数百万个核心和最终的Exascale系统与十亿路并发。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Thomas Sterling其他文献

Productivity Metrics and Models
生产力指标和模型
  • DOI:
  • 发表时间:
    2004
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Thomas Sterling
  • 通讯作者:
    Thomas Sterling
An Empirical Evaluation of the Convex SPP-1000 Hierarchical Shared Memory System
Halo World: Tools for Parallel Cluster Finding in Astrophysical N-body Simulations
  • DOI:
    10.1023/a:1009729602855
  • 发表时间:
    1997-12-01
  • 期刊:
  • 影响因子:
    4.300
  • 作者:
    David W. Pfitzner;John K. Salmon;Thomas Sterling
  • 通讯作者:
    Thomas Sterling
InfiniBand: The “De Facto” Future Standard for System and Local Area Networks or Just a Scalable Replacement for PCI Buses?

Thomas Sterling的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Thomas Sterling', 18)}}的其他基金

I-Corps: Low False Negative 3-D Facial Recognition
I-Corps:低误报 3D 面部识别
  • 批准号:
    1830782
  • 财政年份:
    2018
  • 资助金额:
    $ 9.88万
  • 项目类别:
    Standard Grant
EAGER:Real-time Semantics for the ParalleX Execution Model to Enable Single-Image Multicore Embedded Computing
EAGER:ParalleX 执行模型的实时语义以实现单图像多核嵌入式计算
  • 批准号:
    1352969
  • 财政年份:
    2013
  • 资助金额:
    $ 9.88万
  • 项目类别:
    Standard Grant
HPC Education Through Formal On-line On-Demand Curriculum
通过正式的在线点播课程进行 HPC 教育
  • 批准号:
    1349527
  • 财政年份:
    2013
  • 资助金额:
    $ 9.88万
  • 项目类别:
    Standard Grant
SHF: Large: Collaborative Research: PXGL: Cyberinfrastructure for Scalable Graph Execution
SHF:大型:协作研究:PXGL:可扩展图形执行的网络基础设施
  • 批准号:
    1160602
  • 财政年份:
    2011
  • 资助金额:
    $ 9.88万
  • 项目类别:
    Continuing Grant
SHF: Large: Collaborative Research: PXGL: Cyberinfrastructure for Scalable Graph Execution
SHF:大型:协作研究:PXGL:可扩展图形执行的网络基础设施
  • 批准号:
    1111676
  • 财政年份:
    2011
  • 资助金额:
    $ 9.88万
  • 项目类别:
    Continuing Grant
Increasing Student Participation in Cluster Computing through IEEE Cluster 2009 Attendance
通过出席 IEEE Cluster 2009 提高学生对集群计算的参与
  • 批准号:
    0936014
  • 财政年份:
    2009
  • 资助金额:
    $ 9.88万
  • 项目类别:
    Standard Grant
A System Architecture Point Design Study for Exascale Computing
百万兆级计算的系统架构点设计研究
  • 批准号:
    0836755
  • 财政年份:
    2008
  • 资助金额:
    $ 9.88万
  • 项目类别:
    Standard Grant
Collaborative Research: A Study and Implementation of Semantic Constructs for Highly Scalable Leading-edge Scientific Computing
协作研究:高度可扩展的前沿科学计算的语义结构的研究和实现
  • 批准号:
    0833193
  • 财政年份:
    2008
  • 资助金额:
    $ 9.88万
  • 项目类别:
    Standard Grant
National Dissemination of HPC Introductory Education through Multimedia
通过多媒体在全国传播 HPC 入门教育
  • 批准号:
    0634064
  • 财政年份:
    2007
  • 资助金额:
    $ 9.88万
  • 项目类别:
    Standard Grant
WORKSHOP: The 2nd Workshop on Enabling Technologies for Peta(fl)ops Computing to be held February 15-19, 1999 in Santa Barbara, CA at the Doubletree Hotel
研讨会:第二届 Peta(fl)ops 计算支持技术研讨会将于 1999 年 2 月 15 日至 19 日在加利福尼亚州圣巴巴拉 Doubletree 酒店举行
  • 批准号:
    9816114
  • 财政年份:
    1999
  • 资助金额:
    $ 9.88万
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了