CC* Compute: A Cost-Effective, 2,048 Core InfiniBand Cluster at UTC for Campus Research and Education
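Basic information
- Award number: 1925603
- Principal investigator:
- Amount: $392,200
- Host institution:
- Host institution country: United States
- Project type: Standard Grant
- Fiscal year: 2019
- Funding country: United States
- Period: 2019-07-01 to 2022-06-30
- Project status: Completed
- Source:
- Keywords: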
Project abstract
A team of researchers at the University of Tennessee at Chattanooga (UTC) will make a significant upgrade to the campus cyberinfrastructure that will provide state-of-the-art, cost-effective high-performance computing not previously possible. This project will significantly improve university researchers' and students' ability to perform, enhance, and expand their current computationally-intensive research, prototyping, and development activities and will complement other investments already made, in-progress, or on the plan-of-record of UTC, including access to commercial cloud computing services. In addition to computer science and engineering, the UTC team anticipates significant research projects in mathematics, hydrology and computational fluid dynamics which will engage four regional partner universities. Two teaching projects address HPC education and use of HPC for mechanical engineering undergraduate research/design. In addition to these funded projects, merited additional research projects are enabled over time as the PIs, Central IT, and the cluster's Advisory Board attract and onboard additional researchers and students requiring HPC. Among other users are the more than 20 computational science Ph.D. students, plus several postdocs. Furthermore, SimCenter---UTC's research computing hub---supports undergraduate research through self-funding and REU in HPC, providing additional users for the proposed cluster. This award allows the University of Tennessee at Chattanooga (UTC) to procure an innovative, 2,048-compute core, 16-server AMD EPYC2 cluster networked with 100Gbit/s InfiniBand plus 8TB of main memory and 77 Tflop/s of double-precision floating point arithmetic. EPYC2 Rome 7nm processors will be newly available at or near the start of the period of performance, so this project includes state-of-the-art, cost-effective, high-performance computing not previously possible using Intel or AMD processors. 
The university invested in a "commodity" cluster as recently as three years ago, and it is heavily utilized by the existing user base. This system will be nearly four years old by the beginning of this proposed grant. By way of complement, upgrades to storage (1.1PB), internal networking, data center infrastructure, and private cloud virtualization (coming online by mid-2019) have prepared UTC to support a new campus-wide cluster with a growing number of users in addition to those named here. The proposed new campus cluster will enable core scales and total cluster memory not previously available on campus and thus help researchers prepare their scalable problem scenarios for greater scales on national resources such as XSEDE. Projects enabled immediately are 14 science driver projects (12 research, two teaching). Seven projects involve four regional partner universities. At least ten NSF grants at UTK, UTC, UAB, Tennessee Tech, and Ole Miss are enhanced. Project areas highlighted include fault-tolerant parallel computing, performance monitoring of HPC, next-generation parallel programming with MPI, special-purpose linear algebra, hydrology, and computational fluid dynamics research. This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
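The hardware figures quoted in the abstract (2,048 cores, 16 servers, 8TB of memory, 77 Tflop/s) can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes 16 double-precision floating-point operations per core per cycle (two 256-bit fused multiply-add units) and a binary terabyte of 1,024 GB; neither value is stated in the abstract.

```python
# Back-of-envelope check of the cluster figures quoted in the abstract.

# Figures taken from the abstract
CORES = 2048
SERVERS = 16
MEMORY_TB = 8
PEAK_TFLOPS = 77.0

# Assumptions, NOT stated in the abstract
FLOPS_PER_CYCLE = 16   # double-precision flops per core per cycle
GB_PER_TB = 1024       # binary convention for memory sizes

cores_per_server = CORES // SERVERS
memory_per_server_gb = MEMORY_TB * GB_PER_TB / SERVERS
memory_per_core_gb = MEMORY_TB * GB_PER_TB / CORES

# Peak rate per core, then the clock frequency that rate would imply
gflops_per_core = PEAK_TFLOPS * 1000 / CORES
implied_clock_ghz = gflops_per_core / FLOPS_PER_CYCLE

print(f"cores per server:      {cores_per_server}")
print(f"memory per server:     {memory_per_server_gb:.0f} GB")
print(f"memory per core:       {memory_per_core_gb:.0f} GB")
print(f"peak per core:         {gflops_per_core:.1f} Gflop/s")
print(f"implied clock:         {implied_clock_ghz:.2f} GHz")
```

Under these assumptions the figures work out to 128 cores and 512 GB per server, 4 GB per core, and an implied clock of about 2.35 GHz. The 128 cores per server would be consistent with two 64-core processors in each server, although the abstract does not state the socket count.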
Project outcomes
Journal articles (2)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Implementation and evaluation of MPI 4.0 partitioned communication libraries
- DOI: 10.1016/j.parco.2021.102827
- Year published: 2021
- Journal:
- Impact factor: 1.4
- Authors: Dosanjh, Matthew G.F.; Worley, Andrew; Schafer, Derek; Soundararajan, Prema; Ghafoor, Sheikh; Skjellum, Anthony; Bangalore, Purushotham V.; Grant, Ryan E.
- Corresponding author: Grant, Ryan E.
Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives
- DOI: 10.1145/3458744.3474046
- Year published: 2021
- Journal:
- Impact factor: 0
- Authors: Worley, Andrew; Soundararajan, Prema; Schafer, Derek; Bangalore, Purushotham; Grant, Ryan; Dosanjh, Matthew; Skjellum, Anthony; Ghafoor, Sheikh
- Corresponding author: Ghafoor, Sheikh
Other publications by Anthony Skjellum
Understanding GPU Triggering APIs for MPI+X Communication
- DOI:
- Year published: 2024
- Journal:
- Impact factor: 0
- Authors: Patrick G. Bridges; Anthony Skjellum; E. Suggs; Derek Schafer; P. Bangalore
- Corresponding author: P. Bangalore
Other grants held by Anthony Skjellum
SPX: Collaborative Research: Intelligent Communication Fabrics to Facilitate Extreme Scale Computing
- Award number: 2412182
- Fiscal year: 2023
- Amount: $392,200
- Project type: Standard Grant
Collaborative Research: EAGER: Real-time Strategies and Synchronized Time Distribution Mechanisms for Enhanced Exascale Performance-Portability and Predictability
- Award number: 2405142
- Fiscal year: 2023
- Amount: $392,200
- Project type: Standard Grant
Beginnings: Creating and Sustaining a Diverse Community of Expertise in Quantum Information Science (EQUIS) Across the Southeastern United States
- Award number: 2414461
- Fiscal year: 2023
- Amount: $392,200
- Project type: Cooperative Agreement
Collaborative Research: EAGER: Real-time Strategies and Synchronized Time Distribution Mechanisms for Enhanced Exascale Performance-Portability and Predictability
- Award number: 2151020
- Fiscal year: 2022
- Amount: $392,200
- Project type: Standard Grant
CC* Networking Infrastructure: Advancing High-speed Networking at UTC for Research and Education
- Award number: 1925598
- Fiscal year: 2019
- Amount: $392,200
- Project type: Standard Grant
SPX: Collaborative Research: Intelligent Communication Fabrics to Facilitate Extreme Scale Computing
- Award number: 1918987
- Fiscal year: 2019
- Amount: $392,200
- Project type: Standard Grant
Collaborative Research: Software Engineering Workforce Development in High Performance Computing for Digital Twins
- Award number: 1935628
- Fiscal year: 2019
- Amount: $392,200
- Project type: Standard Grant
Collaborative Research: CICI: Regional: SouthEast SciEntific Cybersecurity for University Research (SouthEast SECURE)
- Award number: 1812404
- Fiscal year: 2017
- Amount: $392,200
- Project type: Standard Grant
SHF: Medium: Collaborative Research: Next-Generation Message Passing for Parallel Programming: Resiliency, Time-to-Solution, Performance-Portability, Scalability, and QoS
- Award number: 1822191
- Fiscal year: 2017
- Amount: $392,200
- Project type: Continuing Grant
SHF: Small: Collaborative Research: Coupling Computation and Communication in FPGA-Enhanced Clouds and Clusters
- Award number: 1821431
- Fiscal year: 2017
- Amount: $392,200
- Project type: Standard Grant
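Similar grants from the National Natural Science Foundation of China
Social cost of carbon emissions in China under multiple uncertainties: calculation and policy applications from a risk-learning perspective
- Award number: 72303217
- Year approved: 2023
- Amount: 300,000 CNY
- Project type: Young Scientists Fund
Research on proximity detection techniques for road networks under resource constraints
- Award number: 61901052
- Year approved: 2019
- Amount: 235,000 CNY
- Project type: Young Scientists Fund
Research on key resource management issues in mobile edge computing offloading services
- Award number: 61702287
- Year approved: 2017
- Amount: 240,000 CNY
- Project type: Young Scientists Fund
Design of low-cost hard metal compounds based on multiscale simulation of structure and mechanical properties
- Award number: 51701152
- Year approved: 2017
- Amount: 210,000 CNY
- Project type: Young Scientists Fund
Fundamental research on alloy design and asymmetric rolling deformation of low-cost, high-strength, high-ductility magnesium alloy sheet
- Award number: U1610253
- Year approved: 2016
- Amount: 2,880,000 CNY
- Project type: Joint Fund Project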
Similar overseas grants
A novel algorithm to compute adherence from electronic adherence monitoring devices
- Award number: 10698066
- Fiscal year: 2022
- Amount: $392,200
- Project type:
A novel algorithm to compute adherence from electronic adherence monitoring devices
- Award number: 10516828
- Fiscal year: 2022
- Amount: $392,200
- Project type:
An adaptive compute solution for characterizing macromolecular complexes by mass spectrometry with electron-based fragmentation
- Award number: 10581698
- Fiscal year: 2020
- Amount: $392,200
- Project type:
An adaptive compute solution for characterizing macromolecular complexes by mass spectrometry with electron-based fragmentation
- Award number: 10480227
- Fiscal year: 2020
- Amount: $392,200
- Project type: