Category I: Anvil - A National Composable Advanced Computational Resource for the Future of Science and Engineering

第一类:Anvil - 面向科学与工程未来的国家级可组合高级计算资源

基本信息

  • 批准号:
    2005632
  • 负责人:
  • 金额:
    $ 995.22万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Cooperative Agreement
  • 财政年份:
    2020
  • 资助国家:
    美国
  • 起止时间:
    2020-10-01 至 2026-09-30
  • 项目状态:
    未结题

项目摘要

As computing permeates nearly all fields of science and engineering, there is an exponential growth of computing needs from both the traditional computing-intensive domains and the emerging new and more diverse fields of research. The rise of machine learning and artificial intelligence applications has accelerated and broadened the use of computational resources from research in creating new and more environmentally friendly materials to improving medicine in our fight against deadly diseases. There are three main challenges to meeting this rapidly evolving landscape of national computational needs: a shortage of capacity, increasingly diverse applications, and computational literacy and training. This project aims to meet these challenges and transform the way computing is delivered by developing and deploying a composable advanced computing resource, Anvil, to the national research community to significantly increase both the computing capacity and accessibility. Anvil integrates a large-capacity high-performance computing (HPC) cluster with a comprehensive ecosystem of software, access interfaces, programming environments, and composable services to form a seamless environment able to support a broad range of current and future science and engineering applications. Through a carefully designed student training program and partnerships with regional and other universities, XSEDE, and Women in HPC programs, this project will develop computing competency in the next-generation workforce, and engage and train a broader audience including underrepresented students at minority-serving and EPSCoR (Established Program to Stimulate Competitive Research) institutions.Built with a forward-looking architecture with a high core count, and improved memory bandwidth and I/O, Anvil can effectively support traditional HPC with fast turnaround for high throughput, mid-scale computation jobs. Anvil consists of 1000 128-core computing nodes based on the next-generation AMD Epyc “Milan" architecture that can deliver a total peak performance of 5.3 Petaflops. Each node has 256 GB of memory, and a 100 gigabits/second bandwidth from the Mellanox HDR InfiniBand interconnect, allowing multiple jobs of up to 1024 cores to be run at full speed over the interconnect fabric. These nodes are complemented by 32 large-memory nodes with 1 TB of RAM each, and 16 Nvidia GPU nodes with 4 “Volta Next” GPUs per node. The GPU nodes are capable of 1.57 petaflops of single-precision performance to support machine learning and a wide range of current and future science and engineering applications. Anvil’s multiple tiers of storage systems include a long-term archive, persistent file and campaign storage, a 10 PB scratch file system, a 3 PB flash burst buffer, and object storage to support a variety of workflows and storage needs. Anvil will lower the barrier to entry to advanced computing CI by providing interactive computing and desktop environments that ease the transition for users from diverse domains new to HPC. By providing feature-rich interactive environments such as Open OnDemand and ThinLinc, users can rapidly become productive on Anvil through Linux and Windows desktops, or familiar tools through their browser (e.g., Jupyter, RStudio). Complex scientific software environments and application stacks will be supported via containers orchestrated within a powerful composable subsystem. Anvil supports cloud-bursting of computational workloads as well as use of public cloud machine learning platforms including GPU and FPGA accelerators and software tools to automate hyperparameter tuning and algorithm selection for exploratory ML research. An existing production-quality science gateway at Purdue will support XSEDE researchers to share their data and tools online and facilitate easy access to Anvil and other XSEDE resources in classroom instruction and training activities.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
随着计算渗透到几乎所有的科学和工程领域,无论是传统的计算密集型领域还是新兴的、更多样化的研究领域,计算需求都呈指数级增长。机器学习和人工智能应用的兴起加速并扩大了计算资源的使用,从研究创造新的更环保的材料到改善我们对抗致命疾病的医学。要满足这种迅速发展的国家计算需求,有三个主要挑战:能力短缺、应用日益多样化以及计算素养和培训。该项目旨在应对这些挑战,并通过开发和部署可组合的先进计算资源Anvil来改变计算的交付方式,以显著提高计算能力和可访问性。Anvil将大容量高性能计算(HPC)集群与软件、访问接口、编程环境和可组合服务的综合生态系统集成在一起,形成一个无缝的环境,能够支持广泛的当前和未来的科学和工程应用。通过精心设计的学生培训计划以及与地区和其他大学、XSEDE和HPC项目中的女性合作,该项目将培养下一代劳动力的计算能力,并吸引和培训更广泛的受众,包括少数族裔服务机构和EPSCoR(建立计划以刺激竞争性研究)机构中代表性不足的学生。Anvil采用具有前瞻性的架构,具有高核数,改进的内存带宽和I/O,可以有效地支持传统的高性能计算,具有快速周转的高吞吐量,中等规模的计算任务。Anvil由1000个基于下一代AMD Epyc“米兰”架构的128核计算节点组成,可提供5.3千万亿次的总峰值性能。每个节点具有256gb内存,以及来自Mellanox HDR InfiniBand互连的100gb /秒带宽,允许在互连结构上全速运行多达1024个内核的多个作业。这些节点有32个大内存节点,每个节点有1tb的RAM, 16个Nvidia GPU节点,每个节点有4个“Volta Next”GPU。GPU节点具有每秒1.57千万亿次的单精度性能,可支持机器学习以及广泛的当前和未来科学与工程应用。Anvil的多层存储系统包括长期存档、持久文件和活动存储、10pb临时文件系统、3pb闪存突发缓冲区和对象存储,以支持各种工作流程和存储需求。Anvil将通过提供交互式计算和桌面环境来降低进入高级计算CI的门槛,从而使不同领域的用户轻松过渡到高性能计算。通过提供功能丰富的交互环境,如Open OnDemand和ThinLinc,用户可以通过Linux和Windows桌面,或通过浏览器(如Jupyter, RStudio)熟悉的工具,在Anvil上快速提高生产力。复杂的科学软件环境和应用程序栈将通过在强大的可组合子系统中编排的容器来支持。Anvil支持云爆发计算工作负载,以及使用公共云机器学习平台,包括GPU和FPGA加速器和软件工具,以自动进行超参数调优和算法选择,以进行探索性机器学习研究。普渡大学现有的生产质量科学门户将支持XSEDE研究人员在线共享他们的数据和工具,并促进在课堂教学和培训活动中轻松访问Anvil和其他XSEDE资源。该奖项反映了美国国家科学基金会的法定使命,并通过使用基金会的知识价值和更广泛的影响审查标准进行评估,被认为值得支持。

项目成果

期刊论文数量(4)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Understanding Factors that Influence Research Computing and Data Careers
了解影响研究计算和数据职业的因素
Defining Performance of Scientific Application Workloads on the AMD Milan Platform
定义 AMD Milan 平台上科学应用程序工作负载的性能
  • DOI:
    10.1145/3437359.3465596
  • 发表时间:
    2021
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Wu, Tsai-Wei;Lien Harrell, Stephen;Lentner, Geoffrey;Younts, Alex;Weekly, Sam;Mertes, Zoey;Maji, Amiya;Smith, Preston;Zhu, Xiao
  • 通讯作者:
    Zhu, Xiao
Anvil - System Architecture and Experiences from Deployment and Early User Operations
Anvil - 系统架构以及部署和早期用户运营的经验
Cyberinfrastructure for sustainability sciences
可持续科学的网络基础设施
  • DOI:
    10.1088/1748-9326/acd9dd
  • 发表时间:
    2023
  • 期刊:
  • 影响因子:
    6.7
  • 作者:
    Song, Carol X.;Merwade, Venkatesh;Wang, Shaowen;Witt, Michael;Kumar, Vipin;Irwin, Elena;Zhao, Lan;Walton, Amy
  • 通讯作者:
    Walton, Amy
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Xiaohui Carol Song其他文献

Xiaohui Carol Song的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Xiaohui Carol Song', 18)}}的其他基金

CC* Networking Infrastructure: Integrating Big Data Instrumentation into Campus Cyberinfrastructure
CC* 网络基础设施:将大数据仪器集成到校园网络基础设施中
  • 批准号:
    1827184
  • 财政年份:
    2018
  • 资助金额:
    $ 995.22万
  • 项目类别:
    Standard Grant
Framework: Data: HDR: Extensible Geospatial Data Framework towards FAIR (Findable, Accessible, Interoperable, Reusable) Science
框架:数据:HDR:面向 FAIR(可查找、可访问、可互操作、可重用)科学的可扩展地理空间数据框架
  • 批准号:
    1835822
  • 财政年份:
    2018
  • 资助金额:
    $ 995.22万
  • 项目类别:
    Standard Grant
CIF21 DIBBs: Integrating Geospatial Capabilities into HUBzero
CIF21 DIBB:将地理空间功能集成到 HUBzero 中
  • 批准号:
    1261727
  • 财政年份:
    2013
  • 资助金额:
    $ 995.22万
  • 项目类别:
    Cooperative Agreement
INTEROP: Developing Community-based DRought Information Network Protocols and Tools for Multidisciplinary Regional Scale Applications (DRInet)
INTEROP:开发基于社区的干旱信息网络协议和工具,用于多学科区域规模应用(DRInet)
  • 批准号:
    0753116
  • 财政年份:
    2008
  • 资助金额:
    $ 995.22万
  • 项目类别:
    Continuing Grant
SCI: TeraGrid Resource Partners
SCI:TeraGrid 资源合作伙伴
  • 批准号:
    0503992
  • 财政年份:
    2005
  • 资助金额:
    $ 995.22万
  • 项目类别:
    Cooperative Agreement
SBIR Phase I: An Adaptive Remote-Data Access System For Wireless Handheld Devices
SBIR 第一阶段:无线手持设备的自适应远程数据访问系统
  • 批准号:
    0231708
  • 财政年份:
    2003
  • 资助金额:
    $ 995.22万
  • 项目类别:
    Standard Grant

相似海外基金

EA: Upgrade of the Laser Heating System in the High-Pressure Diamond-Anvil Cell Laboratory at Arizona State University
EA:亚利桑那州立大学高压金刚石砧室实验室激光加热系统升级
  • 批准号:
    2335071
  • 财政年份:
    2024
  • 资助金额:
    $ 995.22万
  • 项目类别:
    Standard Grant
Structural Determination of the Martian Core through High-Pressure, High-Temperature Experiments Using a Diamond Anvil Cell
使用金刚石砧池通过高压、高温实验确定火星核心的结构
  • 批准号:
    23KJ0725
  • 财政年份:
    2023
  • 资助金额:
    $ 995.22万
  • 项目类别:
    Grant-in-Aid for JSPS Fellows
AnVIL Clinical Environment for Innovation and Translation (ACE-IT)
AnVIL 创新与转化临床环境 (ACE-IT)
  • 批准号:
    10747551
  • 财政年份:
    2023
  • 资助金额:
    $ 995.22万
  • 项目类别:
Experimental study of Raman geobarometry using hydrothermal diamond anvil cell
热液金刚石压砧拉曼地压测量实验研究
  • 批准号:
    22H01333
  • 财政年份:
    2022
  • 资助金额:
    $ 995.22万
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
Collaborative: EAGER: Demonstration that Thin Film Phase Transformations Can Be Monitored at High-Temperature and High-Pressure in a Diamond Anvil Cell
协作:EAGER:证明可以在金刚石砧池中的高温高压下监测薄膜相变
  • 批准号:
    2031331
  • 财政年份:
    2021
  • 资助金额:
    $ 995.22万
  • 项目类别:
    Standard Grant
Collaborative: EAGER: Demonstration that Thin Film Phase Transformations Can Be Monitored at High-Temperature and High-Pressure in a Diamond Anvil Cell
协作:EAGER:证明可以在金刚石砧池中的高温高压下监测薄膜相变
  • 批准号:
    2031149
  • 财政年份:
    2021
  • 资助金额:
    $ 995.22万
  • 项目类别:
    Standard Grant
Exploration of new superconductors using diamond anvil cell
利用金刚石砧池探索新型超导体
  • 批准号:
    19H02177
  • 财政年份:
    2019
  • 资助金额:
    $ 995.22万
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
High-pressure generation to the lowest mantle condition in multi-anvil apparatus using nano-polycrystalline diamond anvils
使用纳米多晶金刚石砧在多砧装置中产生最低地幔条件的高压
  • 批准号:
    19K21901
  • 财政年份:
    2019
  • 资助金额:
    $ 995.22万
  • 项目类别:
    Grant-in-Aid for Challenging Research (Exploratory)
Study on local Initiative and Interaction with the Korean Peninsula around the establishment of the Ritsuryo State, based on the inner patterns of the pottery by beating technique using the anvil
基于铁砧敲击陶器内部纹样的立陵国建立前后的地方倡议及与朝鲜半岛的互动研究
  • 批准号:
    19K01106
  • 财政年份:
    2019
  • 资助金额:
    $ 995.22万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Project Anvil
铁砧计划
  • 批准号:
    536269-2018
  • 财政年份:
    2019
  • 资助金额:
    $ 995.22万
  • 项目类别:
    Experience Awards (previously Industrial Undergraduate Student Research Awards)
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了