SHF: Small: Addressing Challenges for the Next Decade of Massively Parallel NUMA Accelerators
SHF:小型:应对大规模并行 NUMA 加速器未来十年的挑战
基本信息
- 批准号:1910924
- 负责人:
- 金额:$ 49.54万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2019
- 资助国家:美国
- 起止时间:2019-10-01 至 2023-09-30
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
The physical and economic principles that enabled Dennard scaling and Moore's law in the semiconductor industry have reached their breaking point. However, as the number of transistors economically fabricated on a single chip plateaus, the processor industry has pivoted to create single-package computing systems, composed of multiple sub-components known as chiplets. Chiplets, which communicate via high-bandwidth on-package networks, offer the potential for transparent performance scaling into the next decade. However, chiplets introduce challenging non-uniform memory access characteristics into single-package systems that have traditionally not been subject to these effects. This project develops techniques to overcome the challenges of non-uniform memory accesses on high-performance single- and multi-package systems without programmer intervention. Exploring programmer-transparent scaling mechanisms improves the portability and lifetime of programs, decreasing the cost and complexity of software. Through the creation of course content and undergraduate summer internships, the project fosters an understanding of how to program machines in a post-Moore world and how compute accelerators should be designed to minimize the impact on the end-programmer as system complexity increases.This project develops coordinated data placement and thread scheduling algorithms that leverage static information from the compiler and dynamic information from the runtime system to inform data placement and hardware-based thread scheduling. It advances the state-of-the-art by developing an open-source Graphic Processing Unit (GPU) simulator with a hierarchical interconnect that can be used to model both chiplet-based GPUs and multi-GPU systems. The researchers are exploring compiler informed data placement and thread scheduling in GPUs. Initial results demonstrate that a static analysis of the code can predict the data accessed by GPU threadblocks. Analysis shows that it is possible to determine which threads in a grid share memory pages, and the manner of that sharing, by building new static techniques that add an additional dimension to decades of work on compilers for sequential code. Using static information, in combination with runtime information provided by GPU drivers, the researchers are developing advanced data placement, prefetching, and thread scheduling algorithms. Both future chiplet-based designs and existing multi-GPU systems benefit from the development of these algorithms. Looking beyond the high-bandwidth memory used in GPUs today the project explores the system-level implications of heterogeneous memory in a chiplet-based system. Data placement and thread scheduling have even more importance in GPU systems of the future that make use of high bandwidth memory, traditional dynamic random-access memory, and non-volatile memory. The problem sizes in such systems are anticipated to be so large that opportunistic data placement and thread scheduling are even more critical than in conventional systems. The project uses sharing patterns based on the inter-kernel producer-consumer nature of machine learning workloads to change the program's code layout, runtime data placement, and threadblock scheduling algorithm to maximize locality in multi-node systems.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
在半导体行业实现了丹纳德缩放和摩尔定律的物理和经济原则已经达到了他们的突破点。但是,随着在单个芯片高原上经济制造的晶体管数量,处理器行业已枢转创建单包计算系统,该系统由多个子组件组成,称为chiplets。通过高带宽的包装网络通信的Chiplet为未来十年的透明性能扩展提供了潜力。但是,奇普特(Chiplets)将具有挑战性的非均匀记忆访问特性引入了传统上不受这些影响的单包装系统。该项目开发了克服在没有程序员干预的情况下,在高性能单包系统和多包装系统上克服非均匀内存访问的挑战的技术。探索程序员 - 透明缩放机制可改善程序的可移植性和寿命,从而降低软件的成本和复杂性。通过创建课程内容和本科暑期实习,该项目促进了对如何在运动后世界中编程机器进行编程的理解调度。它通过开发具有层次互连的开源图形处理单元(GPU)模拟器来推进最新技术,该模拟器可用于模拟基于chiplet的GPU和多GPU系统。研究人员正在探索编译器知情的数据放置和GPU中的线程调度。初始结果表明,对代码的静态分析可以预测GPU ThreadBlocks访问的数据。分析表明,可以通过构建新的静态技术来确定网格共享内存页面中的哪些线程以及共享的方式,这些静态技术为连续代码的编译器上的数十年工作增加了一个额外的维度。使用静态信息,结合了GPU驱动程序提供的运行时信息,研究人员正在开发高级数据放置,预取程和线程调度算法。这些算法的开发都受益于未来的基于chiplet的设计和现有的多GPU系统。除了当今GPU中使用的高带宽内存之外,该项目还探索了基于chiplet的系统中异质内存的系统级含义。数据放置和线程调度在未来的GPU系统中更为重要,这些系统利用高带宽内存,传统的动态随机访问记忆和非挥发性内存。预计此类系统中的问题大小是如此之大,以至于机会性数据放置和线程调度比传统系统更为关键。该项目使用基于机器学习工作负载的内分内生产商 - 消费者的性质来改变程序的代码布局,运行时数据放置和ThreadBlock计划算法,以最大程度地提高多节点系统中的区域性。该奖项反映了NSF的法定任务,并通过评估范围的范围,该奖项反映了支持者的支持和宽广的知识群体。
项目成果
期刊论文数量(7)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Mitigating GPU Core Partitioning Performance Effects
- DOI:10.1109/hpca56546.2023.10070957
- 发表时间:2023-02
- 期刊:
- 影响因子:0
- 作者:Aaron Barnes;Fangjia Shen;Timothy G. Rogers
- 通讯作者:Aaron Barnes;Fangjia Shen;Timothy G. Rogers
SIMR: Single Instruction Multiple Request Processing for Energy-Efficient Data Center Microservices
SIMR:节能数据中心微服务的单指令多请求处理
- DOI:10.1109/micro56248.2022.00040
- 发表时间:2022
- 期刊:
- 影响因子:0
- 作者:Khairy, Mahmoud;Alawneh, Ahmad;Barnes, Aaron;Rogers, Timothy G.
- 通讯作者:Rogers, Timothy G.
Locality-Centric Data and Threadblock Management for Massive GPUs
- DOI:10.1109/micro50266.2020.00086
- 发表时间:2020-10
- 期刊:
- 影响因子:0
- 作者:Mahmoud Khairy;Vadim Nikiforov;D. Nellans;Timothy G. Rogers
- 通讯作者:Mahmoud Khairy;Vadim Nikiforov;D. Nellans;Timothy G. Rogers
Accel-Sim: An Extensible Simulation Framework for Validated GPU Modeling
- DOI:10.1109/isca45697.2020.00047
- 发表时间:2018-10
- 期刊:
- 影响因子:0
- 作者:Mahmoud Khairy;Zhesheng Shen;Tor M. Aamodt;Timothy G. Rogers
- 通讯作者:Mahmoud Khairy;Zhesheng Shen;Tor M. Aamodt;Timothy G. Rogers
Deterministic Atomic Buffering
确定性原子缓冲
- DOI:10.1109/micro50266.2020.00083
- 发表时间:2020
- 期刊:
- 影响因子:0
- 作者:Chou, Yuan Hsi;Ng, Christopher;Cattell, Shaylin;Intan, Jeremy;Sinclair, Matthew D.;Devietti, Joseph;Rogers, Timothy G.;Aamodt, Tor M.
- 通讯作者:Aamodt, Tor M.
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Timothy Rogers其他文献
A tale of 3 testes? A rare presentation of lipoblastoma with a novel karyotype
- DOI:
10.1016/j.jpedsurg.2009.10.093 - 发表时间:
2010-01-01 - 期刊:
- 影响因子:
- 作者:
Andrew Robb;Timothy Rogers;Guy Nicholls - 通讯作者:
Guy Nicholls
Self-Reported Emotions in Simulation-Based Learning: Active Participants vs. Observers.
基于模拟的学习中的自我报告情绪:主动参与者与观察者。
- DOI:
- 发表时间:
2019 - 期刊:
- 影响因子:0
- 作者:
Timothy Rogers;Carly Andler;Bridget OʼBrien;S. van Schaik - 通讯作者:
S. van Schaik
The BEST study--a prospective study to compare business class versus economy class air travel as a cause of thrombosis.
最佳研究——一项比较商务舱和经济舱航空旅行作为血栓形成原因的前瞻性研究。
- DOI:
10.7196/samj.2256 - 发表时间:
2003 - 期刊:
- 影响因子:0
- 作者:
B. Jacobson;M. Münster;Alberto Smith;K. Burnand;Andrew Carter;A. Abdool‐Carrim;E. Marcos;P. Becker;Timothy Rogers;D. le Roux;J. Calvert;M. Nel;Robyn Brackin;M. Veller - 通讯作者:
M. Veller
Analyzing the Communication Gap Between the Instructional Design Consultant and the Faculty Member in the Design and Development Process of a Web-Based Course
分析网络课程设计和开发过程中教学设计顾问和教师之间的沟通差距
- DOI:
- 发表时间:
2010 - 期刊:
- 影响因子:0
- 作者:
Timothy Rogers - 通讯作者:
Timothy Rogers
Prospective, Multicenter Clinical Study of the Covera Vascular Covered Stent in the Treatment of Stenosis at the Graft-Vein Anastomosis of Dysfunctional Hemodialysis Access Grafts
- DOI:
10.1016/j.jvir.2022.02.008 - 发表时间:
2022-05-01 - 期刊:
- 影响因子:
- 作者:
Bart Dolmatch;Umar Waheed;Saravanan Balamuthusamy;Jeffrey Hoggard;Richard Settlage;George Lipkowitz;Naveen Atray;Mahmood Razavi;Gary Saito;Clifford Sales;Erin Moore;Timothy Rogers;Pablo Pergola;Jeffrey Packer;Jonah Licht;Angelo Makris - 通讯作者:
Angelo Makris
Timothy Rogers的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Timothy Rogers', 18)}}的其他基金
Autonomous Modelling Solutions for Operational Structural Dynamic Systems
运行结构动态系统的自主建模解决方案
- 批准号:
EP/W002140/1 - 财政年份:2022
- 资助金额:
$ 49.54万 - 项目类别:
Research Grant
CAREER: Accessible Accelerators: Leveraging Productive Software on Efficient Hardware
职业:无障碍加速器:在高效硬件上利用高效软件
- 批准号:
1943379 - 财政年份:2020
- 资助金额:
$ 49.54万 - 项目类别:
Continuing Grant
相似国自然基金
SERT-nNOS蛋白相互作用的结构基础及其小分子互作抑制剂的设计、合成及快速抗抑郁活性研究
- 批准号:82373728
- 批准年份:2023
- 资助金额:49 万元
- 项目类别:面上项目
APOE调控小胶质细胞脂代谢模式在ASD认知和社交损伤中的作用及机制研究
- 批准号:82373597
- 批准年份:2023
- 资助金额:49 万元
- 项目类别:面上项目
小胶质细胞外泌体通过miR-486抑制神经元铁死亡介导电针修复脊髓损伤的机制研究
- 批准号:82360454
- 批准年份:2023
- 资助金额:32 万元
- 项目类别:地区科学基金项目
CUL4B正反馈调控FOXO3a-FOXM1通路促进非小细胞肺癌放疗抵抗的机制研究
- 批准号:82360584
- 批准年份:2023
- 资助金额:32 万元
- 项目类别:地区科学基金项目
葡萄糖饥饿条件下AMPK-CREB-PPA1信号通路促进非小细胞肺癌细胞增殖的分子机制研究
- 批准号:82360518
- 批准年份:2023
- 资助金额:32 万元
- 项目类别:地区科学基金项目
相似海外基金
HealthyU-Latinx: A Technology-based Tool for addressing Health Literacy in Latinx Secondary Students and their Families
HealthyU-Latinx:一种基于技术的工具,用于提高拉丁裔中学生及其家庭的健康素养
- 批准号:
10699830 - 财政年份:2023
- 资助金额:
$ 49.54万 - 项目类别:
CPS: Small: Infusing Quantum Computing, Decomposition, and Learning for Addressing Cyber-Physical Systems Optimization Challenges
CPS:小型:融合量子计算、分解和学习来应对网络物理系统优化挑战
- 批准号:
2312086 - 财政年份:2023
- 资助金额:
$ 49.54万 - 项目类别:
Standard Grant
Addressing the wireless power problem: A low-power hybrid radio for neuroscience experiments
解决无线电源问题:用于神经科学实验的低功耗混合无线电
- 批准号:
10697023 - 财政年份:2023
- 资助金额:
$ 49.54万 - 项目类别:
Addressing bone marrow lesions that compromise osteochondral tissue repair
解决损害骨软骨组织修复的骨髓病变
- 批准号:
10822755 - 财政年份:2023
- 资助金额:
$ 49.54万 - 项目类别:
Beat Extreme: An Interactive, Tailored Text Messaging Program Combining Extreme Weather Alerts with Hyper-localized Resources & Actionable Insights for Addressing Climate Change
Beat Extreme:一款将极端天气警报与超本地化资源相结合的交互式定制短信程序
- 批准号:
10698887 - 财政年份:2023
- 资助金额:
$ 49.54万 - 项目类别: