CRII: SHF: A Compiler and Runtime Infrastructure for Flexible Scheduling and Scheduling-Enabled Optimizations on GPUs
CRII:SHF:用于 GPU 上灵活调度和启用调度优化的编译器和运行时基础架构
基本信息
- 批准号:1464216
- 负责人:
- 金额:$ 17.5万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2015
- 资助国家:美国
- 起止时间:2015-03-01 至 2018-02-28
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Title: CRII:SHF: A Compiler and Runtime Infrastructure for Flexible Scheduling and Scheduling-Enabled Optimizations on GPUsThe computing power of a GPU (Graphics Processing Unit) lies in its abundant memory bandwidth and massive parallelism. However, its hardware thread schedulers, despite being able to quickly distribute computation to processors, often fail to capitalize on program characteristics effectively, achieving only a fraction of the GPUs' full potential. Moreover, current GPUs do not allow programmers or compilers to control thread scheduling, forfeiting important optimization opportunities at the program level. This research aims to develop a new software-level infrastructure for flexible scheduling and scheduling-enabled optimizations on GPUs. The intellectual merits of the research are two-fold: 1) It develops compiler techniques to circumvent the restrictions from the hardware thread scheduler, which enable programmers or the runtime to flexibly schedule tasks to the GPU processors; 2) It designs runtime optimizations to leverage the flexible scheduling. The project's broader significance and importance are that it provides essential support enhancing the computing efficiency of data-intensive applications in the era of GPU computing and, due to the importance of these applications, fosters sustained advances in science, engineering, humanity, and health.The project designs a code transformation component to enable flexible scheduling. The transformation, named SM (Streaming Multiprocessor)-centric transformation, consists of two techniques. The first technique is SM-centric task selection, which breaks the mapping between tasks and thread blocks and directly associates tasks with processors. The second technique is a filling and retreating scheme, which addresses some behaviors of the hardware scheduler and flexibly controls the number of active tasks for each processor. The project also designs three types of optimizations, namely parallelism control, affinity-based scheduling, and processor partitioning, which leverage the scheduling support to optimize for parallelism, locality, and resource allocation. The project develops both static and dynamic approaches to efficiently searching for the optimal scheduling strategies adapted to address various program and input features.
职务名称:CRII:SHF:GPU上的灵活调度和支持并行优化的并行和可扩展基础架构GPU(图形处理单元)的计算能力在于其丰富的内存带宽和大规模并行性。然而,尽管其硬件线程处理器能够快速将计算分配给处理器,但通常无法有效地利用程序特性,只能实现GPU全部潜力的一小部分。此外,当前的GPU不允许程序员或编译器控制线程调度,从而丧失了在程序级的重要优化机会。这项研究旨在开发一种新的软件级基础设施,用于在GPU上进行灵活的调度和支持并行处理的优化。该研究的智力价值是两方面的:1)它开发了编译器技术,以规避来自硬件线程调度器的限制,使程序员或运行时能够灵活地将任务调度到GPU处理器; 2)它设计了运行时优化,以利用灵活的调度。该项目的更广泛意义和重要性在于,它为GPU计算时代的数据密集型应用程序的计算效率提供了必要的支持,并由于这些应用程序的重要性,促进了科学,工程,人类和健康的持续进步。该项目设计了代码转换组件,以实现灵活的调度。这种转换称为以流多处理器为中心的转换,它由两种技术组成。第一种技术是以SM为中心的任务选择,它打破了任务和线程块之间的映射,直接将任务与处理器相关联。第二种技术是填充和撤退计划,它解决了硬件调度器的一些行为,并灵活地控制每个处理器的活动任务的数量。该项目还设计了三种类型的优化,即并行度控制,基于亲和力的调度和处理器分区,利用调度支持来优化并行度,局部性和资源分配。该项目开发了静态和动态的方法,以有效地搜索适合于解决各种程序和输入功能的最佳调度策略。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Bo Wu其他文献
SPread: Exploiting fractal social community For efficient multi-coPy routing in VDTNs
SPread:利用分形社交社区在 VDTN 中实现高效的多副本路由
- DOI:
10.1109/iccnc.2017.7876149 - 发表时间:
2017 - 期刊:
- 影响因子:0
- 作者:
Bo Wu;Kang;Haiying Shen - 通讯作者:
Haiying Shen
Evaluation of effective elastic constants for polycrystalline PZT thin films by XRD patterns and pole figures
通过 XRD 图案和极图评估多晶 PZT 薄膜的有效弹性常数
- DOI:
10.1007/s11771-007-0229-3 - 发表时间:
2007 - 期刊:
- 影响因子:0
- 作者:
Xuejun Zheng;Liping Tang;Qin;Bo Wu - 通讯作者:
Bo Wu
Stretchable thermoelectric generators with enhanced output by infrared reflection for wearable application
可拉伸热电发生器,通过红外反射增强输出,适用于可穿戴应用
- DOI:
10.1016/j.cej.2022.139749 - 发表时间:
2022-10 - 期刊:
- 影响因子:15.1
- 作者:
Bo Wu;Wei Wei;Yang Guo;Weng Hou Yip;Beng Kang Tay;Chengyi Hou;Qinghong Zhang;Yaogang Li;Hongzhi Wang - 通讯作者:
Hongzhi Wang
An Investigation of Half-Metallic Ferromagnets Behavior in Hg2CuTi-Type Heusler Alloy Ti2FeAl by Using GGA
利用 GGA 研究 Hg2CuTi 型 Heusler 合金 Ti2FeAl 中的半金属铁磁体行为
- DOI:
10.4028/www.scientific.net/amr.535-537.1291 - 发表时间:
2012 - 期刊:
- 影响因子:0
- 作者:
Xiude Yang;Bo Wu;Song Zhang - 通讯作者:
Song Zhang
Amyloid b proteins inhibit Cl 2 -ATPase activity in cultured rat hippocampal neurons
淀粉样蛋白 b 抑制培养的大鼠海马神经元中的 Cl 2 -ATP 酶活性
- DOI:
- 发表时间:
2001 - 期刊:
- 影响因子:0
- 作者:
K. Yagyu;K. Kitagawa;T. Irie;Bo Wu;Xun;N. Hattori;C. Inagaki - 通讯作者:
C. Inagaki
Bo Wu的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Bo Wu', 18)}}的其他基金
CAREER: Compiler and Runtime Support for Multi-Tasking on Commodity GPUs
职业:商用 GPU 上多任务的编译器和运行时支持
- 批准号:
1750760 - 财政年份:2018
- 资助金额:
$ 17.5万 - 项目类别:
Continuing Grant
SPX: Collaborative Research: Pinpointing and Resolving Scalability Culprits Hidden in Different Components of the Whole System Stack
SPX:协作研究:查明并解决隐藏在整个系统堆栈不同组件中的可扩展性问题
- 批准号:
1823005 - 财政年份:2018
- 资助金额:
$ 17.5万 - 项目类别:
Standard Grant
CSR: Small: Collaborative Research: Exploring Portable Data Placement on Massively Parallel Platforms with Heterogeneous Memory Architectures
CSR:小型:协作研究:探索具有异构内存架构的大规模并行平台上的便携式数据放置
- 批准号:
1618912 - 财政年份:2016
- 资助金额:
$ 17.5万 - 项目类别:
Standard Grant
相似国自然基金
天然超短抗菌肽Temporin-SHf衍生多肽的构效分析与抗菌机制研究
- 批准号:
- 批准年份:2024
- 资助金额:0.0 万元
- 项目类别:省市级项目
衔接蛋白SHF负向调控胶质母细胞瘤中EGFR/EGFRvIII再循环和稳定性的功能及机制研究
- 批准号:82302939
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
EGFR/GRβ/Shf调控环路在胶质瘤中的作用机制研究
- 批准号:81572468
- 批准年份:2015
- 资助金额:60.0 万元
- 项目类别:面上项目
相似海外基金
SHF: FET: Medium: Designing and Synthesizing a Quantum Circuit Compiler
SHF:FET:中:设计和综合量子电路编译器
- 批准号:
2212232 - 财政年份:2022
- 资助金额:
$ 17.5万 - 项目类别:
Standard Grant
SHF: Medium: Formal Methods as a First-Class Citizen of a Mainstream Compiler Framework
SHF:Medium:作为主流编译器框架的一等公民的形式方法
- 批准号:
1955688 - 财政年份:2020
- 资助金额:
$ 17.5万 - 项目类别:
Continuing Grant
SHF: Small: The Compiler-Architecture Solution to the Data Dependent, Circuit-Level Critical-Paths Variations
SHF:小型:针对数据相关、电路级关键路径变化的编译器架构解决方案
- 批准号:
1908488 - 财政年份:2019
- 资助金额:
$ 17.5万 - 项目类别:
Standard Grant
SHF: Small: Indy: Toward Safe and Fast Compiler Flags
SHF:小:Indy:迈向安全快速的编译器标志
- 批准号:
1817073 - 财政年份:2018
- 资助金额:
$ 17.5万 - 项目类别:
Standard Grant
SHF: Small: Improving Memory Performance on Fused Architectures through Compiler and Runtime Innovations
SHF:小型:通过编译器和运行时创新提高融合架构的内存性能
- 批准号:
1525609 - 财政年份:2015
- 资助金额:
$ 17.5万 - 项目类别:
Standard Grant
SHF: Small: Compiler and Architectural Techniques for Soft Error Resilience
SHF:小型:软错误恢复能力的编译器和架构技术
- 批准号:
1527463 - 财政年份:2015
- 资助金额:
$ 17.5万 - 项目类别:
Standard Grant
SHF: Small: Compiler Validation via Equivalence Modulo Inputs
SHF:小型:通过等效模输入进行编译器验证
- 批准号:
1528133 - 财政年份:2015
- 资助金额:
$ 17.5万 - 项目类别:
Standard Grant
SHF: Small: Collaborative Research: Compiler Coaching
SHF:小型:协作研究:编译器指导
- 批准号:
1421412 - 财政年份:2014
- 资助金额:
$ 17.5万 - 项目类别:
Standard Grant
SHF: SMALL: COLLABORATIVE RESEARCH: Compiler Coaching
SHF:小型:协作研究:编译器指导
- 批准号:
1421652 - 财政年份:2014
- 资助金额:
$ 17.5万 - 项目类别:
Standard Grant
SHF: Small: Optimizing Compiler and Runtime for Concurrency-Oriented Execution Model
SHF:小型:优化面向并发的执行模型的编译器和运行时
- 批准号:
1421505 - 财政年份:2014
- 资助金额:
$ 17.5万 - 项目类别:
Standard Grant