Programmable Code Optimization and Empirical Tuning For High-end Computing
高端计算的可编程代码优化和经验调整
基本信息
- 批准号:1261778
- 负责人:
- 金额:$ 11万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2012
- 资助国家:美国
- 起止时间:2012-09-01 至 2014-08-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
The complexity of modern high-end computers has made it exceedingly difficult for scientific applications to effectively manage resources such as extreme-scale parallelism, single-chip multi-processors, and deep hierarchy of shared/distributed caches and memories. In particular, as machines and applications have both evolved to become complex and massively parallel, compilers have failed to automatically bridge the gap between complex software and diverse hardware platforms.Optimization models for parallel computing have lagged far behind those for serial applications, and conventional compilers are increasingly unable to accommodate emerging high-end architectures.This research develops a new optimization model that allows1) developers to effectively interact with advanced optimizing compilers to provide both domain-specific knowledge and high-level optimization strategies (e.g., directions to enable new or choose amongst differing parallelization strategies); 2) computational specialists to easily define arbitrary domain-specific transformations to directly control performance optimizations to their code; 3) architecture-sensitive optimizations to be easily parameterized and empirically tuned to achieve portable high performance.The optimization model is supported with an integrated environment that contains two main components: ROSE, a C/C++/Fortran2003 source-to-source optimizing compiler developed at DOE/LLNL; and POET, a transformation language together with an empirical optimization engine developed at UTSA. This framework permits different levels of automation and programmer intervention, from fully-automated tuning to semi-automated development to fully programmable control. The research targets both the optimization needs of computational kernels and the more general requirements of whole program optimizations. The framework is integrated as an external development mechanism for the widely-adopted ATLAS library and is connected with empirical tuning research under DOE SciDAC program to improve the efficiency of large-scale scientific applications.
现代高端计算机的复杂性使得科学应用程序非常难以有效地管理资源,如极端规模的并行、单芯片多处理器以及共享/分布式高速缓存和存储器的深层层次结构。特别是,随着机器和应用程序都发展到复杂和大规模并行,编译器无法自动弥合复杂软件和不同硬件平台之间的差距。并行计算的优化模型远远落后于串行应用程序的优化模型,传统编译器越来越不能适应新兴的高端体系结构。本研究开发了一个新的优化模型,允许开发人员有效地与高级优化编译器交互,以提供特定领域的知识和高级优化策略(例如,启用新的方向或在不同的并行化策略中进行选择);2)计算专家可以轻松定义任意特定领域的转换,以直接控制代码的性能优化;3)对体系结构敏感的优化,易于参数化和经验性调整,以实现可移植的高性能。该优化模型由一个集成环境支持,该环境包含两个主要组件:由DOE/LLNL开发的C/C++/Fortran 2003源代码到源代码优化编译器ROSE;以及转换语言POTE和UTSA开发的经验优化引擎。该框架允许不同级别的自动化和程序员干预,从全自动调谐到半自动开发再到完全可编程控制。该研究既针对计算内核的优化需求,也针对整个程序优化的更一般需求。该框架被集成为广泛采用的ATLAS库的外部开发机制,并与美国能源部SciDAC计划下的经验调整研究相连接,以提高大规模科学应用的效率。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Qing Yi其他文献
Automatically Optimizing Stencil Computations on Many-Core NUMA Architectures
自动优化多核 NUMA 架构上的模板计算
- DOI:
- 发表时间:
2016 - 期刊:
- 影响因子:0
- 作者:
Pei;Qing Yi;D. Quinlan;C. Liao;Yongqing Yan - 通讯作者:
Yongqing Yan
An Evaluation of Rater Agreement Indices Using Generalizability Theory
使用概括性理论评估评估者一致性指数
- DOI:
10.1007/978-3-319-77249-3_7 - 发表时间:
2017 - 期刊:
- 影响因子:0
- 作者:
Dongmei Li;Qing Yi;Benjamin Andrews - 通讯作者:
Benjamin Andrews
Time-frequency spillovers between carbon, fossil fuels, and clean energy markets: New insights from the TVP-VAR framework
碳、化石燃料和清洁能源市场之间的时频溢出效应:来自TVP - VAR框架的新见解
- DOI:
10.1016/j.energy.2025.135737 - 发表时间:
2025-05-15 - 期刊:
- 影响因子:9.400
- 作者:
Qing Yi;Yuanying Jiang - 通讯作者:
Yuanying Jiang
R-ISS Stage-Dependent Single-Cell Sequencing Analysis Uncovers Oncogenes and Potential Immunotherapeutic Targets in Multiple Myeloma
- DOI:
10.1182/blood-2023-179654 - 发表时间:
2023-11-02 - 期刊:
- 影响因子:
- 作者:
Ling Zhong;Ji Luo;Lan Luo;Qing Yi;Tao Jiang - 通讯作者:
Tao Jiang
Mechanization Services, Farm Productivity and Institutional Innovation in China
中国机械化服务、农业生产力与制度创新
- DOI:
10.1108/caer-12-2018-0244 - 发表时间:
2019-06 - 期刊:
- 影响因子:5.1
- 作者:
Qing Yi;Moyu Chen;Yu Sheng;Jikun Huang - 通讯作者:
Jikun Huang
Qing Yi的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Qing Yi', 18)}}的其他基金
SHF: Small: Whole-application Coordiated Parallelization Through The Optimization Of Abstraction Hierarchies
SHF:小型:通过抽象层次结构的优化实现全应用程序协调并行化
- 批准号:
1910488 - 财政年份:2019
- 资助金额:
$ 11万 - 项目类别:
Standard Grant
I-Corps: Optimized Compiler Applications
I-Corps:优化的编译器应用程序
- 批准号:
1723712 - 财政年份:2017
- 资助金额:
$ 11万 - 项目类别:
Standard Grant
SHF: Small: Specializing Compilers For High Performance Computing Through Coordinated Data and Algorithm Optimizations
SHF:小型:通过协调数据和算法优化实现高性能计算的专用编译器
- 批准号:
1421443 - 财政年份:2014
- 资助金额:
$ 11万 - 项目类别:
Standard Grant
CAREER: Multilayer Code Synthesis For Correctness and Performance
职业:多层代码合成以确保正确性和性能
- 批准号:
1261811 - 财政年份:2012
- 资助金额:
$ 11万 - 项目类别:
Continuing Grant
SHF: Small: Collaborative Research: Programming Interface And Runtime For Self-Tuning Scalable C/C++ Data Structures
SHF:小型:协作研究:自调整可扩展 C/C 数据结构的编程接口和运行时
- 批准号:
1261584 - 财政年份:2012
- 资助金额:
$ 11万 - 项目类别:
Standard Grant
SHF: Small: Collaborative Research: Programming Interface And Runtime For Self-Tuning Scalable C/C++ Data Structures
SHF:小型:协作研究:自调整可扩展 C/C 数据结构的编程接口和运行时
- 批准号:
1218179 - 财政年份:2012
- 资助金额:
$ 11万 - 项目类别:
Standard Grant
CAREER: Multilayer Code Synthesis For Correctness and Performance
职业:多层代码合成以确保正确性和性能
- 批准号:
0747357 - 财政年份:2008
- 资助金额:
$ 11万 - 项目类别:
Continuing Grant
Programmable Code Optimization and Empirical Tuning For High-end Computing
高端计算的可编程代码优化和经验调整
- 批准号:
0833203 - 财政年份:2008
- 资助金额:
$ 11万 - 项目类别:
Standard Grant
相似国自然基金
基于Big Code深度背景增强的Android应用代码反混淆研究
- 批准号:61972290
- 批准年份:2019
- 资助金额:60.0 万元
- 项目类别:面上项目
基于强自旋轨道耦合纳米线自旋量子比特的Surface code量子计算实验研究
- 批准号:11574379
- 批准年份:2015
- 资助金额:73.0 万元
- 项目类别:面上项目
提高网络存储可靠性- P2P文件Erasure Code机制研究
- 批准号:60303002
- 批准年份:2003
- 资助金额:24.0 万元
- 项目类别:青年科学基金项目
新一代乘积编码(Product Code)及解码方法的研究
- 批准号:60372070
- 批准年份:2003
- 资助金额:22.0 万元
- 项目类别:面上项目
相似海外基金
Automating Matrix Code Optimization for Performance and Portability
自动优化矩阵代码以提高性能和可移植性
- 批准号:
RGPIN-2019-06516 - 财政年份:2022
- 资助金额:
$ 11万 - 项目类别:
Discovery Grants Program - Individual
Automating Matrix Code Optimization for Performance and Portability
自动优化矩阵代码以提高性能和可移植性
- 批准号:
RGPIN-2019-06516 - 财政年份:2021
- 资助金额:
$ 11万 - 项目类别:
Discovery Grants Program - Individual
ASCENT: Ferroelectric-based Compute-in-Memory Dynamical Engine (Ferro-CoDE) to Solve Hard Combinatorial Optimization
ASCENT:基于铁电的内存计算动态引擎 (Ferro-CoDE) 解决硬组合优化问题
- 批准号:
2132918 - 财政年份:2021
- 资助金额:
$ 11万 - 项目类别:
Standard Grant
Automating Matrix Code Optimization for Performance and Portability
自动优化矩阵代码以提高性能和可移植性
- 批准号:
RGPIN-2019-06516 - 财政年份:2020
- 资助金额:
$ 11万 - 项目类别:
Discovery Grants Program - Individual
Efficient Optimization of Code that Compute with Multidimensional Arrays
多维数组计算代码的高效优化
- 批准号:
554268-2020 - 财政年份:2020
- 资助金额:
$ 11万 - 项目类别:
University Undergraduate Student Research Awards
Automating Matrix Code Optimization for Performance and Portability
自动优化矩阵代码以提高性能和可移植性
- 批准号:
DGECR-2019-00303 - 财政年份:2019
- 资助金额:
$ 11万 - 项目类别:
Discovery Launch Supplement
Automating Matrix Code Optimization for Performance and Portability
自动优化矩阵代码以提高性能和可移植性
- 批准号:
RGPIN-2019-06516 - 财政年份:2019
- 资助金额:
$ 11万 - 项目类别:
Discovery Grants Program - Individual
Optimization of building-ground coupled seismic motion analysis code on parallel system with mutli-core processors
多核处理器并行系统上建筑-地面耦合地震运动分析代码的优化
- 批准号:
18K11325 - 财政年份:2018
- 资助金额:
$ 11万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Design of reliable and efficient communication and computing systems: architecture, code design, and optimization
可靠高效的通信和计算系统的设计:架构、代码设计和优化
- 批准号:
436111-2013 - 财政年份:2018
- 资助金额:
$ 11万 - 项目类别:
Discovery Grants Program - Individual
Multi-Opt - Multi-criterial Code Optimization for Embedded Hard Real-Time Systems
Multi-Opt - 嵌入式硬实时系统的多标准代码优化
- 批准号:
380772147 - 财政年份:2017
- 资助金额:
$ 11万 - 项目类别:
Research Grants