XPS: FULL: FP: Collaborative Research: Taming parallelism: optimally exploiting high-throughput parallel architectures
XPS:完整:FP:协作研究:驯服并行性:最佳地利用高吞吐量并行架构
基本信息
- 批准号:1439126
- 负责人:
- 金额:$ 32.96万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2014
- 资助国家:美国
- 起止时间:2014-09-01 至 2019-08-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Title: XPS: FULL: FP: Collaborative Research: Taming parallelism: Optimally exploiting high-throughput parallel architecturesOver the past decade, computer manufacturers have focused on producing "multicore" chips, that package multiple, powerful computing cores on a single chip. Researchers have invested significant effort in developing methods for writing programs that can run efficiently on these cores. The basic idea is to allow programmers to write programs using a high-level programming model and to rely on an underlying compiler and runtime system to efficiently schedule these programs on multicore platforms. However, due to power and heat dissipation concerns, emerging "throughput-oriented" computing systems increasingly rely on far simpler computing cores to deliver parallel computing performance. These cores are much more efficient than traditional multicores, and can deliver much higher performance. Practitioners across numerous fields -- bioinformatics, data analytics, machine learning, etc. -- are deploying these systems to harness their power. Unfortunately, existing high level programming models are targeted to multicore chips, and do not produce code that can run effectively on these new systems. As a result, practitioners are forced to rewrite their applications, with painstaking low-level optimization and scheduling. This project will develop schemes to adapt applications written for multicore systems to run efficiently on throughput-oriented processors. The intellectual merits are novel program optimizations that will transform multicore-oriented programs into forms that map efficiently to throughput-oriented processors, scheduling mechanisms that ensure that these throughput-oriented processors do not waste computational resources, and scheduling policies that ensure that the mechanisms are used effectively. The project's broader significance and importance are that programmers will be able to write portable, high-performant and energy-efficient programs for both traditional multicore systems as well as throughput-oriented systems. Moreover, high-level programming models will be used to program the throughput-oriented machines, thus leading to significant reduction of programming effort for practitioners in many science and engineering disciplines. Finally, outreach efforts enhance the project by providing training and mentoring to a diverse group of students.Languages like Cilk provide support for "dynamic multithreading", which allows programmers to identify all of the parallelism in their program, while relying on sophisticated runtime systems to map that parallelism to available parallel execution hardware at runtime. However, Cilk-style execution is inappropriate for the vector-based parallelism found in SIMD units, GPUs and the Xeon Phi; vector parallelism requires finding identical computations performed on different data units. This project investigates a series of transformations that will morph Cilk-style programs into programs that expose vectorizable parallelism, allowing dynamic multithreading programs to be mapped to emerging throughput-oriented architectures. The enabling transformation involves transforming task parallel applications into data-parallel applications by identifying similar tasks being performed at different points in the computation. This project develops a series of scheduling mechanisms and provably efficient scheduling policies that ensure that parallelizing dynamic multithreading applications on throughput-oriented architectures are effective. In this manner, this project enables portable applications that run efficiently both on multicores and on vector-based architectures.
职务名称:光电子能谱:满:FP:合作研究:驯服并行:最佳地利用高吞吐量并行架构在过去的十年中,计算机制造商一直专注于生产“多核”芯片,即在单个芯片上封装多个强大的计算核心。研究人员已经投入了大量的精力来开发编写可以在这些核心上有效运行的程序的方法。其基本思想是允许程序员使用高级编程模型编写程序,并依赖底层编译器和运行时系统在多核平台上有效地调度这些程序。然而,由于功耗和散热问题,新兴的“面向吞吐量”计算系统越来越依赖简单得多的计算核心来提供并行计算性能。这些内核比传统的多核更高效,并且可以提供更高的性能。生物信息学、数据分析、机器学习等众多领域的从业者正在部署这些系统来利用它们的力量。不幸的是,现有的高级编程模型是针对多核芯片的,并且不能产生可以在这些新系统上有效运行的代码。因此,从业者被迫重写他们的应用程序,进行艰苦的底层优化和调度。该项目将开发方案,使为多核系统编写的应用程序能够在面向吞吐量的处理器上高效运行。智能的优点是新的程序优化,将多核导向的程序转换成有效地映射到面向吞吐量的处理器,调度机制,确保这些面向吞吐量的处理器不浪费计算资源,调度策略,确保机制被有效地使用的形式。该项目更广泛的意义和重要性在于,程序员将能够为传统的多核系统以及面向吞吐量的系统编写可移植的,高性能的和节能的程序。此外,高级编程模型将被用来编程的吞吐量为导向的机器,从而导致在许多科学和工程学科的从业者的编程工作显着减少。最后,拓展工作通过为不同的学生群体提供培训和指导来增强项目。像Cilk这样的语言提供了对“动态多线程”的支持,这允许程序员识别他们程序中的所有并行性,同时依赖于复杂的运行时系统来将该并行性映射到运行时可用的并行执行硬件。然而,Cilk风格的执行不适合SIMD单元、GPU和Xeon Phi中基于向量的并行性;向量并行性需要找到在不同数据单元上执行的相同计算。这个项目调查了一系列的转换,将变形Cilk风格的程序到程序,暴露可向量化的并行性,允许动态多线程程序被映射到新兴的面向吞吐量的架构。启用转换涉及通过识别在计算中的不同点处执行的类似任务来将任务并行应用转换为数据并行应用。该项目开发了一系列的调度机制和可证明有效的调度策略,以确保面向吞吐量的体系结构上的并行化动态多线程应用程序是有效的。通过这种方式,该项目使便携式应用程序能够在多核和基于向量的架构上高效运行。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Milind Kulkarni其他文献
Can paediatric surgical registrars safely perform supervised hypospadias surgery?
儿科手术注册员可以在监督下安全地进行尿道下裂手术吗?
- DOI:
- 发表时间:
2024 - 期刊:
- 影响因子:2
- 作者:
Charlotte Hughes;Hazem Mosa;Sandra Johnson;J. Parr;Ravindar Anbarasan;Milind Kulkarni;A. Mathur - 通讯作者:
A. Mathur
InContext: simple parallelism for distributed applications
InContext:分布式应用程序的简单并行性
- DOI:
10.1145/1996130.1996144 - 发表时间:
2011 - 期刊:
- 影响因子:0
- 作者:
Sunghwan Yoo;Hyojeong Lee;C. Killian;Milind Kulkarni - 通讯作者:
Milind Kulkarni
Garbage Collection for Mostly Serialized Heaps
大多数序列化堆的垃圾收集
- DOI:
- 发表时间:
2024 - 期刊:
- 影响因子:0
- 作者:
Chaitanya Koparkar;Vidush Singhal;Aditya Gupta;Mike Rainey;Michael Vollmer;Artem Pelenitsyn;Sam Tobin;Milind Kulkarni;Ryan R. Newton - 通讯作者:
Ryan R. Newton
Scheduling Transformation and Dependence Tests for Recursive Programs
递归程序的调度转换和依赖性测试
- DOI:
- 发表时间:
2018 - 期刊:
- 影响因子:0
- 作者:
Kirshanthan Sundararajah;Milind Kulkarni - 通讯作者:
Milind Kulkarni
The Centre for Market and Public Organisation One Kind of Democracy One Kind of Democracy
市场与公共组织中心 一种民主 一种民主
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
Siwan Anderson;P. Francois;Ashok Kotwal;Milind Kulkarni;Tim Murugkar;Gustavo Besley;Biju Bobonis;Jim Rao;Jim Fearon;Francesco Robinson;John Trebbi;Debraj Hoddinott;Nava Ray;Robin Ashraf;Garance Burgess;Dilip Genicot;Thomas Mookherjee;Fujiwara - 通讯作者:
Fujiwara
Milind Kulkarni的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Milind Kulkarni', 18)}}的其他基金
Collaborative Research: PPoSS: LARGE: A Full-Stack Architecture for Sparse Computation
协作研究:PPoSS:LARGE:稀疏计算的全栈架构
- 批准号:
2216978 - 财政年份:2022
- 资助金额:
$ 32.96万 - 项目类别:
Continuing Grant
Travel: Student Travel Grant for the Programming Languages Mentoring Workshop at PLDI 2022
旅费:PLDI 2022 编程语言指导研讨会的学生旅费补助
- 批准号:
2227746 - 财政年份:2022
- 资助金额:
$ 32.96万 - 项目类别:
Standard Grant
SHF: Small: A Composable, Sound Optimization Framework for Loops and Recursion
SHF:小型:用于循环和递归的可组合、完善的优化框架
- 批准号:
1908504 - 财政年份:2019
- 资助金额:
$ 32.96万 - 项目类别:
Standard Grant
SPX: Write Once, Run on Anything: Verified, Tuned Accelerator Kernels from High Level Specifications
SPX:一次写入,在任何设备上运行:根据高级规范进行验证、调整的加速器内核
- 批准号:
1919197 - 财政年份:2019
- 资助金额:
$ 32.96万 - 项目类别:
Standard Grant
NSF Student Travel Grant for 2019 Midwest Programming Languages Summit (MWPLS)
2019 年中西部编程语言峰会 (MWPLS) 的 NSF 学生旅费补助金
- 批准号:
1942074 - 财政年份:2019
- 资助金额:
$ 32.96万 - 项目类别:
Standard Grant
SPX: Collaborative Research: Eat your Wheaties: Multi-Grain Compilers for Parallel Builds at Every Scale
SPX:协作研究:吃你的小麦:用于各种规模并行构建的多粒度编译器
- 批准号:
1725672 - 财政年份:2017
- 资助金额:
$ 32.96万 - 项目类别:
Standard Grant
SI2-SSI: Collaborative Research: ParaTreet: Parallel Software for Spatial Trees in Simulation and Analysis
SI2-SSI:协作研究:ParaTreet:仿真和分析中的空间树并行软件
- 批准号:
1550525 - 财政年份:2016
- 资助金额:
$ 32.96万 - 项目类别:
Standard Grant
SHF: Small: Collaborative Research: Hybrid Static-Dynamic Analyses for RegionSerializability
SHF:小型:协作研究:区域可串行性的混合静态动态分析
- 批准号:
1422178 - 财政年份:2014
- 资助金额:
$ 32.96万 - 项目类别:
Standard Grant
XPS: CLCCA: On the Hunt for Correctness and Performance Bugs in Large-scale Programs
XPS:CLCCA:寻找大型程序中的正确性和性能错误
- 批准号:
1337158 - 财政年份:2013
- 资助金额:
$ 32.96万 - 项目类别:
Standard Grant
CAREER:Toward a locality-enhancing transformation framework for irregular programs
职业生涯:为非正规项目建立一个增强地方性的转型框架
- 批准号:
1150013 - 财政年份:2012
- 资助金额:
$ 32.96万 - 项目类别:
Continuing Grant
相似国自然基金
基于连续散射模型的多波段极化SAR散射特征充分提取研究
- 批准号:2025JJ60236
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
高质量充分就业目标下的浙江省劳动力结构性错配:事实、测算与优化
- 批准号:2025C35092
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
面向高维复杂流数据的在线充分降维方法研究
- 批准号:QN25A010001
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
关于哈密顿图的充分参数条件的研究
- 批准号:
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
基于数字经济的中国高质量充分就业理论与政策研究
- 批准号:
- 批准年份:2024
- 资助金额:万元
- 项目类别:面上项目
多模态学习中完整表征和充分训练方法研究
- 批准号:Q24F020077
- 批准年份:2024
- 资助金额:0.0 万元
- 项目类别:省市级项目
基于充分降维的分位数处理效应的估计方法和异质性检验
- 批准号:
- 批准年份:2024
- 资助金额:万元
- 项目类别:青年科学基金项目
基于模态间合作与竞争关系的多模态完整表征和充分训练方法研究
- 批准号:62306289
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
低温低熵状态下外加石墨三维充分诱导可熔融生物前驱体制备高度有序、高首次库伦效率的低成本储钠硬碳材料
- 批准号:52302293
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
非充分混合假定下的大尺度流域示踪水文模型研究
- 批准号:52309023
- 批准年份:2023
- 资助金额:20 万元
- 项目类别:青年科学基金项目
相似海外基金
XPS: Full: FP: Collaborative Research: Sphinx: Combining Data and Instruction Level Parallelism through Demand Driven Execution of Imperative Programs
XPS:完整:FP:协作研究:Sphinx:通过命令式程序的需求驱动执行将数据和指令级并行性相结合
- 批准号:
1533828 - 财政年份:2015
- 资助金额:
$ 32.96万 - 项目类别:
Standard Grant
XPS: FULL: FP: Write-Efficient Parallel Algorithms for Emerging Memory Technologies
XPS:FULL:FP:用于新兴内存技术的写高效并行算法
- 批准号:
1533858 - 财政年份:2015
- 资助金额:
$ 32.96万 - 项目类别:
Standard Grant
XPS: FULL: FP: Collaborative Research:Advancing autovectorization
XPS:完整:FP:协作研究:推进自动矢量化
- 批准号:
1533912 - 财政年份:2015
- 资助金额:
$ 32.96万 - 项目类别:
Standard Grant
XPS: Full: FP: Collaborative Research: Sphinx: Combining Data and Instruction Level Parallelism through Demand Driven Execution of Imperative Programs
XPS:完整:FP:协作研究:Sphinx:通过命令式程序的需求驱动执行将数据和指令级并行性相结合
- 批准号:
1533846 - 财政年份:2015
- 资助金额:
$ 32.96万 - 项目类别:
Standard Grant
XPS: FULL: FP: A profile-centric IDE for science-based performance engineering in the cloud
XPS:FULL:FP:以配置文件为中心的 IDE,用于云中基于科学的性能工程
- 批准号:
1533644 - 财政年份:2015
- 资助金额:
$ 32.96万 - 项目类别:
Standard Grant
XPS: FULL: FP: Collaborative Research: Advancing autovectorization
XPS:完整:FP:协作研究:推进自动矢量化
- 批准号:
1533926 - 财政年份:2015
- 资助金额:
$ 32.96万 - 项目类别:
Standard Grant
XPS: FULL: FP: Design and Synthesis of New Energy-efficient Self-healing Computing Electronics with Real-time Configurability
XPS:FULL:FP:具有实时可配置性的新型节能自愈计算电子设备的设计与合成
- 批准号:
1533656 - 财政年份:2015
- 资助金额:
$ 32.96万 - 项目类别:
Standard Grant
XPS: FULL: FP: Collaborative Research: Synchrony-aware Primitives for Building Highly Auditable, Highly Scalable, Highly Available Distributed Systems
XPS:完整:FP:协作研究:用于构建高度可审计、高度可扩展、高度可用的分布式系统的同步感知原语
- 批准号:
1533802 - 财政年份:2015
- 资助金额:
$ 32.96万 - 项目类别:
Standard Grant
XPS: FULL: FP: Tools and Algorithms for Resilient, Power-efficient ExaScale Computing Using the GNU-CAF Compiler
XPS:FULL:FP:使用 GNU-CAF 编译器实现弹性、高能效 ExaScale 计算的工具和算法
- 批准号:
1533850 - 财政年份:2015
- 资助金额:
$ 32.96万 - 项目类别:
Standard Grant
XPS: FULL: FP: Collaborative Research: Model-based, Event Driven Scalable Programming for the Mobile Cloud
XPS:完整:FP:协作研究:移动云的基于模型、事件驱动的可扩展编程
- 批准号:
1438982 - 财政年份:2014
- 资助金额:
$ 32.96万 - 项目类别:
Standard Grant