CSR: Small: Exploiting Slowdowns for Speedup in Power-Scalable HPC Systems


Basic information

Project abstract

Advanced computing systems, which support a wide variety of applications in fields such as economics, the sciences, and medicine, are increasingly designed with energy efficiency in mind. One existing approach to energy management is to run the underlying processors and devices at varying voltage and frequency. Typically, such approaches push the devices to run as fast as possible within thermal limits, on the premise that "faster is better, or at least does no harm." There is growing evidence in the literature that "slower is sometimes better." For example, for benchmark applications such as IOZone, it has been observed that running the processors at a faster speed can lead to significant slowdowns in overall execution time. At large scale, e.g., in the Amazon Web Services cloud, such performance loss can cost hundreds of thousands of dollars in CPU hours and waste precious energy, often generated from polluting fossil fuels. However, isolating the root cause of such slowdowns in today's complex systems at data-center scale is akin to finding a needle in a haystack. Performance is now a function of the complex interaction between application design, system resources, and the underlying hardware. Furthermore, power scaling makes the raw performance of the hardware a variable, which further confounds attempts to isolate slowdowns.
This project builds novel technologies that identify, model, and automate the minimization or elimination of slowdowns in parallel and distributed applications when power scaling is enabled. The key approach is fine-grained application and kernel instrumentation that supports in-depth analysis of the interaction between parallel and distributed applications and the software and hardware stack.
The intellectual merit of this research involves three intermediate research goals: 1) exhaustive testing and deep system and code analysis on a large class of applications and a diverse set of systems to classify and isolate the slowdown phenomenon due to power scaling; 2) design, implementation, and validation of models of the critical paths of applications that are sensitive to slowdowns; and 3) analysis of the resulting models and design, implementation, and validation of automated, open-source runtime optimization techniques that steer power scaling to minimize or eliminate slowdowns.
Completion of the project will improve the performance and energy efficiency of advanced systems. Adoption of the resulting runtime tools will enable the use of power scaling to save energy while simultaneously reducing time-to-solution for modern applications and systems. The resulting artifacts and technologies will contribute to U.S. competitiveness by addressing the challenge of building large-scale systems within power constraints. The educational activities will help produce diverse graduates with highly marketable skill sets. Integrating the research discoveries and software tools, which will be open source and publicly available, into the educational curriculum will help capture the interest of the next generation of computer scientists.

Project outcomes

Journal papers (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)

Other publications by Kirk Cameron

Interpolation of sparse high-dimensional data
  • DOI:
    10.1007/s11075-020-01040-2
  • Published:
    2020-11-13
  • Journal:
  • Impact factor:
    2.000
  • Authors:
    Thomas C. H. Lux; Layne T. Watson; Tyler H. Chang; Yili Hong; Kirk Cameron
  • Corresponding author:
    Kirk Cameron

Other grants by Kirk Cameron

CNS: CORE: Small: iLORE: Computer Systems Performance Integrated Lineage Repository
  • Award number:
    1939076
  • Fiscal year:
    2019
  • Funding amount:
    $500,000
  • Project type:
    Continuing Grant
CSR: Large: VarSys: Managing Variability in High-Performance Computing Systems
  • Award number:
    1838271
  • Fiscal year:
    2018
  • Funding amount:
    $500,000
  • Project type:
    Continuing Grant
CSR: Large: VarSys: Managing Variability in High-Performance Computing Systems
  • Award number:
    1565314
  • Fiscal year:
    2016
  • Funding amount:
    $500,000
  • Project type:
    Continuing Grant
SHF: Small: Collaborative Research: Application-aware Energy Modeling and Power Management for Parallel and High Performance Computing
  • Award number:
    1422712
  • Fiscal year:
    2014
  • Funding amount:
    $500,000
  • Project type:
    Standard Grant
EAGER: Kinetic Computing Sculpture: A functional parallel cluster of Raspberry Pi computers that inspire computational thinking
  • Award number:
    1355955
  • Fiscal year:
    2013
  • Funding amount:
    $500,000
  • Project type:
    Standard Grant
CSR: Medium: Collaborative Research: GridPac: A Resource Management System for Energy and Performance Optimization on Computational Grids
  • Award number:
    0905187
  • Fiscal year:
    2009
  • Funding amount:
    $500,000
  • Project type:
    Continuing Grant
CSR: Large: Collaborative Research: Multi-core Applications Modeling Infrastructure (MAMI)
  • Award number:
    0910784
  • Fiscal year:
    2009
  • Funding amount:
    $500,000
  • Project type:
    Standard Grant
SGER: Metrics And Methodologies for High Performance System Energy Benchmarking
  • Award number:
    0848670
  • Fiscal year:
    2008
  • Funding amount:
    $500,000
  • Project type:
    Standard Grant
CRI: MISER: A High-performance, Power-aware Cluster
  • Award number:
    0709025
  • Fiscal year:
    2007
  • Funding amount:
    $500,000
  • Project type:
    Continuing Grant
CSR-AES: Thermal Conductors: Runtime software support for proactive heat management in advanced execution systems
  • Award number:
    0720750
  • Fiscal year:
    2007
  • Funding amount:
    $500,000
  • Project type:
    Continuing Grant

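Similar grants from the National Natural Science Foundation of China

Forensic application of circadian-rhythm small RNAs in estimating the time of bloodstain formation
  • Award number:
  • Approval year:
    2024
  • Funding amount:
    CNY 0
  • Project type:
    Provincial or municipal project
Mechanism by which tRNA-derived small RNA upregulates the YBX1/CCL5 pathway in bortezomib-induced chronic pain
  • Award number:
    n/a
  • Approval year:
    2022
  • Funding amount:
    CNY 100,000
  • Project type:
    Provincial or municipal project
Small RNA regulation of the type I-F CRISPR-Cas adaptive immune response and its molecular mechanism
  • Award number:
    32000033
  • Approval year:
    2020
  • Funding amount:
    CNY 240,000
  • Project type:
    Young Scientists Fund
Mechanisms by which small RNAs regulate the biocontrol function of Bacillus amyloliquefaciens FZB42
  • Award number:
    31972324
  • Approval year:
    2019
  • Funding amount:
    CNY 580,000
  • Project type:
    General Program
Mechanisms by which Streptococcus mutans small RNAs link LuxS quorum sensing to biofilm formation
  • Award number:
    81900988
  • Approval year:
    2019
  • Funding amount:
    CNY 210,000
  • Project type:
    Young Scientists Fund
Function and mechanism of key gut bacterial small RNAs in the onset and progression of Crohn's disease
  • Award number:
    31870821
  • Approval year:
    2018
  • Funding amount:
    CNY 560,000
  • Project type:
    General Program
Molecular mechanism of pigeon milk secretion analyzed by small RNA sequencing
  • Award number:
    31802058
  • Approval year:
    2018
  • Funding amount:
    CNY 260,000
  • Project type:
    Young Scientists Fund
Pathogenic mechanism of rice grassy stunt virus regulated by small RNA-mediated DNA methylation
  • Award number:
    31772128
  • Approval year:
    2017
  • Funding amount:
    CNY 600,000
  • Project type:
    General Program
Immune regulatory mechanism of acupuncture treatment for Hashimoto's thyroiditis based on small RNA-seq
  • Award number:
    81704176
  • Approval year:
    2017
  • Funding amount:
    CNY 200,000
  • Project type:
    Young Scientists Fund
Regulation of small RNA biosynthesis by rice OsSGS3 and OsHEN1 and its modulation of disease resistance
  • Award number:
    91640114
  • Approval year:
    2016
  • Funding amount:
    CNY 850,000
  • Project type:
    Major Research Plan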

Similar overseas grants

SaTC: CORE: Small: Building Resilience into LEO Satellite Networks by Exploiting Network Layer Characteristics
  • Award number:
    2308761
  • Fiscal year:
    2023
  • Funding amount:
    $500,000
  • Project type:
    Continuing Grant
SaTC: CORE: Small: Exploiting Stimulus-response Correlation for Wireless Hidden Device Localization
  • Award number:
    2155181
  • Fiscal year:
    2022
  • Funding amount:
    $500,000
  • Project type:
    Standard Grant
Collaborative Research: SHF: Small: Exploiting Performance Correlations for Accurate and Low-cost Performance Testing for Serverless Computing
  • Award number:
    2155096
  • Fiscal year:
    2022
  • Funding amount:
    $500,000
  • Project type:
    Standard Grant
Collaborative Research: SHF: Small: Exploiting Performance Correlations for Accurate and Low-cost Performance Testing for Serverless Computing
  • Award number:
    2155097
  • Fiscal year:
    2022
  • Funding amount:
    $500,000
  • Project type:
    Standard Grant
Exploiting new drug targets in extremely resistant M. abscessus by using small molecule Lipid II binders
  • Award number:
    10183396
  • Fiscal year:
    2021
  • Funding amount:
    $500,000
  • Project type:
Exploiting new drug targets in extremely resistant M. abscessus by using small molecule Lipid II binders
  • Award number:
    10378085
  • Fiscal year:
    2021
  • Funding amount:
    $500,000
  • Project type:
III: Small: Collaborative Research: Algorithms, systems, and theories for exploiting data dependencies in crowdsourcing
  • Award number:
    2007941
  • Fiscal year:
    2020
  • Funding amount:
    $500,000
  • Project type:
    Standard Grant
SHF: Small: Understanding and Exploiting Software Defined Networks (SDN) in High Performance Computing (HPC) Environments
  • Award number:
    2007827
  • Fiscal year:
    2020
  • Funding amount:
    $500,000
  • Project type:
    Standard Grant
Exploiting genetic vulnerabilities to improve outcomes in small cell carcinoma of the ovary
  • Award number:
    420635
  • Fiscal year:
    2020
  • Funding amount:
    $500,000
  • Project type:
    Operating Grants
Agents Provocateur: Exploiting bacterial biofilm stimulation to identify bioactive small molecules
  • Award number:
    RGPIN-2016-06521
  • Fiscal year:
    2020
  • Funding amount:
    $500,000
  • Project type:
    Discovery Grants Program - Individual