Reinventing the tuning and debugging tools for multi-thousand cores computer systems

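Basic information

  • Grant number:
    RGPIN-2017-05634
  • Principal investigator:
  • Amount:
    $20,400
  • Host institution:
  • Host institution country:
    Canada
  • Program type:
    Discovery Grants Program - Individual
  • Fiscal year:
    2018
  • Funding country:
    Canada
  • Duration:
    2018-01-01 to 2019-12-31
  • Project status:
    Completed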

Project abstract

The top supercomputers in recent years owe much of their power to the number-crunching ability of Graphics Processing Units (GPUs) used for general-purpose computation (GPGPU). GPGPUs are also frequently used in smaller clusters and individual workstations for demanding tasks. Even in mobile phones and embedded systems, GPGPUs are an efficient means of accomplishing specialised tasks such as image processing. Furthermore, GPGPUs are at the forefront of a wider trend towards heterogeneous system architecture (HSA), where specialised co-processors are used for a number of tasks beyond graphics and video, such as signal processing and network packet processing.

The internal architecture of these complex GPGPU devices, with over 4000 cores and arithmetic units, was not well documented until recently. The architecture has evolved significantly in recent years, with a much tighter integration with the main CPU cores, offering shared virtual memory and user-level queues. The problem is that the programming tools for debugging and tuning these systems are severely lacking. The objective of the proposed research is to devise new, more efficient algorithms and software architectures for debugging, tracing and profiling tools, so that they can adequately cope with hardware architectures that boast thousands of cores and put strong pressure on the available memory and bandwidth. The challenges are at several levels: ensuring that the tracing and profiling tools impose minimal overhead on the system studied, to avoid affecting its behaviour; efficiently exploiting all the available tracing and profiling hardware assistance; and efficiently interfacing the debugging, tracing and profiling tools with the operating system, device driver and application to be monitored.

The significance and novelty of this work lie in addressing a serious challenge currently faced by all users of such systems: having suitable tools to efficiently debug and tune their applications on these systems. It is widely recognised that the extremely rapid advances in the availability of heterogeneous parallel processing hardware have not been matched by equivalent progress in software development tools. As a result, a large number of users are not achieving the level of performance that would be attainable with proper tools, or are simply not taking advantage of such hardware, because of the difficulty of efficiently programming, debugging and tuning these devices. The proposed research program will thus have a significant impact, helping all users of sophisticated computing devices to better exploit the available hardware.

Project outcomes

Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)

Other publications by Dagenais, Michel

An SVM-based framework for detecting DoS attacks in virtualized clouds under changing environment

Other grants by Dagenais, Michel

Reinventing the tuning and debugging tools for multi-thousand cores computer systems
  • Grant number:
    RGPIN-2017-05634
  • Fiscal year:
    2022
  • Funding amount:
    $20,400
  • Program type:
    Discovery Grants Program - Individual
Monitoring and Debugging of High Performance Distributed Heterogeneous Cloud Applications
  • Grant number:
    554158-2020
  • Fiscal year:
    2021
  • Funding amount:
    $20,400
  • Program type:
    Alliance Grants
Reinventing the tuning and debugging tools for multi-thousand cores computer systems
  • Grant number:
    RGPIN-2017-05634
  • Fiscal year:
    2021
  • Funding amount:
    $20,400
  • Program type:
    Discovery Grants Program - Individual
Automated monitoring and debugging of large scale manycore heterogeneous systems
  • Grant number:
    507883-2016
  • Fiscal year:
    2020
  • Funding amount:
    $20,400
  • Program type:
    Collaborative Research and Development Grants
Monitoring and Debugging of High Performance Distributed Heterogeneous Cloud Applications
  • Grant number:
    554158-2020
  • Fiscal year:
    2020
  • Funding amount:
    $20,400
  • Program type:
    Alliance Grants
Reinventing the tuning and debugging tools for multi-thousand cores computer systems
  • Grant number:
    RGPIN-2017-05634
  • Fiscal year:
    2020
  • Funding amount:
    $20,400
  • Program type:
    Discovery Grants Program - Individual
Automated monitoring and debugging of large scale manycore heterogeneous systems
  • Grant number:
    507883-2016
  • Fiscal year:
    2019
  • Funding amount:
    $20,400
  • Program type:
    Collaborative Research and Development Grants
Reinventing the tuning and debugging tools for multi-thousand cores computer systems
  • Grant number:
    RGPIN-2017-05634
  • Fiscal year:
    2019
  • Funding amount:
    $20,400
  • Program type:
    Discovery Grants Program - Individual
Automated monitoring and debugging of large scale manycore heterogeneous systems
  • Grant number:
    507883-2016
  • Fiscal year:
    2018
  • Funding amount:
    $20,400
  • Program type:
    Collaborative Research and Development Grants
Automated monitoring and debugging of large scale manycore heterogeneous systems
  • Grant number:
    507883-2016
  • Fiscal year:
    2017
  • Funding amount:
    $20,400
  • Program type:
    Collaborative Research and Development Grants

Similar overseas grants

Reinventing the tuning and debugging tools for multi-thousand cores computer systems
  • Grant number:
    RGPIN-2017-05634
  • Fiscal year:
    2022
  • Funding amount:
    $20,400
  • Program type:
    Discovery Grants Program - Individual
Reinventing the tuning and debugging tools for multi-thousand cores computer systems
  • Grant number:
    RGPIN-2017-05634
  • Fiscal year:
    2021
  • Funding amount:
    $20,400
  • Program type:
    Discovery Grants Program - Individual
Reinventing the tuning and debugging tools for multi-thousand cores computer systems
  • Grant number:
    RGPIN-2017-05634
  • Fiscal year:
    2020
  • Funding amount:
    $20,400
  • Program type:
    Discovery Grants Program - Individual
Reinventing the tuning and debugging tools for multi-thousand cores computer systems
  • Grant number:
    RGPIN-2017-05634
  • Fiscal year:
    2019
  • Funding amount:
    $20,400
  • Program type:
    Discovery Grants Program - Individual
Reinventing the tuning and debugging tools for multi-thousand cores computer systems
  • Grant number:
    RGPIN-2017-05634
  • Fiscal year:
    2017
  • Funding amount:
    $20,400
  • Program type:
    Discovery Grants Program - Individual
Integrated tracing, profiling and debugging for tuning large heterogeneous clusters
  • Grant number:
    424666-2011
  • Fiscal year:
    2014
  • Funding amount:
    $20,400
  • Program type:
    Collaborative Research and Development Grants
Integrated tracing, profiling and debugging for tuning large heterogeneous clusters
  • Grant number:
    424666-2011
  • Fiscal year:
    2013
  • Funding amount:
    $20,400
  • Program type:
    Collaborative Research and Development Grants
Integrated tracing, profiling and debugging for tuning large heterogeneous clusters
  • Grant number:
    424666-2011
  • Fiscal year:
    2012
  • Funding amount:
    $20,400
  • Program type:
    Collaborative Research and Development Grants
CAREER: Automated Analysis, Debugging, and Fine-Tuning of Timing Properties in Embedded Real-Time Systems
  • Grant number:
    9875648
  • Fiscal year:
    1999
  • Funding amount:
    $20,400
  • Program type:
    Continuing Grant
Study on Granularity Tuning Mechanism in Fine-Grain Parallel Processing
  • Grant number:
    07458055
  • Fiscal year:
    1995
  • Funding amount:
    $20,400
  • Program type:
    Grant-in-Aid for Scientific Research (B)