Reinventing the tuning and debugging tools for multi-thousand cores computer systems
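Basic information
- Grant number: RGPIN-2017-05634
- Principal investigator:
- Amount: $20,400
- Host institution:
- Host institution country: Canada
- Project category: Discovery Grants Program - Individual
- Fiscal year: 2022
- Funding country: Canada
- Start and end dates: 2022-01-01 to 2023-12-31
- Project status: Completed
- Source:
- Keywords:
Project abstract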
The top supercomputers in recent years owe much of their power to the number-crunching ability of Graphics Processing Units (GPUs) used for general-purpose computation (GPGPU). However, GPGPUs are also frequently used in smaller clusters and individual workstations for demanding tasks. Even in mobile phones and embedded systems, GPGPUs are an efficient means of accomplishing specialised tasks such as image processing. Furthermore, GPGPUs are at the forefront of a wider trend towards heterogeneous system architecture (HSA), where specialised co-processors are used for a number of tasks beyond graphics and video, such as signal processing and network packet processing.
The internal architecture of these complex GPGPU devices, with over 4000 cores and arithmetic units, was not well documented until recently. The architecture has evolved significantly in recent years, with much tighter integration with the main CPU cores, offering shared virtual memory and user-level queues. The problem is that the programming tools for debugging and tuning these systems are severely lacking. The objective of the proposed research is to devise new, more efficient algorithms and software architectures for debugging, tracing and profiling tools, such that they can adequately cope with hardware architectures that boast thousands of cores and put strong pressure on the available memory and bandwidth.
The challenges are at several levels: ensuring that the tracing and profiling tools impose minimal overhead on the system under study, so as not to affect its behaviour; efficiently exploiting all the available tracing and profiling hardware assistance; and efficiently interfacing the debugging, tracing and profiling tools with the operating system, device driver and application to be monitored.
The significance and novelty of this work lie in addressing a serious challenge currently faced by all users of such systems: having suitable tools to efficiently debug and tune their applications on these systems. It is widely recognised that the extremely rapid advances in the availability of heterogeneous parallel processing hardware have not been matched by equivalent progress in software development tools. As a result, a large number of users are not achieving the level of performance that would be attainable with proper tools, or are simply not taking advantage of such hardware, because of the difficulty of efficiently programming, debugging and tuning these devices. The proposed research program will thus have a significant impact, helping all users of sophisticated computing devices to better exploit the available hardware.
Project outcomes
Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other publications by Dagenais, Michel
An SVM-based framework for detecting DoS attacks in virtualized clouds under changing environment
- DOI: 10.1186/s13677-018-0109-4
- Publication date: 2018-04-13
- Journal:
- Impact factor: 4
- Authors: Abusitta, Adel; Bellaiche, Martine; Dagenais, Michel
- Corresponding author: Dagenais, Michel
Efficient Model to Query and Visualize the System States Extracted from Trace Data
- DOI: 10.1007/978-3-642-40787-1_13
- Publication date: 2013-01-01
- Journal:
- Impact factor: 0
- Authors: Montplaisir, Alexandre; Ezzati-Jivan, Naser; Dagenais, Michel
- Corresponding author: Dagenais, Michel
Optimum off-line trace synchronization of computer clusters
- DOI: 10.1088/1742-6596/341/1/012029
- Publication date: 2012-01-01
- Journal:
- Impact factor: 0
- Authors: Jabbarifar, Masoume; Dagenais, Michel; Sendi, Alireza Shameli
- Corresponding author: Sendi, Alireza Shameli
Other grants held by Dagenais, Michel
Monitoring and Debugging of High Performance Distributed Heterogeneous Cloud Applications
- Grant number: 554158-2020
- Fiscal year: 2021
- Amount: $20,400
- Project category: Alliance Grants
Reinventing the tuning and debugging tools for multi-thousand cores computer systems
- Grant number: RGPIN-2017-05634
- Fiscal year: 2021
- Amount: $20,400
- Project category: Discovery Grants Program - Individual
Automated monitoring and debugging of large scale manycore heterogeneous systems
- Grant number: 507883-2016
- Fiscal year: 2020
- Amount: $20,400
- Project category: Collaborative Research and Development Grants
Monitoring and Debugging of High Performance Distributed Heterogeneous Cloud Applications
- Grant number: 554158-2020
- Fiscal year: 2020
- Amount: $20,400
- Project category: Alliance Grants
Reinventing the tuning and debugging tools for multi-thousand cores computer systems
- Grant number: RGPIN-2017-05634
- Fiscal year: 2020
- Amount: $20,400
- Project category: Discovery Grants Program - Individual
Automated monitoring and debugging of large scale manycore heterogeneous systems
- Grant number: 507883-2016
- Fiscal year: 2019
- Amount: $20,400
- Project category: Collaborative Research and Development Grants
Reinventing the tuning and debugging tools for multi-thousand cores computer systems
- Grant number: RGPIN-2017-05634
- Fiscal year: 2019
- Amount: $20,400
- Project category: Discovery Grants Program - Individual
Reinventing the tuning and debugging tools for multi-thousand cores computer systems
- Grant number: RGPIN-2017-05634
- Fiscal year: 2018
- Amount: $20,400
- Project category: Discovery Grants Program - Individual
Automated monitoring and debugging of large scale manycore heterogeneous systems
- Grant number: 507883-2016
- Fiscal year: 2018
- Amount: $20,400
- Project category: Collaborative Research and Development Grants
Automated monitoring and debugging of large scale manycore heterogeneous systems
- Grant number: 507883-2016
- Fiscal year: 2017
- Amount: $20,400
- Project category: Collaborative Research and Development Grants
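Similar National Natural Science Foundation of China (NSFC) grants
Micro-to-macro stress tuning and carrier recombination dynamics in flexible perovskite indoor photovoltaic devices
- Grant number: 62305261
- Approval year: 2023
- Amount: CNY 300,000
- Project category: Young Scientists Fund
High-energy tunable light sources based on multimode nonlinear effects in gas-filled multipass cells
- Grant number: 12374318
- Approval year: 2023
- Amount: CNY 520,000
- Project category: General Program
Vibration energy harvesting and vibration mitigation mechanisms of a self-powered semi-active tuned mass damper
- Grant number: 52308526
- Approval year: 2023
- Amount: CNY 300,000
- Project category: Young Scientists Fund
Lung cancer detection by combined dual-spectrum techniques enhanced with tunable MOF-nanowire composite network microstructures
- Grant number: 12374342
- Approval year: 2023
- Amount: CNY 520,000
- Project category: General Program
Vibration control of large wind turbine tower structures using tuned corrugated liquid column dampers
- Grant number: 52368074
- Approval year: 2023
- Amount: CNY 320,000
- Project category: Regional Science Fund Program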
Similar overseas grants
Reinventing the tuning and debugging tools for multi-thousand cores computer systems
- Grant number: RGPIN-2017-05634
- Fiscal year: 2021
- Amount: $20,400
- Project category: Discovery Grants Program - Individual
Reinventing the tuning and debugging tools for multi-thousand cores computer systems
- Grant number: RGPIN-2017-05634
- Fiscal year: 2020
- Amount: $20,400
- Project category: Discovery Grants Program - Individual
Reinventing the tuning and debugging tools for multi-thousand cores computer systems
- Grant number: RGPIN-2017-05634
- Fiscal year: 2019
- Amount: $20,400
- Project category: Discovery Grants Program - Individual
Reinventing the tuning and debugging tools for multi-thousand cores computer systems
- Grant number: RGPIN-2017-05634
- Fiscal year: 2018
- Amount: $20,400
- Project category: Discovery Grants Program - Individual
Reinventing the tuning and debugging tools for multi-thousand cores computer systems
- Grant number: RGPIN-2017-05634
- Fiscal year: 2017
- Amount: $20,400
- Project category: Discovery Grants Program - Individual