Automated monitoring and debugging of large scale manycore heterogeneous systems

大规模众核异构系统的自动监控和调试

基本信息

  • 批准号:
    507883-2016
  • 负责人:
  • 金额:
    $ 18.5万
  • 依托单位:
  • 依托单位国家:
    加拿大
  • 项目类别:
    Collaborative Research and Development Grants
  • 财政年份:
    2019
  • 资助国家:
    加拿大
  • 起止时间:
    2019-01-01 至 2020-12-31
  • 项目状态:
    已结题

项目摘要

The communication and computing infrastructure has evolved through the years, getting more efficient, sophisticated, integrated and networked. Newer mobile devices (including smart robots or autonomous cars) and servers often contain 8 or more cores in their central processing unit. These systems are based on heterogeneous processors, with efficient traditional central processing units, but also with co-processing units optimised for graphics (GPGPUs with thousands of cores), networking, signal processing or even for Machine Learning. These co-processing units are highly parallel and often contain over 8 billion logic elements (transistors) each. Adding to this complexity is the increasing reliance on virtualisation, which hides the specificities of the hardware, allowing an application to run on several different processor models, but makes the performance more difficult to analyse. As a result, even a simple operation such as initiating a phone call, making a Web search, routing a packet or displaying a video frame, can involve many parallel cores on more than one processing unit, possibly on several servers. Moreover, the same operation, a few seconds later, may be served in a different way by different cores and physical servers. Therefore, understanding the performance of these operations has become extremely difficult and the tools for that purpose are severely lacking. In this project, the tracing, monitoring, profiling and debugging tools for manycore systems will be extended to efficiently extract information from all units in all layers, from the hardware to the application, and cope with the large number (several thousands) of cores. Furthermore, new methods and algorithms will be developed to automate the analysis of the extracted monitoring data. As a result, the designers and operators of distributed applications on mobile devices, cloud servers and other heterogeneous computing systems, will have the tools in hand to quickly analyse their system performance, automatically or manually find problems, and optimise operations.
多年来,通信和计算基础架构一直在发展,变得更加高效,复杂,集成和网络。较新的移动设备(包括智能机器人或自动驾驶汽车)和服务器通常在其中央处理单元中包含8个或更多核心。这些系统基于异构处理器,具有有效的传统中央处理单元,但也具有针对图形优化(具有数千个内核的GPGPU),网络,信号处理甚至机器学习的协调单元。这些协调单元高度平行,并且通常包含超过80亿个逻辑元素(晶体管)。加上这种复杂性的是对虚拟化的依赖越来越多,它隐藏了硬件的特殊性,从而使应用程序可以在几种不同的处理器模型上运行,但使得性能更加难以分析。结果,即使是一个简单的操作,例如启动电话,进行网络搜索,路由数据包或显示视频框架,也可能涉及多个处理单元上的许多并行内核,可能是在多个服务器上。此外,几秒钟后的同一操作可以由不同的核心和物理服务器以不同的方式提供。因此,了解这些操作的性能变得非常困难,并且严重缺乏该目的的工具。在此项目中,将扩展到许多核系统的跟踪,监视,分析和调试工具,以从所有层中的所有单元(从硬件到应用程序)中的所有单元中有效提取信息,并应对大量(数千个)内核。此外,将开发新的方法和算法来自动化提取的监视数据的分析。结果,在移动设备,云服务器和其他异质计算系统上分布式应用程序的设计师和运营商将拥有手中的工具来快速分析其系统性能,自动或手动发现问题并优化操作。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Dagenais, Michel其他文献

An SVM-based framework for detecting DoS attacks in virtualized clouds under changing environment
Efficient Model to Query and Visualize the System States Extracted from Trace Data
  • DOI:
    10.1007/978-3-642-40787-1_13
  • 发表时间:
    2013-01-01
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Montplaisir, Alexandre;Ezzati-Jivan, Naser;Dagenais, Michel
  • 通讯作者:
    Dagenais, Michel
Optimum off-line trace synchronization of computer clusters
  • DOI:
    10.1088/1742-6596/341/1/012029
  • 发表时间:
    2012-01-01
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Jabbarifar, Masoume;Dagenais, Michel;Sendi, Alireza Shameli
  • 通讯作者:
    Sendi, Alireza Shameli

Dagenais, Michel的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Dagenais, Michel', 18)}}的其他基金

Reinventing the tuning and debugging tools for multi-thousand cores computer systems
重新发明数千核​​计算机系统的调优和调试工具
  • 批准号:
    RGPIN-2017-05634
  • 财政年份:
    2022
  • 资助金额:
    $ 18.5万
  • 项目类别:
    Discovery Grants Program - Individual
Monitoring and Debugging of High Performance Distributed Heterogeneous Cloud Applications
高性能分布式异构云应用的监控和调试
  • 批准号:
    554158-2020
  • 财政年份:
    2021
  • 资助金额:
    $ 18.5万
  • 项目类别:
    Alliance Grants
Reinventing the tuning and debugging tools for multi-thousand cores computer systems
重新发明数千核​​计算机系统的调优和调试工具
  • 批准号:
    RGPIN-2017-05634
  • 财政年份:
    2021
  • 资助金额:
    $ 18.5万
  • 项目类别:
    Discovery Grants Program - Individual
Automated monitoring and debugging of large scale manycore heterogeneous systems
大规模众核异构系统的自动监控和调试
  • 批准号:
    507883-2016
  • 财政年份:
    2020
  • 资助金额:
    $ 18.5万
  • 项目类别:
    Collaborative Research and Development Grants
Monitoring and Debugging of High Performance Distributed Heterogeneous Cloud Applications
高性能分布式异构云应用的监控和调试
  • 批准号:
    554158-2020
  • 财政年份:
    2020
  • 资助金额:
    $ 18.5万
  • 项目类别:
    Alliance Grants
Reinventing the tuning and debugging tools for multi-thousand cores computer systems
重新发明数千核​​计算机系统的调优和调试工具
  • 批准号:
    RGPIN-2017-05634
  • 财政年份:
    2020
  • 资助金额:
    $ 18.5万
  • 项目类别:
    Discovery Grants Program - Individual
Reinventing the tuning and debugging tools for multi-thousand cores computer systems
重新发明数千核​​计算机系统的调优和调试工具
  • 批准号:
    RGPIN-2017-05634
  • 财政年份:
    2019
  • 资助金额:
    $ 18.5万
  • 项目类别:
    Discovery Grants Program - Individual
Reinventing the tuning and debugging tools for multi-thousand cores computer systems
重新发明数千核​​计算机系统的调优和调试工具
  • 批准号:
    RGPIN-2017-05634
  • 财政年份:
    2018
  • 资助金额:
    $ 18.5万
  • 项目类别:
    Discovery Grants Program - Individual
Automated monitoring and debugging of large scale manycore heterogeneous systems
大规模众核异构系统的自动监控和调试
  • 批准号:
    507883-2016
  • 财政年份:
    2018
  • 资助金额:
    $ 18.5万
  • 项目类别:
    Collaborative Research and Development Grants
Automated monitoring and debugging of large scale manycore heterogeneous systems
大规模众核异构系统的自动监控和调试
  • 批准号:
    507883-2016
  • 财政年份:
    2017
  • 资助金额:
    $ 18.5万
  • 项目类别:
    Collaborative Research and Development Grants

相似国自然基金

基于DKK3甲基化调控p21介导的免疫监视探讨芪术抗癌方诱导CD8+T细胞浸润机制
  • 批准号:
    82374531
  • 批准年份:
    2023
  • 资助金额:
    48 万元
  • 项目类别:
    面上项目
肿瘤浸润NK细胞免疫检查点分子促进亚实性结节型肺腺癌逃逸免疫监视的机制研究
  • 批准号:
    82303150
  • 批准年份:
    2023
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目
组织细胞外基质异常对机体肿瘤免疫监视效应的影响及作用机制
  • 批准号:
    32370839
  • 批准年份:
    2023
  • 资助金额:
    50 万元
  • 项目类别:
    面上项目
RGD-68Ga@AuNCs PET监测PRMT5通过VEGFA调节肺腺癌血管新生的功能及机制
  • 批准号:
    82372007
  • 批准年份:
    2023
  • 资助金额:
    48.00 万元
  • 项目类别:
    面上项目
光老化成纤维细胞通过IL6损害树突状细胞免疫监视致黑色素瘤免疫逃逸的分子机制及四君子汤的干预作用
  • 批准号:
    82304938
  • 批准年份:
    2023
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目

相似海外基金

Monitoring and Debugging of High Performance Distributed Heterogeneous Cloud Applications
高性能分布式异构云应用的监控和调试
  • 批准号:
    554158-2020
  • 财政年份:
    2022
  • 资助金额:
    $ 18.5万
  • 项目类别:
    Alliance Grants
Monitoring and Debugging of High Performance Distributed Heterogeneous Cloud Applications
高性能分布式异构云应用的监控和调试
  • 批准号:
    554158-2020
  • 财政年份:
    2021
  • 资助金额:
    $ 18.5万
  • 项目类别:
    Alliance Grants
Automated monitoring and debugging of large scale manycore heterogeneous systems
大规模众核异构系统的自动监控和调试
  • 批准号:
    507883-2016
  • 财政年份:
    2020
  • 资助金额:
    $ 18.5万
  • 项目类别:
    Collaborative Research and Development Grants
Monitoring and Debugging of High Performance Distributed Heterogeneous Cloud Applications
高性能分布式异构云应用的监控和调试
  • 批准号:
    554158-2020
  • 财政年份:
    2020
  • 资助金额:
    $ 18.5万
  • 项目类别:
    Alliance Grants
Automated monitoring and debugging of large scale manycore heterogeneous systems
大规模众核异构系统的自动监控和调试
  • 批准号:
    507883-2016
  • 财政年份:
    2018
  • 资助金额:
    $ 18.5万
  • 项目类别:
    Collaborative Research and Development Grants
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了