Probing and Monitoring live distributed many-core systems

探测和监控实时分布式多核系统

基本信息

  • 批准号:
    36677-2012
  • 负责人:
  • 金额:
    $ 1.82万
  • 依托单位:
  • 依托单位国家:
    加拿大
  • 项目类别:
    Discovery Grants Program - Individual
  • 财政年份:
    2015
  • 资助国家:
    加拿大
  • 起止时间:
    2015-01-01 至 2016-12-31
  • 项目状态:
    已结题

项目摘要

The long term objective of this research is to support the design, tuning, operation and maintenance of sophisticated online many-core distributed computer systems, by providing algorithms and techniques to build advanced probing and monitoring tools. These online distributed systems are used pervasively in the Web, IT and cellular phone infrastructure but prove extremely difficult and costly to debug and tune for design and operation. Thus, tools are needed to precisely monitor, measure and understand the behavior and performance of distributed systems running at full speed under realistic production loads. Indeed, many errors may not be detectable when running at slower speeds and smaller loads because this changes the system timing. To achieve this, the first short term objective is to extract low overhead but detailed execution information from all layers in the execution stack (system level, user-level and virtual machines level) and using all the available sources of information (e.g. software and hardware traces, and performance counters). A second objective is to develop and validate these extraction algorithms to insure that they scale well and maintain their low overhead even on the newer architectures with 50 or more shared memory processors (e.g. Intel MIC chips). Finally, a third objective is to develop a number of specialized trace analysis modules. This will insure that the proposed algorithms and tools will collect all the useful information available, and will do it most efficiently, even on many-core systems, such that the timing data remains valid. The specialized trace analysis modules will allow users to quickly understand the system behavior. The ability to efficiently and accurately monitor distributed many-core systems will enable system designers and operators in the Canadian high technology industry to more rapidly debug and tune distributed applications, thus operating existing systems more efficiently or designing better new systems faster.
这项研究的长期目标是通过提供构建高级探测和监控工具的算法和技术,支持复杂的在线多核分布式计算机系统的设计、调试、操作和维护。这些在线分布式系统在Web、IT和移动电话基础设施中广泛使用,但事实证明,调试和调优设计和操作极其困难和昂贵。因此,需要工具来精确地监视、测量和了解在现实生产负载下全速运行的分布式系统的行为和性能。事实上,当以较慢的速度和较小的负载运行时,可能无法检测到许多错误,因为这会改变系统的时序。 为了实现这一点,第一个短期目标是从执行堆栈中的所有层(系统级、用户级和虚拟机级)提取低开销但详细的执行信息,并使用所有可用的信息来源(例如,软件和硬件跟踪以及性能计数器)。第二个目标是开发和验证这些提取算法,以确保即使在具有50个或更多共享内存处理器(例如Intel MIC芯片)的较新架构上,也能很好地扩展并保持较低的开销。最后,第三个目标是开发一些专门的痕迹分析模块。这将确保拟议的算法和工具将收集所有可用的有用信息,并将以最有效的方式执行,即使在多核系统上也是如此,从而使计时数据保持有效。专门的跟踪分析模块将使用户能够快速了解系统行为。 高效、准确地监控分布式多核系统的能力将使加拿大高科技行业的系统设计师和操作员能够更快地调试和调整分布式应用程序,从而更高效地运行现有系统或更快地设计更好的新系统。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Dagenais, Michel其他文献

An SVM-based framework for detecting DoS attacks in virtualized clouds under changing environment

Dagenais, Michel的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Dagenais, Michel', 18)}}的其他基金

Reinventing the tuning and debugging tools for multi-thousand cores computer systems
重新发明数千核​​计算机系统的调优和调试工具
  • 批准号:
    RGPIN-2017-05634
  • 财政年份:
    2022
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Discovery Grants Program - Individual
Monitoring and Debugging of High Performance Distributed Heterogeneous Cloud Applications
高性能分布式异构云应用的监控和调试
  • 批准号:
    554158-2020
  • 财政年份:
    2021
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Alliance Grants
Reinventing the tuning and debugging tools for multi-thousand cores computer systems
重新发明数千核​​计算机系统的调优和调试工具
  • 批准号:
    RGPIN-2017-05634
  • 财政年份:
    2021
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Discovery Grants Program - Individual
Automated monitoring and debugging of large scale manycore heterogeneous systems
大规模众核异构系统的自动监控和调试
  • 批准号:
    507883-2016
  • 财政年份:
    2020
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Collaborative Research and Development Grants
Monitoring and Debugging of High Performance Distributed Heterogeneous Cloud Applications
高性能分布式异构云应用的监控和调试
  • 批准号:
    554158-2020
  • 财政年份:
    2020
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Alliance Grants
Reinventing the tuning and debugging tools for multi-thousand cores computer systems
重新发明数千核​​计算机系统的调优和调试工具
  • 批准号:
    RGPIN-2017-05634
  • 财政年份:
    2020
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Discovery Grants Program - Individual
Automated monitoring and debugging of large scale manycore heterogeneous systems
大规模众核异构系统的自动监控和调试
  • 批准号:
    507883-2016
  • 财政年份:
    2019
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Collaborative Research and Development Grants
Reinventing the tuning and debugging tools for multi-thousand cores computer systems
重新发明数千核​​计算机系统的调优和调试工具
  • 批准号:
    RGPIN-2017-05634
  • 财政年份:
    2019
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Discovery Grants Program - Individual
Reinventing the tuning and debugging tools for multi-thousand cores computer systems
重新发明数千核​​计算机系统的调优和调试工具
  • 批准号:
    RGPIN-2017-05634
  • 财政年份:
    2018
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Discovery Grants Program - Individual
Automated monitoring and debugging of large scale manycore heterogeneous systems
大规模众核异构系统的自动监控和调试
  • 批准号:
    507883-2016
  • 财政年份:
    2018
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Collaborative Research and Development Grants

相似海外基金

An innovative cyber compliance platform using AI, live monitoring data and machine learning to automate compliance and due diligence completion.
一个创新的网络合规平台,使用人工智能、实时监控数据和机器学习来自动完成合规和尽职调查。
  • 批准号:
    10100493
  • 财政年份:
    2024
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Collaborative R&D
Novel live biomass sensor for improved microbial monitoring, wastewater reactor management and processing, allowing significant energy, carbon emissions, and chemical reduction.
新型活生物质传感器可改善微生物监测、废水反应器管理和处理,从而显着减少能源、碳排放和化学品排放。
  • 批准号:
    10074021
  • 财政年份:
    2023
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Grant for R&D
Continuous Photoacoustic Monitoring of Neonatal Stroke in Intensive Care Unit
重症监护病房新生儿中风的连续光声监测
  • 批准号:
    10548689
  • 财政年份:
    2022
  • 资助金额:
    $ 1.82万
  • 项目类别:
Live monitoring of foreign-body response in animals by diffuse Raman spectroscopy
通过漫射拉曼光谱实时监测动物异物反应
  • 批准号:
    NC/W001179/1
  • 财政年份:
    2021
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Research Grant
Noninvasive, wireless thermal sensors for the quantitative monitoring of ventricular shunt function in patients with hydrocephalus
用于定量监测脑积水患者心室分流功能的无创无线热传感器
  • 批准号:
    10684838
  • 财政年份:
    2021
  • 资助金额:
    $ 1.82万
  • 项目类别:
Noninvasive, wireless thermal sensors for the quantitative monitoring of ventricular shunt function in patients with hydrocephalus
用于定量监测脑积水患者心室分流功能的无创无线热传感器
  • 批准号:
    10619401
  • 财政年份:
    2021
  • 资助金额:
    $ 1.82万
  • 项目类别:
An Interoperable HL7 FHIR-based Medical Device Data System (MDDS) For Accessing And Integrating Live Point-Of-Care Data From High-Acuity Bedside Patient Monitoring Equipment
基于 HL7 FHIR 的可互操作医疗设备数据系统 (MDDS),用于访问和集成来自高敏锐度床边患者监护设备的实时护理点数据
  • 批准号:
    10353084
  • 财政年份:
    2021
  • 资助金额:
    $ 1.82万
  • 项目类别:
Noninvasive, wireless thermal sensors for the quantitative monitoring of ventricular shunt function in patients with hydrocephalus
用于定量监测脑积水患者心室分流功能的无创无线热传感器
  • 批准号:
    10200529
  • 财政年份:
    2021
  • 资助金额:
    $ 1.82万
  • 项目类别:
New Photo-Acoustic Imaging Process in Fetal Monitoring to Dramatically Reduce Brain Injuries in Newborns
胎儿监测中的新光声成像流程可显着减少新生儿脑损伤
  • 批准号:
    10010328
  • 财政年份:
    2020
  • 资助金额:
    $ 1.82万
  • 项目类别:
Live Corrosion Monitoring IoT Development
实时腐蚀监测物联网开发
  • 批准号:
    81883
  • 财政年份:
    2020
  • 资助金额:
    $ 1.82万
  • 项目类别:
    Collaborative R&D
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了