High-performance and scalable communication subsystems for exascale computing
用于百亿亿次计算的高性能和可扩展通信子系统
基本信息
- 批准号:238964-2011
- 负责人:
- 金额:$ 1.6万
- 依托单位:
- 依托单位国家:加拿大
- 项目类别:Discovery Grants Program - Individual
- 财政年份:2013
- 资助国家:加拿大
- 起止时间:2013-01-01 至 2014-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
High-Performance Computing (HPC) is the key to many scientific discoveries and engineering innovations. It is used to tackle computationally-intensive problems in fields as diverse as drug discovery, modeling of global climate system, seismic processing for oil and gas, green energy, genomics and bioinformatics, and astrophysics. Scientific/engineering simulations are mainly written with the Message-Passing Interface (MPI) library. Parallel processes in these simulations compute on their local data while extensively communicating with each other through the MPI library. Communication adversely affects the performance and scalability of MPI applications running on HPC clusters. With the availability of multi-core and soon many-core architectures offering increasing parallelism at all levels, HPC clusters consisting of hundreds of thousands of nodes with millions of cores and complex network topologies are poised to break the Exaflops (10^18 floating point operations per second) barrier in the coming years. With the emergence of such highly hierarchical clusters, MPI has to be optimized for performance and scalability in order to cope with the ever-increasing demands of large-scale simulations. The proposed research is highly original and innovative in the sense that it addresses key issues in MPI, by including topology-awareness for process mapping, by incorporating dedicated queues for partner processes, by quality of service provisioning partner/non-partner traffic, and by developing fiber-based asynchronous progression techniques. The outcome of this research will be relevant to various sectors in Canada, including Environment Canada, Compute Canada, Canada Genome Sciences Centre, oil and gas industries, and ultimately the Canadian public at large. It is expected that the findings from this research will have significant impact on the target community, and that it will lead to new directions for future research. The proposed research is ideal for training HQP in that it has a strong foundation that translates immediately into practical applications and implementations. There is a high demand for graduates in HPC and networking, and the HQP trained will be well positioned to compete for jobs in academia and industry.
高性能计算(HPC)是许多科学发现和工程创新的关键。它用于解决药物发现、全球气候系统建模、石油和天然气地震处理、绿色能源、基因组学和生物信息学以及天体物理学等领域的计算密集型问题。科学/工程模拟主要使用消息传递接口(MPI)库编写。这些模拟中的并行进程根据其本地数据进行计算,同时通过MPI库进行广泛的相互通信。通信会对在HPC群集上运行的MPI应用程序的性能和可扩展性产生不利影响。随着多核和即将出现的多核体系结构在所有级别上提供越来越多的并行性,由具有数百万个核心和复杂网络拓扑的数十万个节点组成的HPC集群有望在未来几年打破Exaflop(每秒10^18次浮点运算)的障碍。随着这种高层次集群的出现,MPI必须在性能和可扩展性上进行优化,以应对日益增长的大规模仿真需求。拟议的研究具有很高的原创性和创新性,因为它解决了MPI中的关键问题,包括用于流程映射的拓扑感知,通过纳入合作伙伴流程的专用队列,通过服务质量提供合作伙伴/非合作伙伴流量,以及通过开发基于光纤的异步进程技术。这项研究的结果将与加拿大的各个部门相关,包括加拿大环境部、加拿大计算机公司、加拿大基因组科学中心、石油和天然气行业,以及最终的加拿大公众。预计这项研究的结果将对目标社区产生重大影响,并将为未来的研究带来新的方向。拟议的研究是培训HQP的理想选择,因为它有坚实的基础,可以立即转化为实际应用和实施。HPC和网络领域的毕业生需求量很大,所培养的HQP将在学术界和工业界的工作竞争中处于有利地位。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Afsahi, Ahmad其他文献
Accelerating Deep Learning Using Interconnect-Aware UCX Communication for MPI Collectives
- DOI:
10.1109/mm.2022.3148670 - 发表时间:
2022-03-01 - 期刊:
- 影响因子:3.6
- 作者:
Temucin, Yltan Hassan;Sojoodi, Amir Hossein;Afsahi, Ahmad - 通讯作者:
Afsahi, Ahmad
Afsahi, Ahmad的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Afsahi, Ahmad', 18)}}的其他基金
Efficient and Scalable Communication and System Software for Exascale Computing
用于百亿亿次计算的高效且可扩展的通信和系统软件
- 批准号:
RGPIN-2016-05389 - 财政年份:2021
- 资助金额:
$ 1.6万 - 项目类别:
Discovery Grants Program - Individual
Efficient and Scalable Communication and System Software for Exascale Computing
用于百亿亿次计算的高效且可扩展的通信和系统软件
- 批准号:
RGPIN-2016-05389 - 财政年份:2020
- 资助金额:
$ 1.6万 - 项目类别:
Discovery Grants Program - Individual
Efficient and Scalable Communication and System Software for Exascale Computing
用于百亿亿次计算的高效且可扩展的通信和系统软件
- 批准号:
RGPIN-2016-05389 - 财政年份:2019
- 资助金额:
$ 1.6万 - 项目类别:
Discovery Grants Program - Individual
Efficient and Scalable Communication and System Software for Exascale Computing
用于百亿亿次计算的高效且可扩展的通信和系统软件
- 批准号:
RGPIN-2016-05389 - 财政年份:2018
- 资助金额:
$ 1.6万 - 项目类别:
Discovery Grants Program - Individual
Efficient and Scalable Communication and System Software for Exascale Computing
用于百亿亿次计算的高效且可扩展的通信和系统软件
- 批准号:
RGPIN-2016-05389 - 财政年份:2017
- 资助金额:
$ 1.6万 - 项目类别:
Discovery Grants Program - Individual
Efficient and Scalable Communication and System Software for Exascale Computing
用于百亿亿次计算的高效且可扩展的通信和系统软件
- 批准号:
RGPIN-2016-05389 - 财政年份:2016
- 资助金额:
$ 1.6万 - 项目类别:
Discovery Grants Program - Individual
High-performance and scalable communication subsystems for exascale computing
用于百亿亿次计算的高性能和可扩展通信子系统
- 批准号:
238964-2011 - 财政年份:2015
- 资助金额:
$ 1.6万 - 项目类别:
Discovery Grants Program - Individual
High-performance and scalable communication subsystems for exascale computing
用于百亿亿次计算的高性能和可扩展通信子系统
- 批准号:
238964-2011 - 财政年份:2014
- 资助金额:
$ 1.6万 - 项目类别:
Discovery Grants Program - Individual
High-performance and scalable communication subsystems for exascale computing
用于百亿亿次计算的高性能和可扩展通信子系统
- 批准号:
238964-2011 - 财政年份:2012
- 资助金额:
$ 1.6万 - 项目类别:
Discovery Grants Program - Individual
High-performance and scalable communication subsystems for exascale computing
用于百亿亿次计算的高性能和可扩展通信子系统
- 批准号:
238964-2011 - 财政年份:2011
- 资助金额:
$ 1.6万 - 项目类别:
Discovery Grants Program - Individual
相似国自然基金
Scalable Learning and Optimization: High-dimensional Models and Online Decision-Making Strategies for Big Data Analysis
- 批准号:
- 批准年份:2024
- 资助金额:万元
- 项目类别:合作创新研究团队
相似海外基金
DIrectional and SCalable (DISC) Microelectrode Array for Speech Decoding
用于语音解码的定向和可扩展 (DISC) 微电极阵列
- 批准号:
10513043 - 财政年份:2023
- 资助金额:
$ 1.6万 - 项目类别:
Examining the electroencephalographic fingerprint of default mode network hyperconnectivity for scalable and personalized neurofeedback in schizophrenia
检查默认模式网络超连接的脑电图指纹,以实现精神分裂症的可扩展和个性化神经反馈
- 批准号:
10509002 - 财政年份:2022
- 资助金额:
$ 1.6万 - 项目类别:
Sub-100 nm and scalable self-therapeutic nanoparticles to target autophagy in pancreatic cancer
亚 100 nm 且可扩展的自我治疗纳米颗粒可靶向胰腺癌的自噬
- 批准号:
10604147 - 财政年份:2022
- 资助金额:
$ 1.6万 - 项目类别:
Examining the electroencephalographic fingerprint of default mode network hyperconnectivity for scalable and personalized neurofeedback in schizophrenia
检查默认模式网络超连接的脑电图指纹,以实现精神分裂症的可扩展和个性化神经反馈
- 批准号:
10675554 - 财政年份:2022
- 资助金额:
$ 1.6万 - 项目类别:
Health 360x Clinical Research Platform for Scalable Access to Clinical Trials
Health 360x 临床研究平台可扩展临床试验的访问
- 批准号:
10624966 - 财政年份:2021
- 资助金额:
$ 1.6万 - 项目类别:
A scalable service to improve health care quality through precision audit and feedback
通过精确审核和反馈提高医疗保健质量的可扩展服务
- 批准号:
10704164 - 财政年份:2021
- 资助金额:
$ 1.6万 - 项目类别:
Health 360x Clinical Research Platform for Scalable Access to Clinical Trials
Health 360x 临床研究平台可扩展临床试验的访问
- 批准号:
10515803 - 财政年份:2021
- 资助金额:
$ 1.6万 - 项目类别:
A scalable service to improve health care quality through precision audit and feedback
通过精确审核和反馈提高医疗保健质量的可扩展服务
- 批准号:
10342937 - 财政年份:2021
- 资助金额:
$ 1.6万 - 项目类别:
Health 360x Clinical Research Platform for Scalable Access to Clinical Trials
Health 360x 临床研究平台可扩展临床试验的访问
- 批准号:
10258627 - 财政年份:2021
- 资助金额:
$ 1.6万 - 项目类别:
A scalable aptamer-based electrochemical biosensor for rapid detection of SARS-CoV-2 from saliva
一种基于适配体的可扩展电化学生物传感器,用于快速检测唾液中的 SARS-CoV-2
- 批准号:
10320990 - 财政年份:2020
- 资助金额:
$ 1.6万 - 项目类别: