High-performance and scalable communication subsystems for exascale computing
用于百亿亿次计算的高性能和可扩展通信子系统
基本信息
- 批准号:238964-2011
- 负责人:
- 金额:$ 1.6万
- 依托单位:
- 依托单位国家:加拿大
- 项目类别:Discovery Grants Program - Individual
- 财政年份:2014
- 资助国家:加拿大
- 起止时间:2014-01-01 至 2015-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
High-Performance Computing (HPC) is the key to many scientific discoveries and engineering innovations. It is used to tackle computationally-intensive problems in fields as diverse as drug discovery, modeling of global climate system, seismic processing for oil and gas, green energy, genomics and bioinformatics, and astrophysics. Scientific/engineering simulations are mainly written with the Message-Passing Interface (MPI) library. Parallel processes in these simulations compute on their local data while extensively communicating with each other through the MPI library. Communication adversely affects the performance and scalability of MPI applications running on HPC clusters. With the availability of multi-core and soon many-core architectures offering increasing parallelism at all levels, HPC clusters consisting of hundreds of thousands of nodes with millions of cores and complex network topologies are poised to break the Exaflops (10^18 floating point operations per second) barrier in the coming years. With the emergence of such highly hierarchical clusters, MPI has to be optimized for performance and scalability in order to cope with the ever-increasing demands of large-scale simulations. The proposed research is highly original and innovative in the sense that it addresses key issues in MPI, by including topology-awareness for process mapping, by incorporating dedicated queues for partner processes, by quality of service provisioning partner/non-partner traffic, and by developing fiber-based asynchronous progression techniques. The outcome of this research will be relevant to various sectors in Canada, including Environment Canada, Compute Canada, Canada Genome Sciences Centre, oil and gas industries, and ultimately the Canadian public at large. It is expected that the findings from this research will have significant impact on the target community, and that it will lead to new directions for future research. The proposed research is ideal for training HQP in that it has a strong foundation that translates immediately into practical applications and implementations. There is a high demand for graduates in HPC and networking, and the HQP trained will be well positioned to compete for jobs in academia and industry.
高性能计算(HPC)是许多科学发现和工程创新的关键。它被用于解决药物发现、全球气候系统建模、石油和天然气地震处理、绿色能源、基因组学和生物信息学以及天体物理学等领域的计算密集型问题。科学/工程模拟主要使用消息传递接口(MPI)库编写。这些模拟中的并行进程在计算其本地数据的同时,通过MPI库彼此进行广泛的通信。通信会对运行在HPC集群上的MPI应用程序的性能和可伸缩性产生不利影响。随着多核和即将到来的众核架构的可用性,在所有级别上提供越来越多的并行性,由数十万个节点组成的HPC集群具有数百万个核心和复杂的网络拓扑,有望在未来几年内打破Exaflops(每秒10^18次浮点运算)的障碍。随着这种高度层次化集群的出现,MPI必须针对性能和可扩展性进行优化,以科普不断增长的大规模仿真需求。拟议的研究是高度原创性和创新的意义上,它解决了MPI中的关键问题,包括拓扑感知的进程映射,通过将专用队列的合作伙伴进程,通过服务质量供应合作伙伴/非合作伙伴的流量,并通过开发基于光纤的异步进展技术。这项研究的结果将与加拿大的各个部门有关,包括加拿大环境部、加拿大计算部、加拿大基因组科学中心、石油和天然气工业,最终与加拿大公众有关。预计本研究的结果将对目标社区产生重大影响,并为未来的研究提供新的方向。拟议的研究是理想的培训HQP,因为它有一个强大的基础,立即转化为实际应用和实施。HPC和网络专业的毕业生需求量很大,经过培训的HQP将能够很好地竞争学术界和工业界的工作。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Afsahi, Ahmad其他文献
Accelerating Deep Learning Using Interconnect-Aware UCX Communication for MPI Collectives
- DOI:
10.1109/mm.2022.3148670 - 发表时间:
2022-03-01 - 期刊:
- 影响因子:3.6
- 作者:
Temucin, Yltan Hassan;Sojoodi, Amir Hossein;Afsahi, Ahmad - 通讯作者:
Afsahi, Ahmad
Afsahi, Ahmad的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Afsahi, Ahmad', 18)}}的其他基金
Efficient and Scalable Communication and System Software for Exascale Computing
用于百亿亿次计算的高效且可扩展的通信和系统软件
- 批准号:
RGPIN-2016-05389 - 财政年份:2021
- 资助金额:
$ 1.6万 - 项目类别:
Discovery Grants Program - Individual
Efficient and Scalable Communication and System Software for Exascale Computing
用于百亿亿次计算的高效且可扩展的通信和系统软件
- 批准号:
RGPIN-2016-05389 - 财政年份:2020
- 资助金额:
$ 1.6万 - 项目类别:
Discovery Grants Program - Individual
Efficient and Scalable Communication and System Software for Exascale Computing
用于百亿亿次计算的高效且可扩展的通信和系统软件
- 批准号:
RGPIN-2016-05389 - 财政年份:2019
- 资助金额:
$ 1.6万 - 项目类别:
Discovery Grants Program - Individual
Efficient and Scalable Communication and System Software for Exascale Computing
用于百亿亿次计算的高效且可扩展的通信和系统软件
- 批准号:
RGPIN-2016-05389 - 财政年份:2018
- 资助金额:
$ 1.6万 - 项目类别:
Discovery Grants Program - Individual
Efficient and Scalable Communication and System Software for Exascale Computing
用于百亿亿次计算的高效且可扩展的通信和系统软件
- 批准号:
RGPIN-2016-05389 - 财政年份:2017
- 资助金额:
$ 1.6万 - 项目类别:
Discovery Grants Program - Individual
Efficient and Scalable Communication and System Software for Exascale Computing
用于百亿亿次计算的高效且可扩展的通信和系统软件
- 批准号:
RGPIN-2016-05389 - 财政年份:2016
- 资助金额:
$ 1.6万 - 项目类别:
Discovery Grants Program - Individual
High-performance and scalable communication subsystems for exascale computing
用于百亿亿次计算的高性能和可扩展通信子系统
- 批准号:
238964-2011 - 财政年份:2015
- 资助金额:
$ 1.6万 - 项目类别:
Discovery Grants Program - Individual
High-performance and scalable communication subsystems for exascale computing
用于百亿亿次计算的高性能和可扩展通信子系统
- 批准号:
238964-2011 - 财政年份:2013
- 资助金额:
$ 1.6万 - 项目类别:
Discovery Grants Program - Individual
High-performance and scalable communication subsystems for exascale computing
用于百亿亿次计算的高性能和可扩展通信子系统
- 批准号:
238964-2011 - 财政年份:2012
- 资助金额:
$ 1.6万 - 项目类别:
Discovery Grants Program - Individual
High-performance and scalable communication subsystems for exascale computing
用于百亿亿次计算的高性能和可扩展通信子系统
- 批准号:
238964-2011 - 财政年份:2011
- 资助金额:
$ 1.6万 - 项目类别:
Discovery Grants Program - Individual
相似国自然基金
Scalable Learning and Optimization: High-dimensional Models and Online Decision-Making Strategies for Big Data Analysis
- 批准号:
- 批准年份:2024
- 资助金额:万元
- 项目类别:合作创新研究团队
相似海外基金
DIrectional and SCalable (DISC) Microelectrode Array for Speech Decoding
用于语音解码的定向和可扩展 (DISC) 微电极阵列
- 批准号:
10513043 - 财政年份:2023
- 资助金额:
$ 1.6万 - 项目类别:
Examining the electroencephalographic fingerprint of default mode network hyperconnectivity for scalable and personalized neurofeedback in schizophrenia
检查默认模式网络超连接的脑电图指纹,以实现精神分裂症的可扩展和个性化神经反馈
- 批准号:
10509002 - 财政年份:2022
- 资助金额:
$ 1.6万 - 项目类别:
Sub-100 nm and scalable self-therapeutic nanoparticles to target autophagy in pancreatic cancer
亚 100 nm 且可扩展的自我治疗纳米颗粒可靶向胰腺癌的自噬
- 批准号:
10604147 - 财政年份:2022
- 资助金额:
$ 1.6万 - 项目类别:
Examining the electroencephalographic fingerprint of default mode network hyperconnectivity for scalable and personalized neurofeedback in schizophrenia
检查默认模式网络超连接的脑电图指纹,以实现精神分裂症的可扩展和个性化神经反馈
- 批准号:
10675554 - 财政年份:2022
- 资助金额:
$ 1.6万 - 项目类别:
Health 360x Clinical Research Platform for Scalable Access to Clinical Trials
Health 360x 临床研究平台可扩展临床试验的访问
- 批准号:
10624966 - 财政年份:2021
- 资助金额:
$ 1.6万 - 项目类别:
A scalable service to improve health care quality through precision audit and feedback
通过精确审核和反馈提高医疗保健质量的可扩展服务
- 批准号:
10704164 - 财政年份:2021
- 资助金额:
$ 1.6万 - 项目类别:
Health 360x Clinical Research Platform for Scalable Access to Clinical Trials
Health 360x 临床研究平台可扩展临床试验的访问
- 批准号:
10515803 - 财政年份:2021
- 资助金额:
$ 1.6万 - 项目类别:
A scalable service to improve health care quality through precision audit and feedback
通过精确审核和反馈提高医疗保健质量的可扩展服务
- 批准号:
10342937 - 财政年份:2021
- 资助金额:
$ 1.6万 - 项目类别:
Health 360x Clinical Research Platform for Scalable Access to Clinical Trials
Health 360x 临床研究平台可扩展临床试验的访问
- 批准号:
10258627 - 财政年份:2021
- 资助金额:
$ 1.6万 - 项目类别:
A scalable aptamer-based electrochemical biosensor for rapid detection of SARS-CoV-2 from saliva
一种基于适配体的可扩展电化学生物传感器,用于快速检测唾液中的 SARS-CoV-2
- 批准号:
10320990 - 财政年份:2020
- 资助金额:
$ 1.6万 - 项目类别: