Techniques for Fault-Tolerant Communication in Parallel Computers
并行计算机中的容错通信技术
基本信息
- 批准号:9525887
- 负责人:
- 金额:$ 15.38万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:1996
- 资助国家:美国
- 起止时间:1996-01-01 至 1999-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
The project studies techniques for performing efficient communication in parallel computers that have faulty components. Parallel computers with a large number of identical processors have a great potential for fault-tolerance. The main challenge that must be overcome in order to exploit this potential for fault- tolerance is the fact that faults also affect communications between processors. Hardware faults in processors, routers, and communication links force messages to take new paths that avoid the faulty components. In addition, the mapping of computations from faulty processors to nonfaulty ones changes the pattern of communication. As a result, faults can have a major impact on the performance, and even correctness, of the communication subsystem in a parallel computer. Specifically, the project focuses on the following two key issues; creating efficient, deadlock-free routing algorithms for parallel computers with faulty components, and developing efficient mappings of communication-intensive real-time applications to parallel computers that contain faults. This research will result in more efficient use of parallel computers and a better understanding of how their inherent potential for fault-tolerance can be utilized.
该项目研究在具有故障组件的并行计算机中执行有效通信的技术。 具有大量相同处理器的并行计算机具有很大的容错潜力。 为了利用这种容错潜力,必须克服的主要挑战是故障也会影响处理器之间的通信。 处理器、路由器和通信链路中的硬件故障会迫使消息采取新的路径,以避开故障组件。 此外,从故障处理器到非故障处理器的计算映射改变了通信模式。 因此,故障可能对并行计算机中通信子系统的性能甚至正确性产生重大影响。 具体而言,该项目侧重于以下两个关键问题:创建高效的,无死锁的路由算法的并行计算机故障组件,并开发通信密集型实时应用程序的高效映射到包含故障的并行计算机。 这项研究将导致更有效地利用并行计算机和更好地了解如何利用其固有的容错潜力。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Robert Cypher其他文献
Data reduction and fast routing: A strategy for efficient algorithms for message-passing parallel computers
- DOI:
10.1007/bf01758752 - 发表时间:
1992-06-01 - 期刊:
- 影响因子:0.700
- 作者:
Jorge L. C. Sanz;Robert Cypher - 通讯作者:
Robert Cypher
A quantitative study of parallel scientific applications with explicit communication
- DOI:
10.1007/bf00128097 - 发表时间:
1996-01-01 - 期刊:
- 影响因子:2.700
- 作者:
Robert Cypher;Alex Ho;Smaragda Konstantinidou;Paul Messina - 通讯作者:
Paul Messina
Robert Cypher的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Robert Cypher', 18)}}的其他基金
Principles and Mechanisms for Asynchronous Parallel Computing
异步并行计算原理与机制
- 批准号:
9522301 - 财政年份:1996
- 资助金额:
$ 15.38万 - 项目类别:
Continuing Grant
相似海外基金
Development on Fault Diagnosis and Fault-Tolerant Cooperative Control Techniques with Applications to Safety-Critical Systems
故障诊断和容错协同控制技术的发展及其在安全关键系统中的应用
- 批准号:
RGPIN-2017-06680 - 财政年份:2022
- 资助金额:
$ 15.38万 - 项目类别:
Discovery Grants Program - Individual
Development on Fault Diagnosis and Fault-Tolerant Cooperative Control Techniques with Applications to Safety-Critical Systems
故障诊断和容错协同控制技术的发展及其在安全关键系统中的应用
- 批准号:
RGPIN-2017-06680 - 财政年份:2021
- 资助金额:
$ 15.38万 - 项目类别:
Discovery Grants Program - Individual
Development on Fault Diagnosis and Fault-Tolerant Cooperative Control Techniques with Applications to Safety-Critical Systems
故障诊断和容错协同控制技术的发展及其在安全关键系统中的应用
- 批准号:
RGPIN-2017-06680 - 财政年份:2020
- 资助金额:
$ 15.38万 - 项目类别:
Discovery Grants Program - Individual
Development on Fault Diagnosis and Fault-Tolerant Cooperative Control Techniques with Applications to Safety-Critical Systems
故障诊断和容错协同控制技术的发展及其在安全关键系统中的应用
- 批准号:
RGPIN-2017-06680 - 财政年份:2019
- 资助金额:
$ 15.38万 - 项目类别:
Discovery Grants Program - Individual
Development on Fault Diagnosis and Fault-Tolerant Cooperative Control Techniques with Applications to Safety-Critical Systems
故障诊断和容错协同控制技术的发展及其在安全关键系统中的应用
- 批准号:
RGPIN-2017-06680 - 财政年份:2018
- 资助金额:
$ 15.38万 - 项目类别:
Discovery Grants Program - Individual
Development on Fault Diagnosis and Fault-Tolerant Cooperative Control Techniques with Applications to Safety-Critical Systems
故障诊断和容错协同控制技术的发展及其在安全关键系统中的应用
- 批准号:
RGPIN-2017-06680 - 财政年份:2017
- 资助金额:
$ 15.38万 - 项目类别:
Discovery Grants Program - Individual
"Development and Validation of Fault-tolerant and Cooperative Guidance, Navigation and Control Techniques with Unmanned Systems"
“无人系统容错协作制导、导航和控制技术的开发和验证”
- 批准号:
341915-2012 - 财政年份:2016
- 资助金额:
$ 15.38万 - 项目类别:
Discovery Grants Program - Individual
"Development and Validation of Fault-tolerant and Cooperative Guidance, Navigation and Control Techniques with Unmanned Systems"
“无人系统容错协作制导、导航和控制技术的开发和验证”
- 批准号:
341915-2012 - 财政年份:2015
- 资助金额:
$ 15.38万 - 项目类别:
Discovery Grants Program - Individual
"Development and Validation of Fault-tolerant and Cooperative Guidance, Navigation and Control Techniques with Unmanned Systems"
“无人系统容错协作制导、导航和控制技术的开发和验证”
- 批准号:
341915-2012 - 财政年份:2014
- 资助金额:
$ 15.38万 - 项目类别:
Discovery Grants Program - Individual
"Development and Validation of Fault-tolerant and Cooperative Guidance, Navigation and Control Techniques with Unmanned Systems"
“无人系统容错协作制导、导航和控制技术的开发和验证”
- 批准号:
341915-2012 - 财政年份:2013
- 资助金额:
$ 15.38万 - 项目类别:
Discovery Grants Program - Individual