Scheduling Collaborative Computations on Heterogeneous Clusters
异构集群上的协同计算调度
基本信息
- 批准号:0342417
- 负责人:
- 金额:$ 24.2万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2004
- 资助国家:美国
- 起止时间:2004-02-15 至 2008-01-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Rosenberg, AronldUniversity of Massachusetts AmherstCCF-0342417The past decade has seen a paradigm shift in the computing platforms used for a wide variety of computations. Advances in technology and economic considerations have made clusters of workstations or pc's a viable medium for high-performance "collaborative'' computing (wherein many computers cooperate to solve a single computational problem). Many of the algorithmic devices that ensured efficient "collaborative'' computing in earlier computing environments no longer guarantee efficiency within such clusters, especially when the clusters are heterogeneous, in the sense that their workstations may differ in computational power. The program of research proposed herein is dedicated to achieving provably efficient computation in heterogeneous clusters, via efficient algorithms for large classes of important computations. The first component of the goal of "provably efficient computation'' will be the use of a combination of analytical and experimental methods to develop the insights needed to craft scheduling algorithms whose performance is predictable via rigorous mathematical and/or statistical analyses. The second component of the goal will be to validate the abstract models used to formulate and analyze algorithms. Implementations of algorithms on actual clusters, and comparison of measured performance against the predictions of the abstract algorithmic models, will be used to validate or, where necessary, to modify the models. The scheduling algorithms and, when appropriate, implementations thereof that result from this process will be the main product of the research.The proposed research will focus on "ad hoc'' clusters, which are assembled from "off the shelf'' workstations or pc's, interconnected via local-area networks. The major scheduling challenges here reside in the substantial cost of interworkstation communication and in the cluster's (likely) heterogeneity, i.e., the differences in computing power of its constituent workstations. Initial studies have led to a mathematical framework for the proposed research.Technical impact.. The proposed work will provide rigorously validated guidelines forscheduling a broad variety of significant computational problems efficiently on heterogeneous clusters. Research results will be disseminated in leading conferences and journals, and will beincorporated into courses and seminars. Via the training of students, and via collaborations and technical interactions with colleagues, both in the US and abroad, the work will lead to yet further technical progress. (There are ongoing collaborations with colleagues in Australia, France, and Italy.)Broader impacts. Prior support from NSF has led to the incorporation of cutting-edgetechnical material into independent studies, courses, and research seminars, which have been part of the training of several generations of students in several departments at U Mass Amherst. All but one of the PI's doctoral students have pursued (successful) academic careers at colleges or universities in the US. Two recent doctoral students have been women: one was born in China and one in eastern Europe; both are successfully pursuing careers in US educational institutions.Results from the proposed research will be incorporated into the educational program in the same way that prior results have been.
在过去的十年中,我们看到了用于各种计算的计算平台的范式转变。技术的进步和经济的考虑使得工作站集群或pc成为高性能“协作”计算(其中许多计算机合作解决单个计算问题)的可行媒介。在早期的计算环境中确保高效“协作”计算的许多算法设备不再保证此类集群中的效率,特别是当集群是异构的时候,因为它们的工作站可能在计算能力上有所不同。本文提出的研究方案致力于在异构集群中实现可证明的高效计算,通过高效的算法进行大规模的重要计算。“可证明的高效计算”目标的第一个组成部分将是使用分析和实验方法相结合的方法来开发设计调度算法所需的见解,这些算法的性能可以通过严格的数学和/或统计分析来预测。目标的第二个组成部分将是验证用于制定和分析算法的抽象模型。算法在实际集群上的实现,以及将测量的性能与抽象算法模型的预测进行比较,将用于验证或在必要时修改模型。在此过程中产生的调度算法和适当的实现将是研究的主要成果。拟议中的研究将集中在“特设”集群上,即由“现成的”工作站或个人电脑组装而成,通过局域网相互连接。这里的主要调度挑战在于工作站间通信的大量成本和集群(可能)的异构性,即其组成工作站的计算能力的差异。最初的研究已经为拟议的研究提供了一个数学框架。技术的影响。提出的工作将为在异构集群上有效地调度各种重要的计算问题提供严格验证的指导方针。研究成果将在主要会议和期刊上发表,并将纳入课程和研讨会。通过对学生的培训,以及与美国和国外同事的合作和技术互动,这项工作将导致进一步的技术进步。(目前正在与澳大利亚、法国和意大利的同事合作。)更广泛的影响。美国国家科学基金会之前的支持已经导致将尖端技术材料纳入独立研究,课程和研究研讨会,这已经成为马萨诸塞大学阿默斯特分校几个部门几代学生培训的一部分。除了一名博士生外,PI的所有博士生都在美国的学院或大学追求(成功的)学术生涯。最近有两位女博士生:一位出生在中国,另一位出生在东欧;两人都成功地在美国教育机构寻求职业发展。拟议研究的结果将以与先前结果相同的方式纳入教育计划。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Arnold Rosenberg其他文献
Arnold Rosenberg的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Arnold Rosenberg', 18)}}的其他基金
Collaborative Research: CI-ADDO-NEW: Parallel and Distributed Computing Curriculum Development and Educational Resources
合作研究:CI-ADDO-NEW:并行和分布式计算课程开发和教育资源
- 批准号:
1205437 - 财政年份:2012
- 资助金额:
$ 24.2万 - 项目类别:
Standard Grant
CSR: Small: Collaborative Research: Pursuing High Performance on Clouds and Other Dynamically Heterogeneous Computing Platforms
CSR:小型:协作研究:追求云和其他动态异构计算平台的高性能
- 批准号:
1217981 - 财政年份:2012
- 资助金额:
$ 24.2万 - 项目类别:
Standard Grant
Scheduling Parallel Computations in Clusters of Workstations
在工作站集群中调度并行计算
- 批准号:
0073401 - 财政年份:2000
- 资助金额:
$ 24.2万 - 项目类别:
Standard Grant
Orchestrating Communication in High-Latency Parallel Environments
在高延迟并行环境中协调通信
- 批准号:
9710367 - 财政年份:1997
- 资助金额:
$ 24.2万 - 项目类别:
Standard Grant
Algorithmic Support for Parallel Architectures
并行架构的算法支持
- 批准号:
9221785 - 财政年份:1993
- 资助金额:
$ 24.2万 - 项目类别:
Continuing Grant
Emulations among Processor Arrays: A Theoretical Study
处理器阵列之间的仿真:理论研究
- 批准号:
9013184 - 财政年份:1991
- 资助金额:
$ 24.2万 - 项目类别:
Standard Grant
Parallel Architecture: Design, Validation, Implementation
并行架构:设计、验证、实现
- 批准号:
8812567 - 财政年份:1988
- 资助金额:
$ 24.2万 - 项目类别:
Continuing Grant
Theoretical Aspects of Vlsi and Related Graph-Embedding Theory
Vlsi 的理论方面及相关图嵌入理论
- 批准号:
8301213 - 财政年份:1983
- 资助金额:
$ 24.2万 - 项目类别:
Standard Grant
相似海外基金
Collaborative Research: Hardware-Aware Matrix Computations for Deep Learning Applications
协作研究:深度学习应用的硬件感知矩阵计算
- 批准号:
2247014 - 财政年份:2023
- 资助金额:
$ 24.2万 - 项目类别:
Standard Grant
Collaborative Research: Hardware-Aware Matrix Computations for Deep Learning Applications
协作研究:深度学习应用的硬件感知矩阵计算
- 批准号:
2247015 - 财政年份:2023
- 资助金额:
$ 24.2万 - 项目类别:
Standard Grant
CDS&E: Collaborative Research: Deep learning enhanced parallel computations of fluid flow around moving boundaries on binarized octrees
CDS
- 批准号:
1953204 - 财政年份:2020
- 资助金额:
$ 24.2万 - 项目类别:
Standard Grant
Collaborative Research: Learning for Faster Computations to Enhance Efficiency and Security of Power System Operations
协作研究:学习更快的计算以提高电力系统运行的效率和安全性
- 批准号:
2023531 - 财政年份:2020
- 资助金额:
$ 24.2万 - 项目类别:
Standard Grant
CDS&E: Collaborative Research: Deep learning enhanced parallel computations of fluid flow around moving boundaries on binarized octrees
CDS
- 批准号:
1953222 - 财政年份:2020
- 资助金额:
$ 24.2万 - 项目类别:
Standard Grant
Collaborative Research: Learning for Faster Computations to Enhance Efficiency and Security of Power System Operations
协作研究:学习更快的计算以提高电力系统运行的效率和安全性
- 批准号:
2025152 - 财政年份:2020
- 资助金额:
$ 24.2万 - 项目类别:
Standard Grant
SaTC: CORE: Medium: Collaborative: Secure Distributed Coded Computations for IoT: An Information Theoretic and Network Approach
SaTC:核心:媒介:协作:物联网的安全分布式编码计算:信息论和网络方法
- 批准号:
1801630 - 财政年份:2018
- 资助金额:
$ 24.2万 - 项目类别:
Standard Grant
SaTC: CORE: Medium: Collaborative: Secure Distributed Coded Computations for IoT: An Information Theoretic and Network Approach
SaTC:核心:媒介:协作:物联网的安全分布式编码计算:信息论和网络方法
- 批准号:
1801708 - 财政年份:2018
- 资助金额:
$ 24.2万 - 项目类别:
Standard Grant
Collaborative Research: Algorithm and Theory for Interface Computations
协作研究:接口计算的算法和理论
- 批准号:
1852597 - 财政年份:2018
- 资助金额:
$ 24.2万 - 项目类别:
Standard Grant
Collaborative Research: Infection Multiplicity and Virus Evolution, from Experiments to Large Scale Multi-Population Stochastic Computations
合作研究:感染多重性和病毒进化,从实验到大规模多群体随机计算
- 批准号:
1662146 - 财政年份:2017
- 资助金额:
$ 24.2万 - 项目类别:
Continuing Grant