A Decoupled I/O Architecture for Data-Intensive Cluster Computing

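Basic Information

  • Award number:
    0083497
  • Principal investigator:
  • Amount:
    $128,000
  • Host institution:
  • Host institution country:
    United States
  • Project type:
    Standard Grant
  • Fiscal year:
    2001
  • Funding country:
    United States
  • Project period:
    2001-08-15 to 2004-07-31
  • Project status:
    Completed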

Project Abstract

Cluster computing exploits the exponential growth in processing power, memory/disk capacity, and network link speed of commodity hardware developed in the PC industry to achieve excellent price-performance, and is becoming a major platform for scientific computation, Web page and file serving, and large-scale information analysis applications such as data mining and knowledge discovery. A distinct characteristic shared by many emerging workloads on PC clusters is the requirement for intensive disk data access during program computation. The fundamental technique for addressing the performance problems associated with disk I/O is to overlap disk access with program computation so that the I/O delays are completely masked. Disk prefetching is one incarnation of this technique. In this project, we propose an automatic application-specific file prefetching scheme that exploits each application's access patterns to the fullest extent and achieves close to perfect prefetching. The key idea is to apply the concept of decoupled architecture, which was originally proposed to bridge the gap between CPU and memory, to overlap computation with disk I/O. Given an application program, the compiler automatically translates it into two threads, one corresponding to the original program (computation thread) and the other including all statements in the original program that are related to disk I/O (prefetch thread). At run time, the prefetch thread is scheduled to be sufficiently ahead of the computation thread so that all the file access requests of the computation thread are satisfied in the file system cache, which the prefetch thread populates in advance.
To validate the decoupled I/O architecture, we will design, implement, and evaluate an active parallel disk I/O subsystem called Platypus that embodies the application-specific file prefetching scheme and that is designed to be a modular building block for scalable PC cluster systems that aim at data-intensive computing. Platypus consists of a source-to-source translator that automatically generates computation and prefetch threads from an SPMD parallel program, a run-time thread scheduler that coordinates the execution of prefetch threads on the I/O nodes, and a cache manager that maximizes main memory utilization efficiency by optimally balancing the benefits of file caching and prefetching.
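The split into a computation thread and a prefetch thread can be illustrated with a minimal Python sketch. It is not Platypus code and does not reproduce the project's translator or scheduler: the names `read_block`, `prefetch_thread`, `computation_thread`, and `run_decoupled` are hypothetical, a bounded queue stands in for the file system cache, and the `window` parameter is an assumed way of limiting how far the prefetch thread may run ahead.

```python
import queue
import threading


def read_block(block_id):
    """Hypothetical stand-in for a slow disk read; returns the block's bytes."""
    return bytes([block_id % 256]) * 4


def prefetch_thread(access_pattern, cache):
    # Holds only the I/O-related statements of the original program:
    # it walks the same access pattern and issues the reads early.
    for block_id in access_pattern:
        # put() blocks when the window is full, so the prefetcher
        # stays at most `window` blocks ahead of the computation.
        cache.put((block_id, read_block(block_id)))
    cache.put(None)  # sentinel: no more blocks


def computation_thread(cache, results):
    # Corresponds to the original program, except that every read is
    # served from the cache the prefetch thread has been filling.
    while True:
        item = cache.get()
        if item is None:
            break
        block_id, data = item
        results.append((block_id, sum(data)))  # placeholder computation


def run_decoupled(access_pattern, window=4):
    """Run both threads to completion and return the computed results."""
    cache = queue.Queue(maxsize=window)  # toy model of the file system cache
    results = []
    producer = threading.Thread(target=prefetch_thread, args=(access_pattern, cache))
    consumer = threading.Thread(target=computation_thread, args=(cache, results))
    producer.start()
    consumer.start()
    producer.join()
    consumer.join()
    return results
```

Calling `run_decoupled([3, 1, 2])` processes the blocks in access order while the reads overlap with the computation. A larger `window` lets the prefetch thread run further ahead at the cost of more memory, which is the kind of trade-off the abstract assigns to the cache manager.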

Project Outcomes

Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)

Other publications by Tzi-Cker Chiueh

Integration of volume decompression and out-of-core iso-surface extraction from irregular volume data
  • DOI:
    10.1007/s00371-006-0003-9
  • Published:
    2006-03-30
  • Journal:
  • Impact factor:
    2.900
  • Authors:
    Chuan-Kai Yang;Tzi-Cker Chiueh
  • Corresponding author:
    Tzi-Cker Chiueh

Other grants by Tzi-Cker Chiueh

Collaborative Research: CRI: IAD A Miniaturized Robotic Testbed for Development, Testing, and Evaluation of Protocols for Multi-Hop Wireless Networks
  • Award number:
    0751121
  • Fiscal year:
    2008
  • Amount:
    $128,000
  • Project type:
    Standard Grant
CT: Automatic Generation of High-Quality Attack Signatures and Patches
  • Award number:
    0627672
  • Fiscal year:
    2006
  • Amount:
    $128,000
  • Project type:
    Continuing Grant
Quality of Service Guarantee for Scalable Parallel Storage Systems
  • Award number:
    0621512
  • Fiscal year:
    2006
  • Amount:
    $128,000
  • Project type:
    Standard Grant
Design Techniques for Repairable Data Systems
  • Award number:
    0410694
  • Fiscal year:
    2004
  • Amount:
    $128,000
  • Project type:
    Continuing Grant
NeTS-ProWiN: Implementation Techniques for Last-Mile Wireless Mesh Networks
  • Award number:
    0435373
  • Fiscal year:
    2004
  • Amount:
    $128,000
  • Project type:
    Continuing Grant
Quality of Service Management for Enterprise-Application Computing Utilities
  • Award number:
    0342556
  • Fiscal year:
    2004
  • Amount:
    $128,000
  • Project type:
    Standard Grant
SGER: NMI: An Initial Study on Software Protection Techniques
  • Award number:
    0401777
  • Fiscal year:
    2004
  • Amount:
    $128,000
  • Project type:
    Standard Grant
SOFTWARE: Resource Virtualization for Cluster-Based Network Storage Service
  • Award number:
    0234281
  • Fiscal year:
    2003
  • Amount:
    $128,000
  • Project type:
    Continuing Grant
Implementation Techniques Building Scalable Cluster-Based 3D Graphics Engines
  • Award number:
    9907485
  • Fiscal year:
    2000
  • Amount:
    $128,000
  • Project type:
    Standard Grant
Implementation Techniques for High-Performance Real-Time IP Routers
  • Award number:
    9814934
  • Fiscal year:
    1999
  • Amount:
    $128,000
  • Project type:
    Standard Grant

Similar overseas grants

Integrative modelling of single-cell data to elucidate the genetic architecture of complex disease
  • Award number:
    10889304
  • Fiscal year:
    2023
  • Amount:
    $128,000
  • Project type:
NHLBI ENTERPRISE ARCHITECTURE AND CYBER SECURITY SUPPORT FOR DATA SCIENCE PROGRAMS
  • Award number:
    10974010
  • Fiscal year:
    2023
  • Amount:
    $128,000
  • Project type:
Collaborative Research: SpecEES: Designing A Spectrally Efficient and Energy Efficient Data Aided Demand Driven Elastic Architecture for future Networks (SpiderNET)
  • Award number:
    2323300
  • Fiscal year:
    2023
  • Amount:
    $128,000
  • Project type:
    Standard Grant
Collaborative Research: CNS Core: Small: NV-RGRA: Non-Volatile Nano-Second Right-Grained Reconfigurable Architecture for Data-Intensive Machine Learning and Graph Computing
  • Award number:
    2228239
  • Fiscal year:
    2022
  • Amount:
    $128,000
  • Project type:
    Standard Grant
Reliable and Secure Data-Sharing Architecture, Situational Awareness and Electrical-failure Modelling for SMRs
  • Award number:
    580480-2022
  • Fiscal year:
    2022
  • Amount:
    $128,000
  • Project type:
    Alliance Grants
Harnessing AI-powered big data techniques for 3D plant architecture phenotyping and growth pattern modeling
  • Award number:
    578508-2022
  • Fiscal year:
    2022
  • Amount:
    $128,000
  • Project type:
    Alliance Grants
SHF: Small: Methods, Workflows, and Data Commons for Reducing Training Costs in Neural Architecture Search on High-Performance Computing Platforms
  • Award number:
    2223704
  • Fiscal year:
    2022
  • Amount:
    $128,000
  • Project type:
    Standard Grant
Collaborative Research: CNS Core: Small: NV-RGRA: Non-Volatile Nano-Second Right-Grained Reconfigurable Architecture for Data-Intensive Machine Learning and Graph Computing
  • Award number:
    2228240
  • Fiscal year:
    2022
  • Amount:
    $128,000
  • Project type:
    Standard Grant
Integrative network modeling of bulk and single-cell sequencing data to characterize multi-scale cell architecture
  • Award number:
    10276091
  • Fiscal year:
    2021
  • Amount:
    $128,000
  • Project type:
The Architecture of Missing and Archaic Variation in Human Population Genomic Data
  • Award number:
    10292375
  • Fiscal year:
    2021
  • Amount:
    $128,000
  • Project type: