CSR: Medium: Collaborative Research: Facets: Exploring Semantic Equivalence of Files to Improve Storage Systems


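Basic information

  • Award number:
    1065373
  • Principal investigator:
  • Amount:
    $500,000
  • Host institution:
  • Host institution country:
    United States
  • Award type:
    Continuing Grant
  • Fiscal year:
    2011
  • Funding country:
    United States
  • Period:
    2011-08-15 to 2016-07-31
  • Status:
    Completed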

Project abstract

Intellectual Merit: The focus of the proposal is on finding semantically equivalent files in an efficient and scalable manner. If two files are identical, they are candidates for optimizations to reduce storage costs, increase performance, and generally improve the system. Traditionally, two files are only considered equivalent if they are byte-by-byte identical - i.e., byte equivalence. However, this team's preliminary research shows that there are many other files that are essentially equivalent, even though the bytes they contain are not the same. This proposal will investigate how to find such cases and perform optimizations that make use of semantic equivalence, rather than byte equivalence.

This project will design and implement a framework, Facets, which explores new capabilities by applying optimizations to files that are essentially transformed versions of each other. Many optimizations and improvements can be applied to semantically equivalent files, including:

  • Ensuring that the security of semantically equivalent files is preserved
  • Easing backup and maintenance of semantically equivalent files in various formats, fidelities, and versions
  • Using semantically equivalent files to improve performance, reliability, and availability
  • Regenerating semantically equivalent files to speed up recovery and network transfer
  • Selecting which semantically equivalent files to access according to performance or energy constraints

This team's preliminary research shows that 5% of files on a typical user's machine are original content. The rest are copies of files from elsewhere or various derivatives of original content. While leveraging this observation to achieve advantages is not trivial, many significant improvements are possible if one can find these relationships and make proper use of them. These improvements include enhanced security, more efficient backup and restoration, better file caching, more intelligent tradeoffs in performance versus power use, and a host of other possibilities.

Broader Impacts: The code and techniques developed will be released in open source form. The team will take steps (such as applying for supplemental REU grants) to involve undergraduates in the research. They will give talks and recruit at Hispanic-serving institutions. Materials and concepts from the research will be incorporated into classes taught by the principal investigators at their institutions.
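The contrast the abstract draws between byte equivalence and semantic equivalence can be illustrated with a small sketch. This is not the Facets framework or the team's method: the function names (`byte_digest`, `canonical_form`, `semantic_digest`) and the normalization rules (undoing gzip compression and unifying line endings) are assumptions chosen only to show how files with different bytes could still be grouped as equivalent.

```python
import gzip
import hashlib


def byte_digest(data: bytes) -> str:
    """Digest of the raw bytes: equal only for byte-identical files."""
    return hashlib.sha256(data).hexdigest()


def canonical_form(data: bytes) -> bytes:
    """Hypothetical normalization: undo gzip compression, then unify line endings."""
    if data[:2] == b"\x1f\x8b":  # gzip magic number
        data = gzip.decompress(data)
    return data.replace(b"\r\n", b"\n")


def semantic_digest(data: bytes) -> str:
    """Digest of the canonical form: equal for files this toy model treats as equivalent."""
    return hashlib.sha256(canonical_form(data)).hexdigest()


# Three in-memory "files": an original and two transformed versions of it.
original = b"report line 1\nreport line 2\n"
windows_copy = original.replace(b"\n", b"\r\n")  # same text, CRLF line endings
compressed_copy = gzip.compress(original)        # same text, gzip-compressed

variants = [original, windows_copy, compressed_copy]
byte_classes = {byte_digest(v) for v in variants}
semantic_classes = {semantic_digest(v) for v in variants}

# Byte equivalence sees three distinct files; the toy semantic model sees one.
print(len(byte_classes), len(semantic_classes))  # prints "3 1"
```

A real system would need far richer notions of transformation (format conversions, fidelity changes, versions) than this sketch covers; the point is only that grouping by a canonical form can find relationships that a raw byte comparison misses.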

Project outcomes

Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)

Other grants by An-I Wang

CyberCorps Scholarship for Service (Renewal): Defending the National Cyber Infrastructure
  • Award number:
    2146354
  • Fiscal year:
    2022
  • Amount:
    $500,000
  • Award type:
    Continuing Grant
Using Fine-Grained Quantitative and Qualitative Data to Enhance Curricula and Broaden Participation in Computer Science
  • Award number:
    2030070
  • Fiscal year:
    2020
  • Amount:
    $500,000
  • Award type:
    Standard Grant
Broadening Participation in Computer Science
  • Award number:
    1259462
  • Fiscal year:
    2013
  • Amount:
    $500,000
  • Award type:
    Standard Grant
CAREER: Tags: A Unifying Primitive to Build Storage Data Paths for Swiftly Evolving Workloads and Storage Media
  • Award number:
    0845672
  • Fiscal year:
    2009
  • Amount:
    $500,000
  • Award type:
    Continuing Grant
Collaborative Research. Conquest-2: Improving Energy Efficiency and Performance Through a Disk/RAM Hybrid File System
  • Award number:
    0410896
  • Fiscal year:
    2004
  • Amount:
    $500,000
  • Award type:
    Standard Grant

Similar overseas grants

Collaborative Research: CSR: Medium: Scaling Secure Serverless Computing on Heterogeneous Datacenters
  • Award number:
    2312206
  • Fiscal year:
    2023
  • Amount:
    $500,000
  • Award type:
    Continuing Grant
Collaborative Research: CSR: Medium: Architecting GPUs for Practical Homomorphic Encryption-based Computing
  • Award number:
    2312276
  • Fiscal year:
    2023
  • Amount:
    $500,000
  • Award type:
    Continuing Grant
Collaborative Research: CSR: Medium: Fortuna: Characterizing and Harnessing Performance Variability in Accelerator-rich Clusters
  • Award number:
    2312689
  • Fiscal year:
    2023
  • Amount:
    $500,000
  • Award type:
    Continuing Grant
Collaborative Research: CSR: Medium: Fortuna: Characterizing and Harnessing Performance Variability in Accelerator-rich Clusters
  • Award number:
    2401244
  • Fiscal year:
    2023
  • Amount:
    $500,000
  • Award type:
    Continuing Grant
Collaborative Research: CSR: Medium: Scaling Secure Serverless Computing on Heterogeneous Datacenters
  • Award number:
    2312207
  • Fiscal year:
    2023
  • Amount:
    $500,000
  • Award type:
    Continuing Grant
Collaborative Research: CSR: Medium: Adaptive Environmental Awareness for Collaborative Augmented Reality
  • Award number:
    2312760
  • Fiscal year:
    2023
  • Amount:
    $500,000
  • Award type:
    Continuing Grant
Collaborative Research: CSR: Core: Medium: Scaling Unix/Linux Shell Programs
  • Award number:
    2312346
  • Fiscal year:
    2023
  • Amount:
    $500,000
  • Award type:
    Continuing Grant
Collaborative Research: CSR: Medium: MemDrive: Memory-Driven Full-Stack Collaboration for Autonomous Embedded Systems
  • Award number:
    2312397
  • Fiscal year:
    2023
  • Amount:
    $500,000
  • Award type:
    Continuing Grant
Collaborative Research: CSR: Medium: MemDrive: Memory-Driven Full-Stack Collaboration for Autonomous Embedded Systems
  • Award number:
    2312396
  • Fiscal year:
    2023
  • Amount:
    $500,000
  • Award type:
    Continuing Grant
Collaborative Research: CSR: Medium: Adaptive Environmental Awareness for Collaborative Augmented Reality
  • Award number:
    2312761
  • Fiscal year:
    2023
  • Amount:
    $500,000
  • Award type:
    Continuing Grant