Collaborative Research: CNS Core: Small: Efficient Ways to Enlarge Practical DNA Storage Capacity by Integrating Bio-Computer Technologies
合作研究:中枢神经系统核心:小型:通过集成生物计算机技术扩大实用 DNA 存储容量的有效方法
基本信息
- 批准号:2204656
- 负责人:
- 金额:$ 30万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2022
- 资助国家:美国
- 起止时间:2022-07-15 至 2025-06-30
- 项目状态:未结题
- 来源:
- 关键词:
项目摘要
The world's digital data increases immensely each year. By 2025, it will reach 175 Zettabytes (ZB). Most human activities are recorded in digital format today. However, data recorded in digital media cannot last very long. Therefore, valuable data cannot be preserved today with our current storage technologies and devices for a long duration (beyond 15 years). The capacity of existing storage media cannot keep up with the growth of the amount of digital data. Also, all storage devices could become obsolete within several years, so the data stored are vulnerable as they perish as time goes by. Therefore, synthetic deoxyribonucleic acid (DNA) becomes an attractive alternative storage medium due to its high density and long durability. These characteristics of DNA storage make it a great candidate for archival storage. However, the preliminary study of the project indicates the practical DNA storage tube capacity based on current technologies is only around 250GB, which is much less than the expected capacity. The major reason is that primer-payload collisions in DNA storage can drastically reduce the number of usable primers in a tube as the data payload size increases. The use of primers is essential for random access to DNA data. In this project, an interdisciplinary team is formed to investigate both bio and storage approaches that can improve the scalability of DNA storage. Among the many factors that can scale up DNA storage, the project plans to investigate the following questions: 1) How to identify more primers for a primer library to be used in DNA storage? 2) Given a primer library, how to efficiently allocate payload data to avoid primer-payload collisions to increase DNA storage capacity? and 3) How to effectively use a popular technique called data deduplication in data backup applications to further increase the storage capability of DNA storage? With a deep understanding of molecular biology and computer storage technologies and systems, this interdisciplinary team fosters several innovative ways of understanding the fundamental issues of DNA storage and will develop necessary genome engineering, sequencing techniques, software, and new algorithms to optimize the process of converting the world's digital data to DNA storage for archiving and preserving today's valuable digital data for hundreds of years in the future. The goal of storing the world's digital data in DNA storage to preserve all human activities can move one step closer with this project. The potential research outcomes of the project include fostering the advancement of bioscience and storage technologies, preserving human activities in DNA storage for hundreds of years, and facilitating fundamental understanding, identifying tradeoffs, and creating efficient ways of scaling up DNA storage. The project will provide an ideal inter-disciplinary thinking, hands-on learning, and development environment to teach computer science and electrical and computer engineering graduate and undergraduate students important system building and experimental skills that are critical for today's and the future IT workforce. The research outcomes of the project will be incorporated into the classroom teaching of the team members, for both class projects and the core courses in computer science and electrical and computer engineering. The team plans to include the obtained research results in a new course on Storage Technologies /Systems for Big Data for students in a Data Science Program, as well as in undergraduate senior design and directed research studies. The team plans to disseminate the research advances to industrial collaborators, and through publications, presentations, and public release of research data, software tools, and prototype systems to the research community. The team is committed to recruiting underrepresented undergraduate and graduate students to the project. Research results will be made quickly available to the general public and disseminated via websites and open source repositories like GitHub.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
世界上的数字数据每年都在大幅增加。到2025年,它将达到175 Zettabytes(ZB)。今天,大多数人类活动都以数字格式记录。然而,记录在数字媒体中的数据不能持续很长时间。因此,使用我们目前的存储技术和设备,无法长期保存有价值的数据(超过15年)。现有存储介质的容量无法跟上数字数据量的增长。此外,所有存储设备都可能在几年内过时,因此存储的数据随着时间的推移而变得脆弱。因此,合成的脱氧核糖核酸(DNA)由于其高密度和长耐久性而成为一种有吸引力的替代存储介质。DNA存储的这些特性使其成为档案存储的绝佳候选者。然而,该项目的初步研究表明,基于现有技术的实际DNA存储管容量仅为250 GB左右,远远低于预期容量。主要原因是,随着数据有效载荷大小的增加,DNA存储中的引物-有效载荷碰撞会大幅减少试管中可用引物的数量。引物的使用对于随机获取DNA数据至关重要。在这个项目中,成立了一个跨学科的团队来研究可以提高DNA存储可扩展性的生物和存储方法。在可以扩大DNA存储的许多因素中,该项目计划研究以下问题:1)如何为用于DNA存储的引物库识别更多的引物?2)给定一个引物库,如何有效地分配有效载荷数据以避免引物-有效载荷碰撞,从而增加DNA存储容量? 3)如何在数据备份应用中有效地使用一种流行的称为数据去重的技术,以进一步提高DNA存储的存储能力? 凭借对分子生物学和计算机存储技术和系统的深刻理解,这个跨学科的团队培养了几种理解DNA存储基本问题的创新方法,并将开发必要的基因组工程,测序技术,软件,以及新的算法,以优化将世界数字数据转换为DNA存储的过程,以便将当今有价值的数字数据存档和保存数百年在未来通过这个项目,将世界上的数字数据存储在DNA存储中以保存所有人类活动的目标可以更近一步。该项目的潜在研究成果包括促进生物科学和存储技术的进步,将人类活动保存在DNA存储中数百年,促进基本理解,确定权衡,并创造扩大DNA存储的有效方法。该项目将提供一个理想的跨学科思维,动手学习和开发环境,教计算机科学和电气和计算机工程研究生和本科生重要的系统建设和实验技能,是至关重要的今天和未来的IT劳动力。该项目的研究成果将被纳入团队成员的课堂教学中,用于课堂项目以及计算机科学和电气与计算机工程的核心课程。该团队计划将获得的研究成果纳入数据科学课程学生的大数据存储技术/系统新课程,以及本科高级设计和指导研究。该团队计划将研究进展传播给工业合作者,并通过出版物,演示文稿和研究数据,软件工具和原型系统的公开发布到研究社区。该团队致力于招募代表性不足的本科生和研究生参加该项目。研究成果将迅速提供给公众,并通过网站和GitHub等开源存储库传播。该奖项反映了NSF的法定使命,并通过使用基金会的知识价值和更广泛的影响审查标准进行评估,被认为值得支持。
项目成果
期刊论文数量(6)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
DP-DNA: A Digital Pattern-Aware DNA Encoding Scheme to Improve Encoding Density of DNA Storage
DP-DNA:一种提高 DNA 存储编码密度的数字模式感知 DNA 编码方案
- DOI:10.1109/mascots59514.2023.10387655
- 发表时间:2023
- 期刊:
- 影响因子:0
- 作者:Li, Bingzhe;Ou, Li;Yuan, Bo;Du, David H.C.
- 通讯作者:Du, David H.C.
Work-in-Progress: ExpCache: Online-Learning based Cache Replacement Policy for Non-Volatile Memory
正在进行中的工作:ExpCache:基于在线学习的非易失性内存缓存替换策略
- DOI:10.1109/cases55004.2022.00010
- 发表时间:2022
- 期刊:
- 影响因子:0
- 作者:Yang, Jinfeng;Li, Bingzhe;Yuan, Jianjun;Shen, Zhaoyan;Du, David;Lilja, David
- 通讯作者:Lilja, David
SMRTS: A Performance and Cost-Effectiveness Optimized SSD-SMR Tiered File System with Data Deduplication
- DOI:10.1109/iccd58817.2023.00050
- 发表时间:2023-11
- 期刊:
- 影响因子:0
- 作者:Zhichao Cao;Hao Wen;Fenggang Wu;David Hung-Chang Du
- 通讯作者:Zhichao Cao;Hao Wen;Fenggang Wu;David Hung-Chang Du
Machine Learning-based Adaptive Migration Algorithm for Hybrid Storage Systems
- DOI:10.1109/nas55553.2022.9925545
- 发表时间:2022-10
- 期刊:
- 影响因子:0
- 作者:Milan M. Shetti;Bingzhe Li;D. Du
- 通讯作者:Milan M. Shetti;Bingzhe Li;D. Du
HL-DNA: A Hybrid Lossy/Lossless Encoding Scheme to Enhance DNA Storage Density and Robustness for Images
HL-DNA:一种增强图像 DNA 存储密度和鲁棒性的有损/无损混合编码方案
- DOI:10.1109/iccd56317.2022.00071
- 发表时间:2022
- 期刊:
- 影响因子:0
- 作者:Li, Yi;Du, David H.C.;Ou, Li;Li, Bingzhe
- 通讯作者:Li, Bingzhe
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
David Du其他文献
Rendering Color Information Using Haptic Feedback
使用触觉反馈渲染颜色信息
- DOI:
- 发表时间:
2005 - 期刊:
- 影响因子:0
- 作者:
S. Chakrabarti;S. Pramanik;David Du;Rajashree Paul - 通讯作者:
Rajashree Paul
David Du的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('David Du', 18)}}的其他基金
CSR: Small: Heterogeneous Storage Systems with Emerging Technologies for Solving Big Data Problems
CSR:小型:利用新兴技术解决大数据问题的异构存储系统
- 批准号:
1812537 - 财政年份:2018
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
CSR: Small: Efficient Usage of Shingled Magnetic Recording (SMR)Drives
CSR:小型:叠瓦式磁记录 (SMR) 驱动器的高效使用
- 批准号:
1525617 - 财政年份:2015
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
I/UCRC Phase II: Center on Intelligent Storage
I/UCRC二期:智能存储中心
- 批准号:
1439622 - 财政年份:2014
- 资助金额:
$ 30万 - 项目类别:
Continuing Grant
CSR: Small: Collaborative Research: Software Defined Energy Adaptation in Large Scale Data Centers
CSR:小型:协作研究:大型数据中心的软件定义能源适应
- 批准号:
1421913 - 财政年份:2014
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
Travel Support for the 43rd Annual International Conference on Parallel Processing (ICPP-2014)
第 43 届并行处理国际会议 (ICPP-2014) 的差旅支持
- 批准号:
1438816 - 财政年份:2014
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
II-NEW: One Cloud Does Not Fit All: Minnesota Integrated Cloud Systems Research Testbed (MiST)
II-新:单一云并不能满足所有需求:明尼苏达州集成云系统研究测试平台 (MiST)
- 批准号:
1305237 - 财政年份:2013
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
NeTS: Small: Information Dissemination in Vehicular Networks for Reducing Traffic Congestion
NeTS:小型:车载网络中的信息传播以减少交通拥堵
- 批准号:
1217572 - 财政年份:2012
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
CSR: Small: Prediction-Based Data Placement for New Memory and Storage Hierarchies
CSR:小型:新内存和存储层次结构的基于预测的数据放置
- 批准号:
1217569 - 财政年份:2012
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
Integrating Flash and Phase Change Memory into Memory/Storage Hierarchies for Enhancing High-End and Data-Intensive Computing
将闪存和相变存储器集成到内存/存储层次结构中以增强高端和数据密集型计算
- 批准号:
1127829 - 财政年份:2011
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
CSR:Small: Efficient FTL Buffer Management for High-Performance Solid State Drives
CSR:Small:适用于高性能固态硬盘的高效 FTL 缓冲区管理
- 批准号:
1115471 - 财政年份:2011
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
相似国自然基金
Research on Quantum Field Theory without a Lagrangian Description
- 批准号:24ZR1403900
- 批准年份:2024
- 资助金额:0.0 万元
- 项目类别:省市级项目
Cell Research
- 批准号:31224802
- 批准年份:2012
- 资助金额:24.0 万元
- 项目类别:专项基金项目
Cell Research
- 批准号:31024804
- 批准年份:2010
- 资助金额:24.0 万元
- 项目类别:专项基金项目
Cell Research (细胞研究)
- 批准号:30824808
- 批准年份:2008
- 资助金额:24.0 万元
- 项目类别:专项基金项目
Research on the Rapid Growth Mechanism of KDP Crystal
- 批准号:10774081
- 批准年份:2007
- 资助金额:45.0 万元
- 项目类别:面上项目
相似海外基金
Collaborative Research: CNS Core: Small: A Compilation System for Mapping Deep Learning Models to Tensorized Instructions (DELITE)
合作研究:CNS Core:Small:将深度学习模型映射到张量化指令的编译系统(DELITE)
- 批准号:
2230945 - 财政年份:2023
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
Collaborative Research: CNS Core: Medium: Movement of Computation and Data in Splitkernel-disaggregated, Data-intensive Systems
合作研究:CNS 核心:媒介:Splitkernel 分解的数据密集型系统中的计算和数据移动
- 批准号:
2406598 - 财政年份:2023
- 资助金额:
$ 30万 - 项目类别:
Continuing Grant
Collaborative Research: CNS Core: Small: SmartSight: an AI-Based Computing Platform to Assist Blind and Visually Impaired People
合作研究:中枢神经系统核心:小型:SmartSight:基于人工智能的计算平台,帮助盲人和视障人士
- 批准号:
2418188 - 财政年份:2023
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
Collaborative Research: CNS Core: Medium: Reconfigurable Kernel Datapaths with Adaptive Optimizations
协作研究:CNS 核心:中:具有自适应优化的可重构内核数据路径
- 批准号:
2345339 - 财政年份:2023
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
Collaborative Research: NSF-AoF: CNS Core: Small: Towards Scalable and Al-based Solutions for Beyond-5G Radio Access Networks
合作研究:NSF-AoF:CNS 核心:小型:面向超 5G 无线接入网络的可扩展和基于人工智能的解决方案
- 批准号:
2225578 - 财政年份:2023
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
Collaborative Research: CNS Core: Small: Creating An Extensible Internet Through Interposition
合作研究:CNS核心:小:通过介入创建可扩展的互联网
- 批准号:
2242503 - 财政年份:2023
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
Collaborative Research: CNS Core: Small: Adaptive Smart Surfaces for Wireless Channel Morphing to Enable Full Multiplexing and Multi-user Gains
合作研究:CNS 核心:小型:用于无线信道变形的自适应智能表面,以实现完全复用和多用户增益
- 批准号:
2343959 - 财政年份:2023
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
Collaborative Research: CNS Core: Small: Efficient Ways to Enlarge Practical DNA Storage Capacity by Integrating Bio-Computer Technologies
合作研究:中枢神经系统核心:小型:通过集成生物计算机技术扩大实用 DNA 存储容量的有效方法
- 批准号:
2343863 - 财政年份:2023
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
Collaborative Research: CNS Core: Small: A Compilation System for Mapping Deep Learning Models to Tensorized Instructions (DELITE)
合作研究:CNS Core:Small:将深度学习模型映射到张量化指令的编译系统(DELITE)
- 批准号:
2341378 - 财政年份:2023
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
Collaborative Research: CNS Core: Medium: Innovating Volumetric Video Streaming with Motion Forecasting, Intelligent Upsampling, and QoE Modeling
合作研究:CNS 核心:中:通过运动预测、智能上采样和 QoE 建模创新体积视频流
- 批准号:
2409008 - 财政年份:2023
- 资助金额:
$ 30万 - 项目类别:
Continuing Grant