CIF: Medium: Coding Theory for DNA Storage: Synthesis, Retention, and Reconstruction

CIF:媒介:DNA 存储编码理论:合成、保留和重建

基本信息

  • 批准号:
    2212437
  • 负责人:
  • 金额:
    $ 120万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2022
  • 资助国家:
    美国
  • 起止时间:
    2022-08-01 至 2026-07-31
  • 项目状态:
    未结题

项目摘要

New information-storage technologies are needed to accommodate the growing deluge of data being collected and generated by modern society. In the past decade, several experiments have demonstrated that deoxyribonucleic acid (abbreviated DNA) – the molecule that carries the genetic information of living organisms – is a potentially viable storage medium. DNA-based storage would have many attractive features: unprecedented data density, a recording format that will not become obsolete, archival durability over thousands of years, and easy data replication. On the other hand, DNA storage requires fundamentally new methods for encoding data into DNA sequences to make the storage process efficient and reliable. The aim of the project is two-fold: (1) to understand the mathematical limits on the efficiency, reliability, and information density of DNA-based storage, and (2) to develop novel data-encoding and decoding algorithms to help achieve those limits. The project will provide a stimulating research opportunity for undergraduate and graduate students, encouraging teamwork across university boundaries and collaboration across disciplines.The project focuses on coding methods that address critical problems in the key stages of the DNA storage process: efficient synthesis of DNA sequences, stable retention of stored sequences, and reliable data retrieval and reconstruction. Specific objectives are: (1) establish fundamental information-theoretic limits on the storage capacity of DNA using mathematical abstractions of the DNA recording process, (2) develop source coding techniques to minimize the time needed to encode data into synthesized arrays of DNA sequences, (3) design coding algorithms to efficiently enforce constraints on the allowed nucleotide patterns in synthesized DNA sequences to ensure their long-term retention, and (4) develop reconstruction algorithms and error-correcting codes that can recover a set of DNA sequences from an unordered collection of copies that may be corrupted by insertions, deletions, and substitutions of nucleotides. The project develops tools to address classical problems in coding theory and information theory that underlie many aspects of the research. These include the construction of optimal codes for finite-state communication channels with symbol costs, the design of optimal short-length codes that correct multiple insertion and deletion errors, the development of efficient coding techniques that asymptotically approach the capacity of a communication channel with deletion errors, and the analysis of algorithms and codes that enable reconstruction of a sequence from multiple noisy observations, either exactly or within a small list of candidate sequences.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
需要新的信息存储技术来适应现代社会收集和产生的日益泛滥的数据。 在过去的十年中,一些实验已经证明脱氧核糖核酸(简称DNA)-携带生物体遗传信息的分子-是一种潜在可行的存储介质。基于DNA的存储将具有许多吸引人的特点:前所未有的数据密度,不会过时的记录格式,数千年的存档耐久性,以及易于复制的数据。 另一方面,DNA存储需要从根本上将数据编码到DNA序列中的新方法,以使存储过程高效可靠。 该项目的目的有两个:(1)了解基于DNA的存储的效率,可靠性和信息密度的数学限制,以及(2)开发新的数据编码和解码算法,以帮助实现这些限制。 该项目将为本科生和研究生提供一个令人振奋的研究机会,鼓励跨大学界限的团队合作和跨学科的合作。该项目侧重于解决DNA存储过程关键阶段关键问题的编码方法:DNA序列的有效合成,存储序列的稳定保留,以及可靠的数据检索和重建。具体目标是:(1)使用DNA记录过程的数学抽象建立DNA存储容量的基本信息理论限制,(2)开发源编码技术以最小化将数据编码到DNA序列的合成阵列中所需的时间,(3)设计编码算法以有效地对合成DNA序列中允许的核苷酸模式实施约束以确保它们的长期保留,以及(4)开发重建算法和纠错码,其可以从可能被核苷酸的插入、缺失和取代破坏的无序拷贝集合中恢复一组DNA序列。 该项目开发工具,以解决编码理论和信息理论的基础研究的许多方面的经典问题。这些包括符号成本的有限状态通信信道的最佳码的构造,纠正多个插入和删除错误的最佳短长度码的设计,渐进地接近具有删除错误的通信信道的容量的有效编码技术的发展,以及使从多个噪声观测重建序列的算法和代码的分析,完全或在一小部分候选序列中。该奖项反映了NSF的法定使命,并且通过使用基金会的知识价值和更广泛的影响审查标准进行评估,被认为值得支持。

项目成果

期刊论文数量(2)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Polar Codes with Local-Global Decoding
The Noisy Drawing Channel: Reliable Data Storage in DNA Sequences
嘈杂的绘图通道:DNA 序列中的可靠数据存储
  • DOI:
    10.1109/tit.2022.3231752
  • 发表时间:
    2023
  • 期刊:
  • 影响因子:
    2.5
  • 作者:
    Lenz, Andreas;Siegel, Paul H.;Wachter-Zeh, Antonia;Yaakobi, Eitan
  • 通讯作者:
    Yaakobi, Eitan
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

paul siegel其他文献

paul siegel的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('paul siegel', 18)}}的其他基金

CIF: Medium: Collaborative Research: New Frontiers in Polar Coding: 5G and Beyond
CIF:媒介:协作研究:Polar 编码的新前沿:5G 及以上
  • 批准号:
    1764104
  • 财政年份:
    2018
  • 资助金额:
    $ 120万
  • 项目类别:
    Continuing Grant
CCF-BSF: CIF: Small: Coding Techniques for Emerging Storage Technologies.
CCF-BSF:CIF:小型:新兴存储技术的编码技术。
  • 批准号:
    1619053
  • 财政年份:
    2016
  • 资助金额:
    $ 120万
  • 项目类别:
    Standard Grant
Non Volatile Memory Workshop 2015 to be held on March 1-3,2015 at LaJolla, California
2015 年非易失性存储器研讨会将于 2015 年 3 月 1 日至 3 日在加利福尼亚州拉霍亚举行
  • 批准号:
    1538734
  • 财政年份:
    2015
  • 资助金额:
    $ 120万
  • 项目类别:
    Standard Grant
Workshop: Non-volatile Memories Workshop 2014. To Be Held March 9-11, 2014, on the UCSD campus in La Jolla, California.
研讨会:2014 年非易失性存储器研讨会。将于 2014 年 3 月 9 日至 11 日在加利福尼亚州拉霍亚的加州大学圣地亚哥分校校园举行。
  • 批准号:
    1427680
  • 财政年份:
    2014
  • 资助金额:
    $ 120万
  • 项目类别:
    Standard Grant
Workshop: Non-volatile Memories Workshop 2012; Held on March 4-6, 2012 at Univ. California, San Diego in La Jolla, CA.
研讨会:非易失性存储器研讨会 2012;
  • 批准号:
    1230080
  • 财政年份:
    2012
  • 资助金额:
    $ 120万
  • 项目类别:
    Standard Grant
CIF: Small: Coding for Non-Volatile Memories
CIF:小:非易失性存储器编码
  • 批准号:
    1116739
  • 财政年份:
    2011
  • 资助金额:
    $ 120万
  • 项目类别:
    Standard Grant
Non-volatile Memories Workshop 2011. Workshop to be held on the campus of University of California, San Diego March 6-8, 2011 in La Jolla, CA.
2011 年非易失性存储器研讨会。研讨会将于 2011 年 3 月 6 日至 8 日在加利福尼亚州拉霍亚的加州大学圣地亚哥分校校园举行。
  • 批准号:
    1111679
  • 财政年份:
    2011
  • 资助金额:
    $ 120万
  • 项目类别:
    Standard Grant
Code Representation and Performance of Graph-Based Decoding
基于图的解码的代码表示和性能
  • 批准号:
    0829865
  • 财政年份:
    2008
  • 资助金额:
    $ 120万
  • 项目类别:
    Standard Grant
TF-Capacity-Approaching Coding and Detection for Page-Oriented Digital Recording Channels.
用于面向页面的数字记录通道的TF容量逼近编码和检测。
  • 批准号:
    0514859
  • 财政年份:
    2005
  • 资助金额:
    $ 120万
  • 项目类别:
    Standard Grant
ITR: Information-Theoretic Limits in Data Storage Systems
ITR:数据存储系统中的信息理论限制
  • 批准号:
    0219582
  • 财政年份:
    2002
  • 资助金额:
    $ 120万
  • 项目类别:
    Standard Grant

相似海外基金

Collaborative Research: CIF: Medium: QODED: Quantum codes Optimized for the Dynamics between Encoded Computation and Decoding using Classical Coding Techniques
协作研究:CIF:中:QODED:针对使用经典编码技术的编码计算和解码之间的动态进行优化的量子代码
  • 批准号:
    2106213
  • 财政年份:
    2021
  • 资助金额:
    $ 120万
  • 项目类别:
    Continuing Grant
Collaborative Research: CIF: Medium: QODED: Quantum codes Optimized for the Dynamics between Encoded Computation and Decoding using Classical Coding Techniques
协作研究:CIF:中:QODED:针对使用经典编码技术的编码计算和解码之间的动态进行优化的量子代码
  • 批准号:
    2106189
  • 财政年份:
    2021
  • 资助金额:
    $ 120万
  • 项目类别:
    Continuing Grant
CIF: Medium: Collaborative Research: New Frontiers in Polar Coding: 5G and Beyond
CIF:媒介:协作研究:Polar 编码的新前沿:5G 及以上
  • 批准号:
    1763348
  • 财政年份:
    2018
  • 资助金额:
    $ 120万
  • 项目类别:
    Continuing Grant
CIF: Medium: Collaborative Research: New Frontiers in Polar Coding: 5G and Beyond
CIF:媒介:协作研究:Polar 编码的新前沿:5G 及以上
  • 批准号:
    1764104
  • 财政年份:
    2018
  • 资助金额:
    $ 120万
  • 项目类别:
    Continuing Grant
CIF: Medium: Collaborative Research: Frontiers in coding for cloud storage systems
CIF:媒介:协作研究:云存储系统编码前沿
  • 批准号:
    1748585
  • 财政年份:
    2017
  • 资助金额:
    $ 120万
  • 项目类别:
    Continuing Grant
CIF: Medium: Collaborative Research: Frontiers in coding for cloud storage systems
CIF:媒介:协作研究:云存储系统编码前沿
  • 批准号:
    1563622
  • 财政年份:
    2016
  • 资助金额:
    $ 120万
  • 项目类别:
    Continuing Grant
CIF: Medium: Collaborative Research: Frontiers in coding for cloud storage systems
CIF:媒介:协作研究:云存储系统编码前沿
  • 批准号:
    1563742
  • 财政年份:
    2016
  • 资助金额:
    $ 120万
  • 项目类别:
    Continuing Grant
CIF: Medium: Collaborative Research: Frontiers in coding for cloud storage systems
CIF:媒介:协作研究:云存储系统编码前沿
  • 批准号:
    1564167
  • 财政年份:
    2016
  • 资助金额:
    $ 120万
  • 项目类别:
    Continuing Grant
CIF: Medium: Polar Coding for Data Storage: Theory and Applications
CIF:中:数据存储的极性编码:理论与应用
  • 批准号:
    1405119
  • 财政年份:
    2014
  • 资助金额:
    $ 120万
  • 项目类别:
    Continuing Grant
CIF: Medium: Collaborative Research: Estimating simultaneously structured models: from phase retrieval to network coding
CIF:媒介:协作研究:估计同时结构化模型:从相位检索到网络编码
  • 批准号:
    1409204
  • 财政年份:
    2014
  • 资助金额:
    $ 120万
  • 项目类别:
    Continuing Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了