Representing structural haplotypes and complex genetic variation in pan-genome graphs

表示泛基因组图中的结构单倍型和复杂的遗传变异

基本信息

  • 批准号:
    10337078
  • 负责人:
  • 金额:
    $ 30万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
  • 财政年份:
    2020
  • 资助国家:
    美国
  • 起止时间:
    2020-02-10 至 2024-01-31
  • 项目状态:
    已结题

项目摘要

Title: Representing structural haplotypes and complex genetic variation in pan-genome graphs. PROJECT SUMMARY A pan-genome graph (PGG) reference must faithfully reflect structural haplotypes that differ in copy number, order, and orientation, which are currently poorly represented in a linear reference sequence. This effort focuses on the most copy variable and complex regions, including segmental duplications (SDs), inversions, short tandem repeats/variable number tandem repeats (copy-number-variable repeats, CNVRs) and combinations thereof that are frequently excluded or collapsed in reference genomes. The overarching goal of this project is to develop the tool infrastructure enabling the construction of whole-chromosome reference haplotypes that include all of these difficult classes of sequence. There are four specific aims. First, we will develop methods to construct PGGs from haplotype-phased de novo assemblies, ensuring the graph reflects both copy number variation and repeat structure, including CNVRs and SD. Second, we will develop software that will expand SD assembly methods to facilitate the curation of SD loci in PGGs. We will use SD assembly to detect variants specific to individual copies of a duplication, called paralog-specific variants (PSVs), and provide software to reconstruct local haplotype paths through the PGG that describe the different copies. Third, we will design novel methods to exploit single-cell template strand DNA sequencing data (Strand-seq) mapped to PGGs in order to thread chromosome-length "structural haplotypes" through the graph. Therefore, our software tool will allow the physical resolution of haplotypes comprising the full spectrum of structural variation, including inversions and inverted duplications. By virtue of the PSVs, the structural haplotypes will also embed sequence-resolved SDs. Fourth, we will develop a scalable open-source software framework to systematically assess how the inclusion of single-nucleotide variants, short indels, and structural variant classes in the PGG affects variant detection with short-read data. This will enable the optimization of the complexity encoded in the PGG for short-read variant detection. It will additionally provide a comprehensive view on polymorphic and fixed k-mers in human populations. We will develop tools to detect allele-specific k-mers and demonstrate how that enables the rapid genotyping of variants in the PGG based on k-mer composition of a short-read dataset. Once the framework for enhanced genome representation is established, we will focus on improving efficiency, scalability, and computational ease to cater to the needs of a broad range of users in genetics and genome science. This proposal will ensure that the most complex regions of the human genome are encoded into the PGG and that underlying genetic variation is ultimately assessed for association with disease. ​
泛基因组中代表结构单倍型和复杂遗传变异 图表。 项目总结 泛基因组图(PGG)参考必须忠实地反映拷贝数不同的结构单倍型, 顺序和取向,它们目前在线性参考序列中表示得很差。这一努力 重点研究复制最可变和最复杂的区域,包括片段复制(SD)、倒置、 短串联重复序列/可变数目串联重复序列(拷贝数可变重复序列,CNVR)和 它们的组合在参考基因组中经常被排除或崩溃。的首要目标是 这个项目是为了开发工具基础设施,使之能够构建全染色体参考 包括所有这些困难的序列类别的单倍型。有四个具体目标。首先,我们将 开发从单倍型阶段从头组装构建PGG的方法,确保图形反映 拷贝数变异和重复结构,包括CNVR和SD。第二,我们将开发软件 这将扩展SD组装方法,以促进PGG中SD基因座的筛选。我们将使用SD组件 要检测复制的各个副本的特定变体,称为平行对数特定变体(PSV),以及 提供软件以通过描述不同拷贝的PGG重建本地单倍型路径。第三, 我们将设计新的方法来利用单细胞模板链DNA测序数据(Strand-Seq) 为了将染色体长度的“结构单倍型”贯穿整个图表。因此,我们的 软件工具将允许包括结构变异的全谱的单倍型的物理分辨率, 包括倒置和倒置复制。凭借PSV,结构单倍型也将嵌入 序列解析的十二烷基硫酸酯。第四,我们将开发一个可扩展的开源软件框架,以系统地 评估如何在PGG中包括单核苷酸变体、短INDELs和结构变体类别 影响读取时间较短的数据的变量检测。这将能够优化编码在 用于短读变异检测的PGG。此外,它还将提供有关多态和 修复了人类群体中的k-MERS。我们将开发工具来检测等位基因特异的k-MERS,并演示如何 这使得能够基于短读数据集的k-mer组成对PGG中的变异进行快速基因分型。 一旦建立了增强基因组表示的框架,我们将专注于提高效率, 可伸缩性和计算简易性,以满足广泛的遗传学和基因组用户的需求 科学。这一提议将确保人类基因组中最复杂的区域被编码到 这种潜在的遗传变异最终被评估为与疾病有关。 ​

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Mark Chaisson其他文献

Mark Chaisson的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Mark Chaisson', 18)}}的其他基金

Representing structural haplotypes and complex genetic variation in pan-genome graphs
表示泛基因组图中的结构单倍型和复杂的遗传变异
  • 批准号:
    10832934
  • 财政年份:
    2023
  • 资助金额:
    $ 30万
  • 项目类别:
Detection and genotyping complex human genetic variation using single-molecule sequencing
使用单分子测序对复杂的人类遗传变异进行检测和基因分型
  • 批准号:
    10186109
  • 财政年份:
    2021
  • 资助金额:
    $ 30万
  • 项目类别:
Detection and genotyping complex human genetic variation using single-molecule sequencing
使用单分子测序对复杂的人类遗传变异进行检测和基因分型
  • 批准号:
    10655573
  • 财政年份:
    2021
  • 资助金额:
    $ 30万
  • 项目类别:
Detection and genotyping complex human genetic variation using single-molecule sequencing
使用单分子测序对复杂的人类遗传变异进行检测和基因分型
  • 批准号:
    10447193
  • 财政年份:
    2021
  • 资助金额:
    $ 30万
  • 项目类别:
Representing structural haplotypes and complex genetic variation in pan-genome graphs
表示泛基因组图中的结构单倍型和复杂的遗传变异
  • 批准号:
    9906038
  • 财政年份:
    2020
  • 资助金额:
    $ 30万
  • 项目类别:

相似海外基金

RII Track-4:NSF: From the Ground Up to the Air Above Coastal Dunes: How Groundwater and Evaporation Affect the Mechanism of Wind Erosion
RII Track-4:NSF:从地面到沿海沙丘上方的空气:地下水和蒸发如何影响风蚀机制
  • 批准号:
    2327346
  • 财政年份:
    2024
  • 资助金额:
    $ 30万
  • 项目类别:
    Standard Grant
BRC-BIO: Establishing Astrangia poculata as a study system to understand how multi-partner symbiotic interactions affect pathogen response in cnidarians
BRC-BIO:建立 Astrangia poculata 作为研究系统,以了解多伙伴共生相互作用如何影响刺胞动物的病原体反应
  • 批准号:
    2312555
  • 财政年份:
    2024
  • 资助金额:
    $ 30万
  • 项目类别:
    Standard Grant
How Does Particle Material Properties Insoluble and Partially Soluble Affect Sensory Perception Of Fat based Products
不溶性和部分可溶的颗粒材料特性如何影响脂肪基产品的感官知觉
  • 批准号:
    BB/Z514391/1
  • 财政年份:
    2024
  • 资助金额:
    $ 30万
  • 项目类别:
    Training Grant
Graduating in Austerity: Do Welfare Cuts Affect the Career Path of University Students?
紧缩毕业:福利削减会影响大学生的职业道路吗?
  • 批准号:
    ES/Z502595/1
  • 财政年份:
    2024
  • 资助金额:
    $ 30万
  • 项目类别:
    Fellowship
感性個人差指標 Affect-X の構築とビスポークAIサービスの基盤確立
建立个人敏感度指数 Affect-X 并为定制人工智能服务奠定基础
  • 批准号:
    23K24936
  • 财政年份:
    2024
  • 资助金额:
    $ 30万
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
Insecure lives and the policy disconnect: How multiple insecurities affect Levelling Up and what joined-up policy can do to help
不安全的生活和政策脱节:多种不安全因素如何影响升级以及联合政策可以提供哪些帮助
  • 批准号:
    ES/Z000149/1
  • 财政年份:
    2024
  • 资助金额:
    $ 30万
  • 项目类别:
    Research Grant
How does metal binding affect the function of proteins targeted by a devastating pathogen of cereal crops?
金属结合如何影响谷类作物毁灭性病原体靶向的蛋白质的功能?
  • 批准号:
    2901648
  • 财政年份:
    2024
  • 资助金额:
    $ 30万
  • 项目类别:
    Studentship
ERI: Developing a Trust-supporting Design Framework with Affect for Human-AI Collaboration
ERI:开发一个支持信任的设计框架,影响人类与人工智能的协作
  • 批准号:
    2301846
  • 财政年份:
    2023
  • 资助金额:
    $ 30万
  • 项目类别:
    Standard Grant
Investigating how double-negative T cells affect anti-leukemic and GvHD-inducing activities of conventional T cells
研究双阴性 T 细胞如何影响传统 T 细胞的抗白血病和 GvHD 诱导活性
  • 批准号:
    488039
  • 财政年份:
    2023
  • 资助金额:
    $ 30万
  • 项目类别:
    Operating Grants
How motor impairments due to neurodegenerative diseases affect masticatory movements
神经退行性疾病引起的运动障碍如何影响咀嚼运动
  • 批准号:
    23K16076
  • 财政年份:
    2023
  • 资助金额:
    $ 30万
  • 项目类别:
    Grant-in-Aid for Early-Career Scientists
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了