Representing structural haplotypes and complex genetic variation in pan-genome graphs

表示泛基因组图中的结构单倍型和复杂的遗传变异

基本信息

  • 批准号:
    9906038
  • 负责人:
  • 金额:
    $ 38.9万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
  • 财政年份:
    2020
  • 资助国家:
    美国
  • 起止时间:
    2020-02-10 至 2023-01-31
  • 项目状态:
    已结题

项目摘要

Title: Representing structural haplotypes and complex genetic variation in pan-genome graphs. PROJECT SUMMARY A pan-genome graph (PGG) reference must faithfully reflect structural haplotypes that differ in copy number, order, and orientation, which are currently poorly represented in a linear reference sequence. This effort focuses on the most copy variable and complex regions, including segmental duplications (SDs), inversions, short tandem repeats/variable number tandem repeats (copy-number-variable repeats, CNVRs) and combinations thereof that are frequently excluded or collapsed in reference genomes. The overarching goal of this project is to develop the tool infrastructure enabling the construction of whole-chromosome reference haplotypes that include all of these difficult classes of sequence. There are four specific aims. First, we will develop methods to construct PGGs from haplotype-phased de novo assemblies, ensuring the graph reflects both copy number variation and repeat structure, including CNVRs and SD. Second, we will develop software that will expand SD assembly methods to facilitate the curation of SD loci in PGGs. We will use SD assembly to detect variants specific to individual copies of a duplication, called paralog-specific variants (PSVs), and provide software to reconstruct local haplotype paths through the PGG that describe the different copies. Third, we will design novel methods to exploit single-cell template strand DNA sequencing data (Strand-seq) mapped to PGGs in order to thread chromosome-length "structural haplotypes" through the graph. Therefore, our software tool will allow the physical resolution of haplotypes comprising the full spectrum of structural variation, including inversions and inverted duplications. By virtue of the PSVs, the structural haplotypes will also embed sequence-resolved SDs. Fourth, we will develop a scalable open-source software framework to systematically assess how the inclusion of single-nucleotide variants, short indels, and structural variant classes in the PGG affects variant detection with short-read data. This will enable the optimization of the complexity encoded in the PGG for short-read variant detection. It will additionally provide a comprehensive view on polymorphic and fixed k-mers in human populations. We will develop tools to detect allele-specific k-mers and demonstrate how that enables the rapid genotyping of variants in the PGG based on k-mer composition of a short-read dataset. Once the framework for enhanced genome representation is established, we will focus on improving efficiency, scalability, and computational ease to cater to the needs of a broad range of users in genetics and genome science. This proposal will ensure that the most complex regions of the human genome are encoded into the PGG and that underlying genetic variation is ultimately assessed for association with disease. ​
代表泛基因组中的结构单倍型和复杂遗传变异 图表。 项目摘要 泛基因组图(PGG)参考必须忠实地反映拷贝数不同的结构单倍型, 顺序和方向,目前在线性参考序列中表现不佳。这一努力 集中在最拷贝可变和复杂的区域,包括片段重复(SD),倒置, 短串联重复序列/可变数目串联重复序列(拷贝数可变重复序列,CNVR)和 其组合在参考基因组中经常被排除或折叠。的首要目标 该项目旨在开发工具基础设施,以构建全染色体参考 包括所有这些复杂的序列类型的单倍型。有四个具体目标。一是 开发从单体型定相的从头组装构建PGG的方法,确保图反映 拷贝数变异和重复结构,包括CNVRs和SD。第二,我们将开发软件 这将扩展SD组装方法,以促进PGG中SD基因座的管理。我们将使用SD组装 检测重复的个体拷贝特异性的变体,称为旁系同源特异性变体(PSV),和 提供软件以通过描述不同拷贝的PGG重建局部单体型路径。第三、 我们将设计新的方法来利用单细胞模板链DNA测序数据(Strand-seq)映射 以使染色体长度的“结构单倍型”穿过图。所以我们的 软件工具将允许包括结构变异的全谱的单倍型的物理解析, 包括倒置和倒置复制。通过PSV,结构单倍型也将嵌入 序列分辨SD。第四,我们将开发一个可扩展的开源软件框架, 评估如何在PGG中包含单核苷酸变体,短插入缺失和结构变体类别 影响短读数据的变体检测。这将使得能够优化编码在 用于短读变体检测的PGG。此外,它还将提供有关多态和 在人群中固定k聚体。我们将开发检测等位基因特异性k-mer的工具,并演示如何 这使得能够基于短读段数据集的k聚体组成对PGG中的变体进行快速基因分型。 一旦建立了增强基因组表示的框架,我们将专注于提高效率, 可扩展性和计算简便性,以满足遗传学和基因组领域广泛用户的需求 科学这一提议将确保人类基因组中最复杂的区域被编码到 PGG并且最终评估潜在的遗传变异与疾病的相关性。 ​

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Mark Chaisson其他文献

Mark Chaisson的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Mark Chaisson', 18)}}的其他基金

Representing structural haplotypes and complex genetic variation in pan-genome graphs
表示泛基因组图中的结构单倍型和复杂的遗传变异
  • 批准号:
    10832934
  • 财政年份:
    2023
  • 资助金额:
    $ 38.9万
  • 项目类别:
Detection and genotyping complex human genetic variation using single-molecule sequencing
使用单分子测序对复杂的人类遗传变异进行检测和基因分型
  • 批准号:
    10186109
  • 财政年份:
    2021
  • 资助金额:
    $ 38.9万
  • 项目类别:
Detection and genotyping complex human genetic variation using single-molecule sequencing
使用单分子测序对复杂的人类遗传变异进行检测和基因分型
  • 批准号:
    10655573
  • 财政年份:
    2021
  • 资助金额:
    $ 38.9万
  • 项目类别:
Detection and genotyping complex human genetic variation using single-molecule sequencing
使用单分子测序对复杂的人类遗传变异进行检测和基因分型
  • 批准号:
    10447193
  • 财政年份:
    2021
  • 资助金额:
    $ 38.9万
  • 项目类别:
Representing structural haplotypes and complex genetic variation in pan-genome graphs
表示泛基因组图中的结构单倍型和复杂的遗传变异
  • 批准号:
    10337078
  • 财政年份:
    2020
  • 资助金额:
    $ 38.9万
  • 项目类别:

相似海外基金

How Does Particle Material Properties Insoluble and Partially Soluble Affect Sensory Perception Of Fat based Products
不溶性和部分可溶的颗粒材料特性如何影响脂肪基产品的感官知觉
  • 批准号:
    BB/Z514391/1
  • 财政年份:
    2024
  • 资助金额:
    $ 38.9万
  • 项目类别:
    Training Grant
BRC-BIO: Establishing Astrangia poculata as a study system to understand how multi-partner symbiotic interactions affect pathogen response in cnidarians
BRC-BIO:建立 Astrangia poculata 作为研究系统,以了解多伙伴共生相互作用如何影响刺胞动物的病原体反应
  • 批准号:
    2312555
  • 财政年份:
    2024
  • 资助金额:
    $ 38.9万
  • 项目类别:
    Standard Grant
RII Track-4:NSF: From the Ground Up to the Air Above Coastal Dunes: How Groundwater and Evaporation Affect the Mechanism of Wind Erosion
RII Track-4:NSF:从地面到沿海沙丘上方的空气:地下水和蒸发如何影响风蚀机制
  • 批准号:
    2327346
  • 财政年份:
    2024
  • 资助金额:
    $ 38.9万
  • 项目类别:
    Standard Grant
Graduating in Austerity: Do Welfare Cuts Affect the Career Path of University Students?
紧缩毕业:福利削减会影响大学生的职业道路吗?
  • 批准号:
    ES/Z502595/1
  • 财政年份:
    2024
  • 资助金额:
    $ 38.9万
  • 项目类别:
    Fellowship
感性個人差指標 Affect-X の構築とビスポークAIサービスの基盤確立
建立个人敏感度指数 Affect-X 并为定制人工智能服务奠定基础
  • 批准号:
    23K24936
  • 财政年份:
    2024
  • 资助金额:
    $ 38.9万
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
Insecure lives and the policy disconnect: How multiple insecurities affect Levelling Up and what joined-up policy can do to help
不安全的生活和政策脱节:多种不安全因素如何影响升级以及联合政策可以提供哪些帮助
  • 批准号:
    ES/Z000149/1
  • 财政年份:
    2024
  • 资助金额:
    $ 38.9万
  • 项目类别:
    Research Grant
How does metal binding affect the function of proteins targeted by a devastating pathogen of cereal crops?
金属结合如何影响谷类作物毁灭性病原体靶向的蛋白质的功能?
  • 批准号:
    2901648
  • 财政年份:
    2024
  • 资助金额:
    $ 38.9万
  • 项目类别:
    Studentship
Investigating how double-negative T cells affect anti-leukemic and GvHD-inducing activities of conventional T cells
研究双阴性 T 细胞如何影响传统 T 细胞的抗白血病和 GvHD 诱导活性
  • 批准号:
    488039
  • 财政年份:
    2023
  • 资助金额:
    $ 38.9万
  • 项目类别:
    Operating Grants
New Tendencies of French Film Theory: Representation, Body, Affect
法国电影理论新动向:再现、身体、情感
  • 批准号:
    23K00129
  • 财政年份:
    2023
  • 资助金额:
    $ 38.9万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
The Protruding Void: Mystical Affect in Samuel Beckett's Prose
突出的虚空:塞缪尔·贝克特散文中的神秘影响
  • 批准号:
    2883985
  • 财政年份:
    2023
  • 资助金额:
    $ 38.9万
  • 项目类别:
    Studentship
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了