EAGER: Haplotype Phasing Algorithms and Clark Consistency Graphs

EAGER:单倍型定相算法和克拉克一致性图

基本信息

  • 批准号:
    1048831
  • 负责人:
  • 金额:
    $ 20万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2011
  • 资助国家:
    美国
  • 起止时间:
    2011-01-01 至 2012-12-31
  • 项目状态:
    已结题

项目摘要

Currently, most SNP assay data is made available from genome-wide association studies (GWAS), which proceed by identifying a number of individuals carrying a disease or a trait and comparing them to those that do not or are not known to carry the disease/trait. Both sets of individuals are then genotyped for a large number of single-nucleotide polymorphism (SNP) genetic variants that are then tested for association to the disease/trait. GWAS studies using tens of thousands of individuals are becoming commonplace and are increasingly the norm in the association of genetic variants to disease. These studies generally proceed by pooling large amounts of genome-wide data from multiple studies, for a combined total of tens of thousands of individuals in a single meta-analysis study. It can be expected that hundreds of thousands, if not millions, of individuals will soon be studied for association to a single disease or trait. Although SNPs are the most abundant form of variation between two individuals, other forms of variation exist, such as copy-number variation ? large-scale chromosomal deletions, insertions, and duplications. These variations, which have been shown to be influential factors in many diseases, are not probed using the current technology of SNP arrays. An emerging trend in accounting for ?missing heritability? is ?parent-of-origin? effects, where genetic variants confer risk only when inherited from a specific parent. Long-range haplotype phasing is key to identifying the association of the haplotype pattern to the specific parent of origin.The premise of this research is that long tracts are unlikely (to be shared) unless the haplotypes are identical by descent (IBD), in contrast to short shared tracts, which may be identical by state (IBS). A difficult algorithmic challenge is that of tract finding in genotype matrices of a sample of m people genotyped at n SNPs. The premise of our research is that long tracts are unlikely (to be shared) unless the haplotypes are identical by descent (IBD), in contrast to short shared tracts, which may be identical by state (IBS). A difficult algorithmic challenge is that of tract finding in genotype matrices of a sample of m people genotyped at n SNPs. To apply such a long-range phasing algorithm to the U.S. population, it is estimated that 2 million individuals must be genotyped. Algorithmic strategies proposed here show promise that the combinatorial structure of Clark Consistency Graphs can provide the basis for powerful algorithms that will decrease this number substantially. The primary output of this project will be new long-range phasing software, documentation, and source code, all to be immediately and continually available to the scientific community as open-source for research and education.
目前,大多数 SNP 检测数据均来自全基因组关联研究 (GWAS),该研究通过识别许多携带某种疾病或性状的个体并将其与那些不携带或未知携带该疾病/性状的个体进行比较来进行。然后对两组个体进行大量单核苷酸多态性 (SNP) 遗传变异的基因分型,然后测试这些变异与疾病/性状的关联。使用数以万计的个体进行的全基因组关联研究(GWAS)正在变得司空见惯,并且越来越成为遗传变异与疾病关联的常态。这些研究通常通过汇集来自多项研究的大量全基因组数据来进行,在一项荟萃分析研究中总共涉及数万名个体。可以预期,数十万甚至数百万的个体很快将被研究与单一疾病或性状的关联。尽管 SNP 是两个个体之间最丰富的变异形式,但也存在其他形式的变异,例如拷贝数变异?大规模染色体缺失、插入和重复。这些变异已被证明是许多疾病的影响因素,但目前的 SNP 阵列技术尚未对其进行探测。解释“遗传性缺失”的新兴趋势是?原产地父母?效应,其中遗传变异仅在从特定父母遗传时才会带来风险。长距离单倍型定相是识别单倍型模式与特定起源亲本关联的关键。这项研究的前提是,除非单倍型在血统上相同(IBD),否则长链不太可能(共享),而短共享链则可能在州内相同(IBS)。一个困难的算法挑战是在 n 个 SNP 基因分型的 m 个人样本的基因型矩阵中寻找区域。 我们研究的前提是,除非单倍型在血统上相同(IBD),否则长束不太可能(共享),而短共享束则可能在州内相同(IBS)。一个困难的算法挑战是在 n 个 SNP 基因分型的 m 个人样本的基因型矩阵中寻找区域。为了将这种远程定相算法应用于美国人口,估计必须对 200 万人进行基因分型。这里提出的算法策略表明克拉克一致性图的组合结构可以为强大的算法提供基础,从而大大减少这个数字。 该项目的主要成果将是新的远程分阶段软件、文档和源代码,所有这些都将作为开源研究和教育立即持续地提供给科学界。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Sorin Istrail其他文献

Sorin Istrail的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Sorin Istrail', 18)}}的其他基金

III: Small: Genome-Wide Algorithms for Haplotype Reconstruction and Beyond: A Combined Haplotype Assembly and Identical-by-Descent Tracts Approach
III:小:用于单倍型重建及其他的全基因组算法:单倍型组装和相同血统相结合的方法
  • 批准号:
    1321000
  • 财政年份:
    2013
  • 资助金额:
    $ 20万
  • 项目类别:
    Standard Grant
The Genome and the Computational Sciences, A Workshop at Brown University, December 8-12, 2008
基因组和计算科学,布朗大学研讨会,2008 年 12 月 8-12 日
  • 批准号:
    0714609
  • 财政年份:
    2008
  • 资助金额:
    $ 20万
  • 项目类别:
    Standard Grant
The cisGRN Browser and Database: cis-Regulatory Information Behind the Network
cisGRN 浏览器和数据库:网络背后的 cis 监管信息
  • 批准号:
    0645955
  • 财政年份:
    2007
  • 资助金额:
    $ 20万
  • 项目类别:
    Continuing Grant
RUI: Structured Operational Semantics of Concurrency
RUI:并发的结构化操作语义
  • 批准号:
    8801174
  • 财政年份:
    1988
  • 资助金额:
    $ 20万
  • 项目类别:
    Standard Grant

相似国自然基金

Multistage,haplotype and functional tests-based FCAR 基因和IgA肾病相关关系研究
  • 批准号:
    30771013
  • 批准年份:
    2007
  • 资助金额:
    30.0 万元
  • 项目类别:
    面上项目
应用常染色体单倍域(Haplotype Block)研究中国人群的遗传结构
  • 批准号:
    30571060
  • 批准年份:
    2005
  • 资助金额:
    22.0 万元
  • 项目类别:
    面上项目
客家人G6PD基因位点Haplotype Block的研究
  • 批准号:
    30470949
  • 批准年份:
    2004
  • 资助金额:
    18.0 万元
  • 项目类别:
    面上项目

相似海外基金

CAREER: Practical algorithms and high dimensional statistical methods for multimodal haplotype modelling
职业:多模态单倍型建模的实用算法和高维统计方法
  • 批准号:
    2239870
  • 财政年份:
    2023
  • 资助金额:
    $ 20万
  • 项目类别:
    Standard Grant
Development of a haplotype-aware structural variant analysis method for polyploid genomes
开发多倍体基因组的单倍型感知结构变异分析方法
  • 批准号:
    23K19338
  • 财政年份:
    2023
  • 资助金额:
    $ 20万
  • 项目类别:
    Grant-in-Aid for Research Activity Start-up
Robust and cost-effective computational methods for haplotype-resolved genome assemblies
用于单倍型解析基因组组装的稳健且经济有效的计算方法
  • 批准号:
    10572305
  • 财政年份:
    2023
  • 资助金额:
    $ 20万
  • 项目类别:
Uncovering the Genetic Mechanisms of the Chromosome 17q21.31 Tau Haplotype on Neurodegeneration Risk in FTD and PSP
揭示染色体 17q21.31 Tau 单倍型对 FTD 和 PSP 神经变性风险的遗传机制
  • 批准号:
    10789246
  • 财政年份:
    2023
  • 资助金额:
    $ 20万
  • 项目类别:
Haplotype-resolved genome assemblies and chromosomal rearrangements in arboviral vector Aedes albopictus
虫媒病毒载体白纹伊蚊中单倍型解析的基因组组装和染色体重排
  • 批准号:
    10573961
  • 财政年份:
    2022
  • 资助金额:
    $ 20万
  • 项目类别:
Haplotype-aware models of gene and isoform expression with application to genetic studies of disease in diverse populations
基因和亚型表达的单倍型感知模型及其应用于不同人群疾病遗传学研究
  • 批准号:
    10540421
  • 财政年份:
    2021
  • 资助金额:
    $ 20万
  • 项目类别:
Uncovering the genetic mechanisms of the Chromosome 17q21.31 Tau haplotype on neurodegeneration risk in FTD and PSP
揭示染色体 17q21.31 Tau 单倍型对 FTD 和 PSP 神经变性风险的遗传机制
  • 批准号:
    10902613
  • 财政年份:
    2021
  • 资助金额:
    $ 20万
  • 项目类别:
Haplotype-aware models of gene and isoform expression with application to genetic studies of disease in diverse populations
基因和亚型表达的单倍型感知模型及其应用于不同人群疾病遗传学研究
  • 批准号:
    10390207
  • 财政年份:
    2021
  • 资助金额:
    $ 20万
  • 项目类别:
Haplotype-aware models of gene and isoform expression with application to genetic studies of disease in diverse populations
基因和亚型表达的单倍型感知模型及其应用于不同人群疾病遗传学研究
  • 批准号:
    10360462
  • 财政年份:
    2021
  • 资助金额:
    $ 20万
  • 项目类别:
Genetic analysis of species-specific haplotype diversification causing rice reproductive isolation.
导致水稻生殖隔离的物种特异性单倍型多样化的遗传分析。
  • 批准号:
    21H02168
  • 财政年份:
    2021
  • 资助金额:
    $ 20万
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了