Software for exploring all forms of genetic variation in any species

用于探索任何物种中所有形式的遗传变异的软件

基本信息

  • 批准号:
    9749979
  • 负责人:
  • 金额:
    $ 45.75万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
  • 财政年份:
    2017
  • 资助国家:
    美国
  • 起止时间:
    2017-08-01 至 2021-07-31
  • 项目状态:
    已结题

项目摘要

Modern DNA sequencing technologies have revolutionized the design of experiments investigating the biology of the genome and the genetic basis of traits. Arguably the most powerful application of these technologies has been the creation of exquisitely detailed catalogs describing the landscape of genetic variation in multiple species. However, discovery of genetic variation is merely the beginning. Exploration and analysis of the resulting catalogs is required to catalyze new insights into the relationship between genotype and phenotype. This proposal is motivated by two fundamental limitations inhibiting discovery from genetic variation datasets. First, existing software for mining variation to understand disease and other traits does not scale to large datasets involving thousands of samples. Second, most existing tools are focused on human studies; consequently, this inhibits the application of modern DNA sequencing to genetic studies of model organisms, livestock genetics, and newly sequenced species. We propose to solve these challenges by building upon our GEMINI framework. Since 2012, we have maintained GEMINI as a powerful software framework for exploring genome variation. GEMINI's strength is that it integrates genetic variation with a diverse set of genome annotations into a database to facilitate variant prioritization. It allows researchers to conduct complex analyses with simple queries based on sample genotypes, phenotypes, inheritance patterns, and genome annotations. GEMINI has quickly become a very popular tool for rare human disease research leading to discoveries by multiple labs, including our own. Despite its power and popularity, GEMINI has three important limitations. It was not designed for studies involving genetic variation from more than a few hundred samples. Furthermore, its focus is the analysis of single-nucleotide (SNP) and insertion-deletion (INDEL); it is blind to structural and copy number variation. Finally, GEMINI can only analyze genetic variation datasets for the human genome; no other species or genome builds are supported. Therefore, this proposal seeks to provide geneticists studying any species with a powerful, flexible and simple to use software system that is fast and scalable enough to support genetic research for many years to come. We will do this but achieving the following Specific Aims: (1) Develop a scalable, high performance genotype and haplotype query engine to empower large scale genome studies. (2) Devise new methods for genotyping, integrating and prioritizing structural variation. (3) Enable scalable, flexible genome analysis in any species and genome build. In summary, by completing these aims, the proposed research will provide geneticists studying any species with a powerful, flexible and simple to use software system that is fast and scalable enough to support genetic research for many years to come.
现代DNA测序技术彻底改变了研究生物多样性的实验设计 基因组生物学和性状的遗传基础。可以说,这些功能最强大的应用程序 技术已经创建了描述基因景观的精致详细的目录 多种物种的变异。然而,基因变异的发现仅仅是个开始。探索和 需要对产生的目录进行分析,以催化对基因型之间关系的新见解 和表型。这一建议的动机是两个根本的限制,阻碍了对基因的发现 变异数据集。首先,现有的用于挖掘变异以了解疾病和其他特征的软件不能 扩展到涉及数千个样本的大型数据集。其次,现有的大多数工具都是以人为中心 因此,这阻碍了现代DNA测序在模型遗传学研究中的应用 生物体、家畜遗传学和新测序物种。 我们建议通过建立我们的双子座框架来解决这些挑战。自2012年以来,我们已经 保持双子座作为探索基因组变异的强大软件框架。双子座的力量是 它将遗传变异和一组不同的基因组注释整合到一个数据库中,以促进变异 确定优先顺序。它允许研究人员使用基于样本的简单查询进行复杂的分析 基因类型、表型、遗传模式和基因组注释。双子座很快就变成了一个非常 用于罕见人类疾病研究的流行工具,导致多个实验室的发现,包括我们自己的实验室。 尽管双子座的力量和受欢迎程度很高,但它有三个重要的局限性。它不是为 涉及数百个样本的遗传变异的研究。此外,它的重点是 单核苷酸(SNP)和插入-缺失(Indel)分析;它对结构和拷贝数是盲目的 变种。最后,双子座只能分析人类基因组的遗传变异数据集;不能分析其他物种 或者支持基因组构建。因此,这项提议试图为研究任何物种的遗传学家提供 具有强大、灵活且简单易用的软件系统,该软件系统具有足够快的速度和足够的可扩展性,可支持 在未来的许多年里进行研究。我们将这样做,但要实现以下具体目标: (1)开发可扩展、高性能的基因和单倍型查询引擎,为用户赋能 大规模的基因组研究。 (2)设计新的基因分型、整合和区分结构变异的方法。 (3)在任何物种和基因组构建中实现可扩展、灵活的基因组分析。 总之,通过完成这些目标,拟议的研究将为遗传学家提供研究任何 具有强大、灵活且简单易用的软件系统的物种,该软件系统具有足够快的速度和足够的可扩展性来支持 在未来的许多年里进行基因研究。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Aaron R Quinlan其他文献

Extending reference assembly models
  • DOI:
    10.1186/s13059-015-0587-3
  • 发表时间:
    2015-01-24
  • 期刊:
  • 影响因子:
    9.400
  • 作者:
    Deanna M Church;Valerie A Schneider;Karyn Meltz Steinberg;Michael C Schatz;Aaron R Quinlan;Chen-Shan Chin;Paul A Kitts;Bronwen Aken;Gabor T Marth;Michael M Hoffman;Javier Herrero;M Lisandra Zepeda Mendoza;Richard Durbin;Paul Flicek
  • 通讯作者:
    Paul Flicek
Erratum: A reference bacterial genome dataset generated on the MinIONTM portable single-molecule nanopore sequencer
  • DOI:
    10.1186/s13742-015-0043-z
  • 发表时间:
    2015-02-13
  • 期刊:
  • 影响因子:
    3.900
  • 作者:
    Joshua Quick;Aaron R Quinlan;Nicholas J Loman
  • 通讯作者:
    Nicholas J Loman

Aaron R Quinlan的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Aaron R Quinlan', 18)}}的其他基金

New algorithms and tools for large-scale genomic analyses
用于大规模基因组分析的新算法和工具
  • 批准号:
    10357060
  • 财政年份:
    2022
  • 资助金额:
    $ 45.75万
  • 项目类别:
New algorithms and tools for large-scale genomic analyses
用于大规模基因组分析的新算法和工具
  • 批准号:
    10560502
  • 财政年份:
    2022
  • 资助金额:
    $ 45.75万
  • 项目类别:
Scalable detection and interpretation of structural variation in human genomes
人类基因组结构变异的可扩展检测和解释
  • 批准号:
    10576268
  • 财政年份:
    2020
  • 资助金额:
    $ 45.75万
  • 项目类别:
Scalable detection and interpretation of structural variation in human genomes
人类基因组结构变异的可扩展检测和解释
  • 批准号:
    9973582
  • 财政年份:
    2020
  • 资助金额:
    $ 45.75万
  • 项目类别:
Scalable detection and interpretation of structural variation in human genomes
人类基因组结构变异的可扩展检测和解释
  • 批准号:
    10341175
  • 财政年份:
    2020
  • 资助金额:
    $ 45.75万
  • 项目类别:
Scalable detection and interpretation of structural variation in human genomes
人类基因组结构变异的可扩展检测和解释
  • 批准号:
    10153847
  • 财政年份:
    2020
  • 资助金额:
    $ 45.75万
  • 项目类别:
New algorithms and tools for large-scale genomic analyses
用于大规模基因组分析的新算法和工具
  • 批准号:
    8273206
  • 财政年份:
    2012
  • 资助金额:
    $ 45.75万
  • 项目类别:
New algorithms and tools for large-scale genomic analyses
用于大规模基因组分析的新算法和工具
  • 批准号:
    9272425
  • 财政年份:
    2012
  • 资助金额:
    $ 45.75万
  • 项目类别:
New algorithms and tools for large-scale genomic analyses
用于大规模基因组分析的新算法和工具
  • 批准号:
    8661785
  • 财政年份:
    2012
  • 资助金额:
    $ 45.75万
  • 项目类别:
New algorithms and tools for large-scale genomic analyses
用于大规模基因组分析的新算法和工具
  • 批准号:
    8460819
  • 财政年份:
    2012
  • 资助金额:
    $ 45.75万
  • 项目类别:

相似海外基金

Linkage of HIV amino acid variants to protective host alleles at CHD1L and HLA class I loci in an African population
非洲人群中 HIV 氨基酸变异与 CHD1L 和 HLA I 类基因座的保护性宿主等位基因的关联
  • 批准号:
    502556
  • 财政年份:
    2024
  • 资助金额:
    $ 45.75万
  • 项目类别:
Olfactory Epithelium Responses to Human APOE Alleles
嗅觉上皮对人类 APOE 等位基因的反应
  • 批准号:
    10659303
  • 财政年份:
    2023
  • 资助金额:
    $ 45.75万
  • 项目类别:
Deeply analyzing MHC class I-restricted peptide presentation mechanistics across alleles, pathways, and disease coupled with TCR discovery/characterization
深入分析跨等位基因、通路和疾病的 MHC I 类限制性肽呈递机制以及 TCR 发现/表征
  • 批准号:
    10674405
  • 财政年份:
    2023
  • 资助金额:
    $ 45.75万
  • 项目类别:
An off-the-shelf tumor cell vaccine with HLA-matching alleles for the personalized treatment of advanced solid tumors
具有 HLA 匹配等位基因的现成肿瘤细胞疫苗,用于晚期实体瘤的个性化治疗
  • 批准号:
    10758772
  • 财政年份:
    2023
  • 资助金额:
    $ 45.75万
  • 项目类别:
Identifying genetic variants that modify the effect size of ApoE alleles on late-onset Alzheimer's disease risk
识别改变 ApoE 等位基因对迟发性阿尔茨海默病风险影响大小的遗传变异
  • 批准号:
    10676499
  • 财政年份:
    2023
  • 资助金额:
    $ 45.75万
  • 项目类别:
New statistical approaches to mapping the functional impact of HLA alleles in multimodal complex disease datasets
绘制多模式复杂疾病数据集中 HLA 等位基因功能影响的新统计方法
  • 批准号:
    2748611
  • 财政年份:
    2022
  • 资助金额:
    $ 45.75万
  • 项目类别:
    Studentship
Recessive lethal alleles linked to seed abortion and their effect on fruit development in blueberries
与种子败育相关的隐性致死等位基因及其对蓝莓果实发育的影响
  • 批准号:
    22K05630
  • 财政年份:
    2022
  • 资助金额:
    $ 45.75万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Genome and epigenome editing of induced pluripotent stem cells for investigating osteoarthritis risk alleles
诱导多能干细胞的基因组和表观基因组编辑用于研究骨关节炎风险等位基因
  • 批准号:
    10532032
  • 财政年份:
    2022
  • 资助金额:
    $ 45.75万
  • 项目类别:
Investigating the Effect of APOE Alleles on Neuro-Immunity of Human Brain Borders in Normal Aging and Alzheimer's Disease Using Single-Cell Multi-Omics and In Vitro Organoids
使用单细胞多组学和体外类器官研究 APOE 等位基因对正常衰老和阿尔茨海默病中人脑边界神经免疫的影响
  • 批准号:
    10525070
  • 财政年份:
    2022
  • 资助金额:
    $ 45.75万
  • 项目类别:
Leveraging the Evolutionary History to Improve Identification of Trait-Associated Alleles and Risk Stratification Models in Native Hawaiians
利用进化历史来改进夏威夷原住民性状相关等位基因的识别和风险分层模型
  • 批准号:
    10689017
  • 财政年份:
    2022
  • 资助金额:
    $ 45.75万
  • 项目类别:
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了