Deep learning for population genetics

群体遗传学的深度学习

基本信息

  • 批准号:
    10574510
  • 负责人:
  • 金额:
    $ 42.04万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
  • 财政年份:
    2020
  • 资助国家:
    美国
  • 起止时间:
    2020-04-21 至 2025-02-28
  • 项目状态:
    未结题

项目摘要

Project Summary The revolution in genome sequencing technologies over the past 15 years has created an explosion of population genomic data but has left in its wake a gap in our ability to make sense of data at this scale. In particular, whereas population genetics as a field has been traditionally data-limited, the massive volume of current sequencing means that previously unanswerable questions may now be within reach. To capitalize on this flood of information we need new methods and modes of analysis. In the past 5 years the world of machine learning has been revolutionized by the rise of deep neural networks. These so-called deep learning methods offer incredible flexibility as well as astounding improvements in performance for a wide array of machine learning tasks, including computer vision, speech recognition, and natural language processing. This proposal aims to harness the great potential of deep learning for population genetic inference. In recent years our group has made great strides in using supervised machine learning for population genomic analysis (reviewed in Schrider and Kern 2018). However, this work has focused primarily on using more traditional machine learning methods such as random forests. As we argue in this proposal, DNA sequence data are particularly well suited for modern deep learning techniques, and we demonstrate that the application of these methods can rapidly lead to state-of-the-art performance in very difficult population genetic tasks such as estimating rates of recombination. The power of these methods for handling genetic data stems in part from their ability to automatically learn to extract as much useful information as possible from an alignment of DNA sequences in order to solve the task at hand, rather than relying on one or more predefined summary statistics which are generally problem-specific and may omit information present in the raw data. In this proposal we lay out a systematic approach for both empowering the field with these tools and understanding their shortcomings. In particular, we propose to design deep neural networks for solving population genetic problems, and incorporate successful networks into user-friendly software tools that will be shared with the community. We will also investigate a variety of methods for estimating the uncertainty of predictions produced by deep learning methods; this area is understudied in machine learning but of great importance to biological researchers who require an accurate measure of the degree of uncertainty surrounding an estimate. Finally, we will explore the impact of training data misspecification—wherein the data used to train a machine learning method differ systematically from the data to which it will be applied in practice. We will devise techniques to mitigate the impact of such misspecification in order to ensure that our tools will be robust to the complications inherent in analyzing real genomic data sets. Together, these advances have the potential to transform the methodological landscape of population genetic inference.
项目总结

项目成果

期刊论文数量(11)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
A quantitative genetic model of background selection in humans.
人类背景选择的定量遗传模型。
  • DOI:
    10.1371/journal.pgen.1011144
  • 发表时间:
    2024
  • 期刊:
  • 影响因子:
    4.5
  • 作者:
    Buffalo,Vince;Kern,AndrewD
  • 通讯作者:
    Kern,AndrewD
Shared evolutionary processes shape landscapes of genomic variation in the great apes.
共享的进化过程塑造了类人猿的基因组变异景观。
Evaluating evidence for co-geography in the Anopheles-Plasmodium host-parasite system.
评估按蚊-疟原虫宿主-寄生虫系统中协同地理学的证据。
Dispersal inference from population genetic variation using a convolutional neural network.
  • DOI:
    10.1093/genetics/iyad068
  • 发表时间:
    2023-05-26
  • 期刊:
  • 影响因子:
    3.3
  • 作者:
    Smith, Chris C. R.;Tittes, Silas;Ralph, Peter L.;Kern, Andrew D.
  • 通讯作者:
    Kern, Andrew D.
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

ANDREW D KERN其他文献

ANDREW D KERN的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('ANDREW D KERN', 18)}}的其他基金

Computational Population Genetics
计算群体遗传学
  • 批准号:
    10552275
  • 财政年份:
    2023
  • 资助金额:
    $ 42.04万
  • 项目类别:
Deep learning for population genetics
群体遗传学的深度学习
  • 批准号:
    9976348
  • 财政年份:
    2020
  • 资助金额:
    $ 42.04万
  • 项目类别:
Deep learning for population genetics
群体遗传学的深度学习
  • 批准号:
    10349557
  • 财政年份:
    2020
  • 资助金额:
    $ 42.04万
  • 项目类别:
Population genomics of adaptation
适应的群体基因组学
  • 批准号:
    9383198
  • 财政年份:
    2017
  • 资助金额:
    $ 42.04万
  • 项目类别:
POPULATION GENOMICS OF ADAPTATION
适应的群体基因组学
  • 批准号:
    9753261
  • 财政年份:
    2017
  • 资助金额:
    $ 42.04万
  • 项目类别:
Human Population Genomics
人类基因组学
  • 批准号:
    7053104
  • 财政年份:
    2005
  • 资助金额:
    $ 42.04万
  • 项目类别:
Human Population Genomics
人类基因组学
  • 批准号:
    7283831
  • 财政年份:
    2005
  • 资助金额:
    $ 42.04万
  • 项目类别:
Human Population Genomics
人类基因组学
  • 批准号:
    7146707
  • 财政年份:
    2005
  • 资助金额:
    $ 42.04万
  • 项目类别:

相似国自然基金

层出镰刀菌氮代谢调控因子AreA 介导伏马菌素 FB1 生物合成的作用机理
  • 批准号:
    2021JJ40433
  • 批准年份:
    2021
  • 资助金额:
    0.0 万元
  • 项目类别:
    省市级项目
寄主诱导梢腐病菌AreA和CYP51基因沉默增强甘蔗抗病性机制解析
  • 批准号:
    32001603
  • 批准年份:
    2020
  • 资助金额:
    24.0 万元
  • 项目类别:
    青年科学基金项目
AREA国际经济模型的移植.改进和应用
  • 批准号:
    18870435
  • 批准年份:
    1988
  • 资助金额:
    2.0 万元
  • 项目类别:
    面上项目

相似海外基金

Tribal Intertidal Digital Ecological Surveys Project: Using Large-Area Imaging to Assess Intertidal Biological Response to Changing Oceanographic Conditions in Partnership with Indigenous Nations
部落潮间带数字生态调查项目:与土著民族合作,利用大面积成像评估潮间带生物对不断变化的海洋条件的反应
  • 批准号:
    532685-2019
  • 财政年份:
    2022
  • 资助金额:
    $ 42.04万
  • 项目类别:
    Postgraduate Scholarships - Doctoral
Tribal Intertidal Digital Ecological Surveys Project: Using Large-Area Imaging to Assess Intertidal Biological Response to Changing Oceanographic Conditions in Partnership with Indigenous Nations
部落潮间带数字生态调查项目:与土著民族合作,利用大面积成像评估潮间带生物对不断变化的海洋条件的反应
  • 批准号:
    532685-2019
  • 财政年份:
    2020
  • 资助金额:
    $ 42.04万
  • 项目类别:
    Postgraduate Scholarships - Doctoral
biological interactions among forest-dwelling fungus gnats and their natural enemies in shiitake mashroom production area
香菇产区森林真菌蚊与其天敌之间的生物相互作用
  • 批准号:
    19K06152
  • 财政年份:
    2019
  • 资助金额:
    $ 42.04万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Tribal Intertidal Digital Ecological Surveys Project: Using Large-Area Imaging to Assess Intertidal Biological Response to Changing Oceanographic Conditions in Partnership with Indigenous Nations
部落潮间带数字生态调查项目:与土著民族合作,利用大面积成像评估潮间带生物对不断变化的海洋条件的反应
  • 批准号:
    532685-2019
  • 财政年份:
    2019
  • 资助金额:
    $ 42.04万
  • 项目类别:
    Postgraduate Scholarships - Doctoral
To what extent does governance play a role in how effectively a marine protected area in the Irish Sea reaches its biological and socioeconomic goals?
治理在多大程度上对爱尔兰海海洋保护区如何有效实现其生物和社会经济目标发挥作用?
  • 批准号:
    2287487
  • 财政年份:
    2019
  • 资助金额:
    $ 42.04万
  • 项目类别:
    Studentship
War and Biological Ageing in Vietnam: A Planning Grant to Foster Collaboration on a Novel Area of Global Research in Health and Ageing
越南的战争与生物衰老:一项规划拨款,以促进全球健康与老龄化研究新领域的合作
  • 批准号:
    404425
  • 财政年份:
    2019
  • 资助金额:
    $ 42.04万
  • 项目类别:
    Miscellaneous Programs
Impact assessment of Noctiluca scintillans red tide on nutrient dynamics, biological processes in lower trophic levels and material cycle in the neritic area of Sagami Bay
夜光藻赤潮对相模湾浅海区营养动态、低营养层生物过程和物质循环的影响评估
  • 批准号:
    18K05794
  • 财政年份:
    2018
  • 资助金额:
    $ 42.04万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Large-area graphene based chemical and biological sensors
基于大面积石墨烯的化学和生物传感器
  • 批准号:
    355863-2011
  • 财政年份:
    2015
  • 资助金额:
    $ 42.04万
  • 项目类别:
    Discovery Grants Program - Individual
Large-area graphene based chemical and biological sensors
基于大面积石墨烯的化学和生物传感器
  • 批准号:
    355863-2011
  • 财政年份:
    2014
  • 资助金额:
    $ 42.04万
  • 项目类别:
    Discovery Grants Program - Individual
Theoretical simulation and experimental study on biological weathering mechanism of the rock around coastal area in Yaeyama Islands
八重山群岛沿岸岩石生物风化机制的理论模拟与实验研究
  • 批准号:
    26790079
  • 财政年份:
    2014
  • 资助金额:
    $ 42.04万
  • 项目类别:
    Grant-in-Aid for Young Scientists (B)
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了