Principled phylogenomic analysis without gene tree estimation

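Basic information

  • Award number:
    2308495
  • Principal investigator:
  • Amount:
    $295,300
  • Host institution:
  • Host institution country:
    United States
  • Award type:
    Standard Grant
  • Fiscal year:
    2023
  • Funding country:
    United States
  • Period:
    2023-08-01 to 2026-07-31
  • Project status:
    Ongoing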

Project abstract

This project aims to improve the estimation of species trees from genomic datasets. This estimation is challenging because different genomic regions evolve under processes that make their evolutionary histories (i.e., gene trees) discordant. This issue is exacerbated by widespread gene tree estimation errors in modern phylogenomic analyses. To address this challenge, this project's primary objective is to devise innovative mathematical, statistical, and computational techniques to analyze phylogenomic datasets without relying on gene tree estimation. This approach will produce more reliable species tree estimates in the presence of confounding processes. Species trees provide an evolutionary and comparative context in which many biological questions can be addressed. They play a vital role in understanding gene evolution, estimating divergence dates, detecting adaptation, studying trait evolution, etc. The developed methods will enhance the precision of biological discoveries based on species trees, advancing research that utilizes phylogenies. The project includes interdisciplinary research training for graduate students as well as the involvement of undergraduate students recruited through local initiatives. New course materials based on the proposed research will be developed for existing graduate courses and be made available through the PI’s website. The project will leverage connections to NSF-funded interdisciplinary institutes.

It is well established that different regions of a genome can evolve under different gene trees, due to processes such as incomplete lineage sorting, gene duplication and loss, and lateral gene transfer, complicating the estimation of species trees. Many methods that first estimate gene trees and then combine this information to estimate a species tree are known to have good theoretical guarantees, under the assumption that the true gene trees are known. That assumption is not satisfied in practice. Accounting theoretically for gene tree estimation error has proved challenging and few results are available. Building on prior work by the PI on the rigorous study of stochastic processes arising in this phylogenomic context, the proposed research will establish much-needed theoretical foundations for the analysis of multi-locus, multi-site datasets and the estimation of species trees without gene trees, including the development of novel estimators, the derivation of impossibility results and matching finite sample bounds, and the investigation of the effect of intra-locus recombination. This project will also enable the development of statistically rigorous, scalable algorithms. This interdisciplinary research will involve a close integration of applied probability, statistical theory, graph algorithms, and evolutionary biology.

This proposal is jointly funded by the Mathematical Biology and Statistics Programs at the Division of Mathematical Sciences.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

Project outcomes

Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)

Other grants by Sebastien Roch

Scalable Statistical Inference in Small-World Networks
  • Award number:
    1916378
  • Fiscal year:
    2019
  • Amount:
    $295,300
  • Award type:
    Standard Grant
Probability Questions in Phylogenetics
  • Award number:
    1614242
  • Fiscal year:
    2016
  • Amount:
    $295,300
  • Award type:
    Standard Grant
Probabilistic Techniques in Mathematical Phylogenetics
  • Award number:
    1248176
  • Fiscal year:
    2012
  • Amount:
    $295,300
  • Award type:
    Standard Grant
CAREER: Phylogenomics - New Computational Methods through Stochastic Modeling and Analysis
  • Award number:
    1149312
  • Fiscal year:
    2012
  • Amount:
    $295,300
  • Award type:
    Continuing Grant
Probabilistic Techniques in Mathematical Phylogenetics
  • Award number:
    1007144
  • Fiscal year:
    2010
  • Amount:
    $295,300
  • Award type:
    Standard Grant

Similar overseas grants

Phylogenomic mechanisms of trait evolution and resilience to disease
  • Award number:
    10713885
  • Fiscal year:
    2023
  • Amount:
    $295,300
  • Award type:
A comparative analysis of the nutrient provisioning capacities and phylogenomic analysis of the alphaproteobacterial leech endosymbionts Reichenowia spp.
  • Award number:
    553678-2020
  • Fiscal year:
    2020
  • Amount:
    $295,300
  • Award type:
    Alexander Graham Bell Canada Graduate Scholarships - Master's
Phylogenomic Analysis of Leeches (Hirudinea)
  • Award number:
    518435-2018
  • Fiscal year:
    2020
  • Amount:
    $295,300
  • Award type:
    Postgraduate Scholarships - Doctoral
Phylogenomic Analysis of Leeches (Hirudinea)
  • Award number:
    518435-2018
  • Fiscal year:
    2019
  • Amount:
    $295,300
  • Award type:
    Postgraduate Scholarships - Doctoral
Phylogenomic Analysis of Leeches (Hirudinea)
  • Award number:
    518435-2018
  • Fiscal year:
    2018
  • Amount:
    $295,300
  • Award type:
    Postgraduate Scholarships - Doctoral
Phylogenomic analysis of Nereididae (Annelida)
  • Award number:
    406092100
  • Fiscal year:
    2018
  • Amount:
    $295,300
  • Award type:
    Research Grants
PHYLOGENOMIC, TRANSCRIPTOMIC, VIROMIC, AND IMMUNOPROTEOMIC DETERMINANTS OF NECROTIZING ENTEROCOLITIS
  • Award number:
    9559708
  • Fiscal year:
    2017
  • Amount:
    $295,300
  • Award type:
PHYLOGENOMIC, TRANSCRIPTOMIC, VIROMIC, AND IMMUNOPROTEOMIC DETERMINANTS OF NECROTIZING ENTEROCOLITIS
  • Award number:
    10164835
  • Fiscal year:
    2017
  • Amount:
    $295,300
  • Award type:
PHYLOGENOMIC, TRANSCRIPTOMIC, VIROMIC, AND IMMUNOPROTEOMIC DETERMINANTS OF NECROTIZING ENTEROCOLITIS
  • Award number:
    9369551
  • Fiscal year:
    2017
  • Amount:
    $295,300
  • Award type:
Phylogenomic epidemiology of Clostridium difficile
  • Award number:
    8867125
  • Fiscal year:
    2012
  • Amount:
    $295,300
  • Award type: