Bayesian Joint Estimation of Alignment and Phylogeny

比对和系统发育的贝叶斯联合估计

基本信息

  • 批准号:
    7660485
  • 负责人:
  • 金额:
    $ 29.87万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
  • 财政年份:
    2008
  • 资助国家:
    美国
  • 起止时间:
    2008-08-01 至 2013-07-31
  • 项目状态:
    已结题

项目摘要

DESCRIPTION (provided by applicant): Phylogenetic reconstruction is an invaluable tool for studying molecular sequences. Starting from a description of how the characters in the sequences mutate over time, the methods attempt to uncover the sequences' relatedness. Common applications range from describing the evolutionary histories of living organisms in evolutionary biology to estimating genetic distances and constructing protein families in molecular biology and bioinformatics. Standard reconstruction methods rely on sequence alignments that specify which characters in the sequences are homologous, deriving from common ancestors. A fundamental difficulty is that sequence alignments are not directly observed; they are inferred properties of the raw sequence data and must be estimated along with the phylogeny. Current tools handle this inference sequentially, first determining a sometimes poor estimate of the alignment and then conditioning on the truth of alignment to reconstruct the phylogeny. This project provides practical tools for end-users to simultaneously infer alignment and phylogeny, side-stepping biases that sequential estimation introduces. The tools assume both a character substitution model and an insertion/deletion (indel) process through which characters are added or removed generating an alignment. Further, these indels supply previously under-utilized information from the data to infer phytogenies. Major advances make this phylo-alignment framework useful for real-life datasets. The framework draws heavily on hidden Markov models, Bayesian computation and clever parameter integration to produce a computationally efficient inference engine. Expert prior knowledge helps inform the indel process. From this, realistic priors enable Bayes factor tests to address if specific indels are shared by descent or are homoplastic, reducing controversy over their value in phylogenetics. Modeling assumptions better reflect the underlying biology. Allowing spatial variation in the indel process provides more accurate phytogenies and alignments. The extensions also provide for heterogeneity tests to identify evolutionary interesting sequence regions. Examples of the methods span all time-scales of evolution, across billions of years to infer early branches in the Tree of Life to matters of months to describe the diversification of rapidly evolving viruses within infected hosts. This project markedly impacts many fields across biomedical research. For example, the project furnishes mathematical and statistical training in bioinformatics which will play a prime role in discovery during the 21st century, and rigorous inference tools employing phylo-alignment deliver improved molecular, comparative studies, a more accurate understanding of human evolution and new perspectives from which to battle infectious diseases.
描述(由申请人提供):系统发育重建是研究分子序列的宝贵工具。从描述序列中的字符如何随时间变化开始,这些方法试图揭示序列的相关性。常见的应用范围从进化生物学中描述生物体的进化历史到分子生物学和生物信息学中估计遗传距离和构建蛋白质家族。标准的重建方法依赖于序列比对,该序列比对指定序列中的哪些特征是同源的,源自共同的祖先。一个根本的困难是序列比对不能直接观察到,它们是原始序列数据的推断性质,必须沿着估计序列发生。目前的工具顺序地处理这种推断,首先确定对比对的有时较差的估计,然后根据比对的真实性来重建同源性。该项目为最终用户提供了实用的工具,可以同时推断序列估计引入的对齐和重复性,侧步偏差。这些工具假设字符替换模型和插入/删除(indel)过程,通过插入/删除过程添加或删除字符以生成对齐。此外,这些插入缺失从数据中提供先前未充分利用的信息以推断植物发生。主要的进步使这个对齐框架对现实生活中的数据集很有用。该框架在很大程度上借鉴了隐马尔可夫模型,贝叶斯计算和巧妙的参数集成,以产生一个计算效率高的推理引擎。专家的先验知识有助于为indel过程提供信息。由此,现实的先验使贝叶斯因子检验能够解决特定的插入缺失是由血统共享还是同源的,从而减少了关于它们在遗传学中的价值的争议。建模假设更好地反映了潜在的生物学。在插入缺失过程中允许空间变化提供了更准确的植物发生和比对。扩展还提供了异质性测试,以确定进化感兴趣的序列区域。这些方法的例子跨越了进化的所有时间尺度,从数十亿年来推断生命之树的早期分支到几个月来描述受感染宿主内快速进化的病毒的多样化。 该项目对生物医学研究的许多领域产生了显著影响。例如,该项目在生物信息学中提供数学和统计培训,这将在21世纪的发现中发挥主要作用,采用双序列比对的严格推理工具提供改进的分子比较研究,更准确地了解人类进化和对抗传染病的新视角。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Marc A. Suchard其他文献

Unlocking efficiency in real-world collaborative studies: a multi-site international study with one-shot lossless GLMM algorithm
在现实世界的协作研究中释放效率:一项具有一次性无损广义线性混合模型算法的多站点国际研究
  • DOI:
    10.1038/s41746-025-01846-1
  • 发表时间:
    2025-07-19
  • 期刊:
  • 影响因子:
    15.100
  • 作者:
    Jiayi Tong;Jenna M. Reps;Chongliang Luo;Yiwen Lu;Lu Li;Juan Manuel Ramirez-Anguita;Milou T. Brand;Scott L. DuVall;Thomas Falconer;Alex Mayer Fuentes;Xing He;Michael E. Matheny;Miguel A. Mayer;Bhavnisha K. Patel;Katherine R. Simon;Marc A. Suchard;Guojun Tang;Benjamin Viernes;Ross D. Williams;Mui van Zandt;Fei Wang;Jiang Bian;Jiayu Zhou;David A. Asch;Yong Chen
  • 通讯作者:
    Yong Chen
Authors’ Response to Huang et al.’s Comment on “Serially Combining Epidemiological Designs Does Not Improve Overall Signal Detection in Vaccine Safety Surveillance”
  • DOI:
    10.1007/s40264-024-01411-x
  • 发表时间:
    2024-03-05
  • 期刊:
  • 影响因子:
    3.800
  • 作者:
    Fan Bu;Faaizah Arshad;George Hripcsak;Patrick B. Ryan;Martijn J. Schuemie;Marc A. Suchard
  • 通讯作者:
    Marc A. Suchard
Transmission dynamics of the 2022 mpox epidemic in New York City
2022 年猴痘疫情在纽约市的传播动态
  • DOI:
    10.1038/s41591-025-03526-9
  • 发表时间:
    2025-03-25
  • 期刊:
  • 影响因子:
    50.000
  • 作者:
    Jonathan E. Pekar;Yu Wang;Jade C. Wang;Yucai Shao;Faten Taki;Lisa A. Forgione;Helly Amin;Tyler Clabby;Kimberly Johnson;Lucia V. Torian;Sarah L. Braunstein;Preeti Pathela;Enoma Omoregie;Scott Hughes;Marc A. Suchard;Tetyana I. Vasylyeva;Philippe Lemey;Joel O. Wertheim
  • 通讯作者:
    Joel O. Wertheim
BEAST X for Bayesian phylogenetic, phylogeographic and phylodynamic inference
用于贝叶斯系统发育、系统地理和系统动态推断的 BEAST X
  • DOI:
    10.1038/s41592-025-02751-x
  • 发表时间:
    2025-07-07
  • 期刊:
  • 影响因子:
    32.100
  • 作者:
    Guy Baele;Xiang Ji;Gabriel W. Hassler;John T. McCrone;Yucai Shao;Zhenyu Zhang;Andrew J. Holbrook;Philippe Lemey;Alexei J. Drummond;Andrew Rambaut;Marc A. Suchard
  • 通讯作者:
    Marc A. Suchard
Artificial intelligence for modelling infectious disease epidemics
用于模拟传染病流行的人工智能
  • DOI:
    10.1038/s41586-024-08564-w
  • 发表时间:
    2025-02-19
  • 期刊:
  • 影响因子:
    48.500
  • 作者:
    Moritz U. G. Kraemer;Joseph L.-H. Tsui;Serina Y. Chang;Spyros Lytras;Mark P. Khurana;Samantha Vanderslott;Sumali Bajaj;Neil Scheidwasser;Jacob Liam Curran-Sebastian;Elizaveta Semenova;Mengyan Zhang;H. Juliette T. Unwin;Oliver J. Watson;Cathal Mills;Abhishek Dasgupta;Luca Ferretti;Samuel V. Scarpino;Etien Koua;Oliver Morgan;Houriiyah Tegally;Ulrich Paquet;Loukas Moutsianas;Christophe Fraser;Neil M. Ferguson;Eric J. Topol;David A. Duchêne;Tanja Stadler;Patricia Kingori;Michael J. Parker;Francesca Dominici;Nigel Shadbolt;Marc A. Suchard;Oliver Ratmann;Seth Flaxman;Edward C. Holmes;Manuel Gomez-Rodriguez;Bernhard Schölkopf;Christl A. Donnelly;Oliver G. Pybus;Simon Cauchemez;Samir Bhatt
  • 通讯作者:
    Samir Bhatt

Marc A. Suchard的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Marc A. Suchard', 18)}}的其他基金

Statistical innovation to integrate sequences and phenotypes for scalable phylodynamic inference
统计创新整合序列和表型以进行可扩展的系统动力学推断
  • 批准号:
    10584588
  • 财政年份:
    2021
  • 资助金额:
    $ 29.87万
  • 项目类别:
Statistical innovation to integrate sequences and phenotypes for scalable phylodynamic inference
统计创新整合序列和表型以进行可扩展的系统动力学推断
  • 批准号:
    10390334
  • 财政年份:
    2021
  • 资助金额:
    $ 29.87万
  • 项目类别:
Statistical innovation to integrate sequences and phenotypes for scalable phylodynamic inference
统计创新整合序列和表型以进行可扩展的系统动力学推断
  • 批准号:
    10177121
  • 财政年份:
    2021
  • 资助金额:
    $ 29.87万
  • 项目类别:
Consortium for Viral Systems Biology Modeling Core
病毒系统生物学建模核心联盟
  • 批准号:
    10579085
  • 财政年份:
    2018
  • 资助金额:
    $ 29.87万
  • 项目类别:
Consortium for Viral Systems Biology Modeling Core
病毒系统生物学建模核心联盟
  • 批准号:
    10374718
  • 财政年份:
    2018
  • 资助金额:
    $ 29.87万
  • 项目类别:
Consortium for Viral Systems Biology Modeling Core
病毒系统生物学建模核心联盟
  • 批准号:
    10310604
  • 财政年份:
    2018
  • 资助金额:
    $ 29.87万
  • 项目类别:
Bayesian Joint Estimation of Alignment and Phylogeny
比对和系统发育的贝叶斯联合估计
  • 批准号:
    7596504
  • 财政年份:
    2008
  • 资助金额:
    $ 29.87万
  • 项目类别:
Bayesian Joint Estimation of Alignment and Phylogeny
比对和系统发育的贝叶斯联合估计
  • 批准号:
    8116012
  • 财政年份:
    2008
  • 资助金额:
    $ 29.87万
  • 项目类别:
Bayesian Joint Estimation of Alignment and Phylogeny
比对和系统发育的贝叶斯联合估计
  • 批准号:
    7883433
  • 财政年份:
    2008
  • 资助金额:
    $ 29.87万
  • 项目类别:
Bayesian Joint Estimation of Alignment and Phylogeny
比对和系统发育的贝叶斯联合估计
  • 批准号:
    8302280
  • 财政年份:
    2008
  • 资助金额:
    $ 29.87万
  • 项目类别:

相似海外基金

Novel Data Structures And Scalable Algorithms For High Throughput Bioinformatics
高通量生物信息学的新颖数据结构和可扩展算法
  • 批准号:
    RGPIN-2019-06640
  • 财政年份:
    2022
  • 资助金额:
    $ 29.87万
  • 项目类别:
    Discovery Grants Program - Individual
Bioinformatics Algorithms for Protein Interactions and Applications
蛋白质相互作用和应用的生物信息学算法
  • 批准号:
    RGPIN-2021-03978
  • 财政年份:
    2022
  • 资助金额:
    $ 29.87万
  • 项目类别:
    Discovery Grants Program - Individual
Novel Data Structures And Scalable Algorithms For High Throughput Bioinformatics
高通量生物信息学的新颖数据结构和可扩展算法
  • 批准号:
    RGPIN-2019-06640
  • 财政年份:
    2021
  • 资助金额:
    $ 29.87万
  • 项目类别:
    Discovery Grants Program - Individual
Bioinformatics Algorithms for Protein Interactions and Applications
蛋白质相互作用和应用的生物信息学算法
  • 批准号:
    RGPIN-2021-03978
  • 财政年份:
    2021
  • 资助金额:
    $ 29.87万
  • 项目类别:
    Discovery Grants Program - Individual
Bioinformatics Algorithms
生物信息学算法
  • 批准号:
    CRC-2017-00215
  • 财政年份:
    2021
  • 资助金额:
    $ 29.87万
  • 项目类别:
    Canada Research Chairs
Bioinformatics Algorithms and Software for Proteomics
蛋白质组学生物信息学算法和软件
  • 批准号:
    RGPIN-2016-03998
  • 财政年份:
    2021
  • 资助金额:
    $ 29.87万
  • 项目类别:
    Discovery Grants Program - Individual
Novel Data Structures And Scalable Algorithms For High Throughput Bioinformatics
高通量生物信息学的新颖数据结构和可扩展算法
  • 批准号:
    RGPIN-2019-06640
  • 财政年份:
    2020
  • 资助金额:
    $ 29.87万
  • 项目类别:
    Discovery Grants Program - Individual
Bioinformatics algorithms
生物信息学算法
  • 批准号:
    CRC-2017-00215
  • 财政年份:
    2020
  • 资助金额:
    $ 29.87万
  • 项目类别:
    Canada Research Chairs
Bioinformatics algorithms
生物信息学算法
  • 批准号:
    CRC-2017-00215
  • 财政年份:
    2019
  • 资助金额:
    $ 29.87万
  • 项目类别:
    Canada Research Chairs
Bioinformatics Algorithms and Software for Proteomics
蛋白质组学生物信息学算法和软件
  • 批准号:
    RGPIN-2016-03998
  • 财政年份:
    2019
  • 资助金额:
    $ 29.87万
  • 项目类别:
    Discovery Grants Program - Individual
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了