Computational Methods for Microbial and Microbiome Sequence Analysis

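Basic Information

  • Grant number:
    10550160
  • Principal investigator:
  • Award amount:
    $403,400
  • Host institution:
  • Host institution country:
    United States
  • Project type:
  • Fiscal year:
    2019
  • Funding country:
    United States
  • Project period:
    2019-02-01 to 2025-01-31
  • Project status:
    Not yet concluded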

Project Abstract

This project will support our work on computational methods for microbial sequence analysis, including gene finding, whole-genome alignment, genome assembly, and metagenomic sequence analysis. Over the years we have developed multiple systems to solve problems in these areas, some of which are very widely used. These tools need continued updates and improvements to keep pace with changes in sequencing technology, changes in experimental design, and the ever-growing number of sequenced genomes.

One of these systems is Glimmer, a computational method for finding genes in bacteria, viruses, archaea, and simple eukaryotes. Glimmer is highly accurate, finding over 99% of the genes in most prokaryotic genomes. It has been used by thousands of scientists around the world and in the majority of published bacterial genome sequencing projects over the past decade. Collectively, the three main publications describing Glimmer have been cited over 4,700 times, including more than 700 citations in 2016-17 alone. Usage of Glimmer has increased in recent years because of the explosion in next-generation sequencing projects, which are particularly cost-effective for bacterial genomes.

A second system, MUMmer, is an efficient whole-genome aligner that is used to compare genomes to one another and to compare genome assemblies to detect changes, both large and small. MUMmer and its components, especially Nucmer, have been widely used and incorporated into other systems, including multi-genome aligners and several genome assembly packages. The three main publications describing MUMmer have been cited over 3,600 times, including more than 750 citations in 2016-17.

In recent years we have focused our efforts on developing methods for the analysis of metagenomics data, producing several newer tools, including Kraken and Centrifuge. Both of these systems attempt to assign a species identifier to every read in a metagenomics data set. Because the Kraken algorithm is not only accurate but far faster than earlier methods, it was rapidly adopted by many labs soon after its release, and its usage continues to grow. The even newer and more space-efficient Centrifuge system has also been highly successful and was recently incorporated into the analysis package of one of the new third-generation sequencing companies. We continue to work on improving the performance of both algorithms, and this project will allow us to extend them to handle the newest long-read data, which are increasingly being used for metagenomics experiments.

Finally, a new direction of the lab is the use of metagenomic shotgun sequencing to diagnose infections, for which we are not only modifying our algorithms but also building customized genome databases in which we rigorously screen the genomes to identify and remove contaminants and low-complexity sequences that create false positives. As we have done for many years, we will release all of the software and data generated by this project for free under an open source license, allowing other scientists to use, modify, and redistribute them without restrictions of any kind.
Project Summary

Project Outputs

Journal articles (47)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
3D-Beacons: decreasing the gap between protein sequences and structures through a federated network of protein structure data resources.
  • DOI:
    10.1093/gigascience/giac118
  • Publication date:
    2022-11-30
  • Journal:
  • Impact factor:
    9.2
  • Authors:
  • Corresponding author:
JASPER: A fast genome polishing tool that improves accuracy of genome assemblies.
  • DOI:
    10.1371/journal.pcbi.1011032
  • Publication date:
    2023-03
  • Journal:
  • Impact factor:
    4.3
  • Authors:
  • Corresponding author:
Rapidly fatal infection with Bacillus cereus/thuringiensis: genome assembly of the responsible pathogen and consideration of possibly contributing toxins.
  • DOI:
    10.1016/j.diagmicrobio.2021.115534
  • Publication date:
    2021-12
  • Journal:
  • Impact factor:
    2.9
  • Authors:
    Butcher, Monica;Puiu, Daniela;Romagnoli, Mark;Carroll, Karen C.;Salzberg, Steven L.;Nauen, David W.
  • Corresponding author:
    Nauen, David W.
CHESS 3: an improved, comprehensive catalog of human genes and transcripts based on large-scale expression data, phylogenetic analysis, and protein structure.
  • DOI:
    10.1186/s13059-023-03088-4
  • Publication date:
    2023-10-30
  • Journal:
  • Impact factor:
    12.3
  • Authors:
    Varabyou, Ales;Sommer, Markus J;Erdogdu, Beril;Shinder, Ida;Minkin, Ilia;Chao, Kuan-Hao;Park, Sukhwan;Heinz, Jakob;Pockrandt, Christopher;Shumate, Alaina;Rincon, Natalia;Puiu, Daniela;Steinegger, Martin;Salzberg, Steven L;Pertea, Mihaela
  • Corresponding author:
    Pertea, Mihaela
Releasing the Kraken.
  • DOI:
    10.3389/fbinf.2021.808003
  • Publication date:
    2021
  • Journal:
  • Impact factor:
    0
  • Authors:
    Salzberg, Steven L;Wood, Derrick E
  • Corresponding author:
    Wood, Derrick E

Other publications by Steven L. Salzberg

The 15th Genomic Standards Consortium meeting
  • DOI:
    10.4056/sigs.3457
  • Publication date:
    2013-01-01
  • Journal:
  • Impact factor:
    5.400
  • Authors:
    Lynn Schriml;Ilene Mizrachi;Peter Sterk;Dawn Field;Lynette Hirschman;Tatiana Tatusova;Susanna Sansone;Jack Gilbert;David Schindel;Neil Davies;Chris Meyer;Folker Meyer;George Garrity;Lita Proctor;M. H. Medema;Yemin Lan;Anna Klindworth;Frank Oliver Glöckner;Tonia Korves;Antonia Gonzalez;Peter Dwayndt;Markus Göker;Anjette Johnston;Evangelos Pafilis;Susanne Schneider;K. Baker;Cynthia Parr;G. Sutton;H. H. Creasy;Nikos Kyrpides;K. Eric Wommack;Patricia L. Whetzel;Daniel Nasko;Hilmar Lapp;Takamoto Fujisawa;Adam M. Phillippy;Renzo Kottman;Judith A. Blake;Junhua Li;Elizabeth M. Glass;Petra ten Hoopen;Rob Knight;Susan Holmes;Curtis Huttenhower;Steven L. Salzberg;Bing Ma;Owen White
  • Corresponding author:
    Owen White
C4.5: Programs for Machine Learning by J. Ross Quinlan. Morgan Kaufmann Publishers, Inc., 1993
  • DOI:
    10.1007/bf00993309
  • Publication date:
    1994-09-01
  • Journal:
  • Impact factor:
    2.900
  • Authors:
    Steven L. Salzberg
  • Corresponding author:
    Steven L. Salzberg
Reply to Austin and Korem, “Compositional transformations can reasonably introduce phenotype-associated values into sparse features”
  • DOI:
    10.1128/msystems.00248-25
  • Publication date:
    2025-04-30
  • Journal:
  • Impact factor:
    4.600
  • Authors:
    Steven L. Salzberg
  • Corresponding author:
    Steven L. Salzberg
Yeast rises again
  • DOI:
    10.1038/423233a
  • Publication date:
    2003-05-15
  • Journal:
  • Impact factor:
    48.500
  • Authors:
    Steven L. Salzberg
  • Corresponding author:
    Steven L. Salzberg
QUALITY ASSESSMENT OF SPLICE SITE ANNOTATION BASED ON CONSERVATION ACROSS MULTIPLE SPECIES
  • DOI:
  • Publication date:
  • Journal:
  • Impact factor:
    0
  • Authors:
    Ilia Minkin;Steven L. Salzberg
  • Corresponding author:
    Steven L. Salzberg

Other grants held by Steven L. Salzberg

Comprehensive Human Expressed Sequences in Brain (CHESS-BRAIN) and their roles in neuropsychiatric illness
  • Grant number:
    10541887
  • Fiscal year:
    2021
  • Award amount:
    $403,400
  • Project type:
Comprehensive Human Expressed Sequences in Brain (CHESS-BRAIN) and their roles in neuropsychiatric illness
  • Grant number:
    10362615
  • Fiscal year:
    2021
  • Award amount:
    $403,400
  • Project type:
Comprehensive Human Expressed Sequences in Brain (CHESS-BRAIN) and their roles in neuropsychiatric illness
  • Grant number:
    10205617
  • Fiscal year:
    2021
  • Award amount:
    $403,400
  • Project type:
Computational Methods for Microbial and Microbiome Sequence Analysis
  • Grant number:
    10331733
  • Fiscal year:
    2019
  • Award amount:
    $403,400
  • Project type:
Computational Methods for Microbial and Microbiome Sequence Analysis
  • Grant number:
    10083744
  • Fiscal year:
    2019
  • Award amount:
    $403,400
  • Project type:
The Terabase Search Engine
  • Grant number:
    8882493
  • Fiscal year:
    2014
  • Award amount:
    $403,400
  • Project type:
The Terabase Search Engine
  • Grant number:
    8688406
  • Fiscal year:
    2014
  • Award amount:
    $403,400
  • Project type:
Computational Gene Modeling and Genome Sequence Assembly
  • Grant number:
    8329127
  • Fiscal year:
    2011
  • Award amount:
    $403,400
  • Project type:
Alignment Software for Second-Generation Sequencing
  • Grant number:
    8068060
  • Fiscal year:
    2011
  • Award amount:
    $403,400
  • Project type:
Alignment Software for Second-Generation Sequencing
  • Grant number:
    8464182
  • Fiscal year:
    2011
  • Award amount:
    $403,400
  • Project type:

Similar overseas grants

How novices write code: discovering best practices and how they can be adopted
  • Grant number:
    2315783
  • Fiscal year:
    2023
  • Award amount:
    $403,400
  • Project type:
    Standard Grant
One or Several Mothers: The Adopted Child as Critical and Clinical Subject
  • Grant number:
    2719534
  • Fiscal year:
    2022
  • Award amount:
    $403,400
  • Project type:
    Studentship
A comparative study of disabled children and their adopted maternal figures in French and English Romantic Literature
  • Grant number:
    2633211
  • Fiscal year:
    2020
  • Award amount:
    $403,400
  • Project type:
    Studentship
A material investigation of the ceramic shards excavated from the Omuro Ninsei kiln site: Production techniques adopted by Nonomura Ninsei.
  • Grant number:
    20K01113
  • Fiscal year:
    2020
  • Award amount:
    $403,400
  • Project type:
    Grant-in-Aid for Scientific Research (C)
A comparative study of disabled children and their adopted maternal figures in French and English Romantic Literature
  • Grant number:
    2436895
  • Fiscal year:
    2020
  • Award amount:
    $403,400
  • Project type:
    Studentship
A comparative study of disabled children and their adopted maternal figures in French and English Romantic Literature
  • Grant number:
    2633207
  • Fiscal year:
    2020
  • Award amount:
    $403,400
  • Project type:
    Studentship
The limits of development: State structural policy, comparing systems adopted in two European mountain regions (1945-1989)
  • Grant number:
    426559561
  • Fiscal year:
    2019
  • Award amount:
    $403,400
  • Project type:
    Research Grants
Securing a Sense of Safety for Adopted Children in Middle Childhood
  • Grant number:
    2236701
  • Fiscal year:
    2019
  • Award amount:
    $403,400
  • Project type:
    Studentship
A Study on Mutual Funds Adopted for Individual Defined Contribution Pension Plans
  • Grant number:
    19K01745
  • Fiscal year:
    2019
  • Award amount:
    $403,400
  • Project type:
    Grant-in-Aid for Scientific Research (C)
Structural and functional analyses of a bacterial protein translocation domain that has adopted diverse pathogenic effector functions within host cells
  • Grant number:
    415543446
  • Fiscal year:
    2019
  • Award amount:
    $403,400
  • Project type:
    Research Fellowships