Methods for sequencing data analysis and archive-scale data science
排序数据分析和档案规模数据科学的方法
基本信息
- 批准号:10322369
- 负责人:
- 金额:$ 51.41万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2021
- 资助国家:美国
- 起止时间:2021-01-01 至 2025-12-31
- 项目状态:未结题
- 来源:
- 关键词:ArchivesBiologicalBiological AssayBiologyCatalogsClassificationCollectionComputer softwareDNADNA sequencingDataData AnalysesData ScienceData SetDiseaseInfrastructureInvestigationMetadataMetagenomicsMethodsPropertyRNAReportingResearchResearch DesignResearch PersonnelScientistSystemWorkarchive dataarchived datadata archivegenomic toolsimprovedindexinginsightsearch enginetool
项目摘要
PROJECT SUMMARY
We will develop methods and maintain software that make it radically easier for biomedical researchers to
use and understand sequencing data. The project will support our maintaining and improving our popular
“upstream” tools for analyzing sequencing data. These include the Bowtie and Bowtie 2 tools for read
alignment, the Kraken 2 tool for metagenomics classification and the Dashing tool for genomic sketching
and comparison. We will also develop new systems that allow researchers to use these same core tools
(Bowtie, Kraken 2, Dashing) to rapidly discover and vet archived datasets. We will enable researchers to
quickly ascertain whether a dataset is of high quality, what species are present, whether contaminants
are present, what assay was performed, what datasets are similar to each other, and what datasets are
inconsistent with annotated metadata. In this way, researchers can distill relevant archived datasets, those
having the expected biological properties, in a way that does not hinge on the accuracy of the associated
metadata. Finally, we will work to develop new infrastructure for large-scale reanalysis and indexing of
archived data, ultimately yielding new “search engines” for scientific question-answering. In particular,
we will extend our past work on the Rail-RNA, recount2 and Snaptron so that we can more effectively
analyze huge collections of archived data, converting them into a variety of useful summary forms, and
than adding a layer of indexing so that users can query the summaries in the context of a scientific
investigation. We will also create new catalogs and mechanisms whereby researchers can share their
archive-assisted study designs, so that useful combinations of archived datasets, and insights into where
their metadata might be incorrect or incomplete, can be reported and shared.
项目总结
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Benjamin Thomas Langmead其他文献
Benjamin Thomas Langmead的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Benjamin Thomas Langmead', 18)}}的其他基金
Methods for sequencing data analysis and archive-scale data science
排序数据分析和档案规模数据科学的方法
- 批准号:
10548746 - 财政年份:2021
- 资助金额:
$ 51.41万 - 项目类别:
Personal and panel references for improved alignment
用于改进对齐的个人和面板参考
- 批准号:
10242948 - 财政年份:2020
- 资助金额:
$ 51.41万 - 项目类别:
Personal and panel references for improved alignment
用于改进对齐的个人和面板参考
- 批准号:
10057490 - 财政年份:2020
- 资助金额:
$ 51.41万 - 项目类别:
Personal and panel references for improved alignment
用于改进对齐的个人和面板参考
- 批准号:
10655473 - 财政年份:2020
- 资助金额:
$ 51.41万 - 项目类别:
Personal and panel references for improved alignment
用于改进对齐的个人和面板参考
- 批准号:
10443815 - 财政年份:2020
- 资助金额:
$ 51.41万 - 项目类别:
相似海外基金
NSF/BIO-DFG: Biological Fe-S intermediates in the synthesis of nitrogenase metalloclusters
NSF/BIO-DFG:固氮酶金属簇合成中的生物 Fe-S 中间体
- 批准号:
2335999 - 财政年份:2024
- 资助金额:
$ 51.41万 - 项目类别:
Standard Grant
Collaborative Research: Conference: Large Language Models for Biological Discoveries (LLMs4Bio)
合作研究:会议:生物发现的大型语言模型 (LLMs4Bio)
- 批准号:
2411529 - 财政年份:2024
- 资助金额:
$ 51.41万 - 项目类别:
Standard Grant
Collaborative Research: Conference: Large Language Models for Biological Discoveries (LLMs4Bio)
合作研究:会议:生物发现的大型语言模型 (LLMs4Bio)
- 批准号:
2411530 - 财政年份:2024
- 资助金额:
$ 51.41万 - 项目类别:
Standard Grant
Collaborative Research: NSF-ANR MCB/PHY: Probing Heterogeneity of Biological Systems by Force Spectroscopy
合作研究:NSF-ANR MCB/PHY:通过力谱探测生物系统的异质性
- 批准号:
2412551 - 财政年份:2024
- 资助金额:
$ 51.41万 - 项目类别:
Standard Grant
Elucidating mechanisms of biological hydrogen conversion through model metalloenzymes
通过模型金属酶阐明生物氢转化机制
- 批准号:
2419343 - 财政年份:2024
- 资助金额:
$ 51.41万 - 项目类别:
Standard Grant
Collaborative Research: The Interplay of Water Condensation and Fungal Growth on Biological Surfaces
合作研究:水凝结与生物表面真菌生长的相互作用
- 批准号:
2401507 - 财政年份:2024
- 资助金额:
$ 51.41万 - 项目类别:
Standard Grant
DESIGN: Driving Culture Change in a Federation of Biological Societies via Cohort-Based Early-Career Leaders
设计:通过基于队列的早期职业领袖推动生物协会联盟的文化变革
- 批准号:
2334679 - 财政年份:2024
- 资助金额:
$ 51.41万 - 项目类别:
Standard Grant
REU Site: Modeling the Dynamics of Biological Systems
REU 网站:生物系统动力学建模
- 批准号:
2243955 - 财政年份:2024
- 资助金额:
$ 51.41万 - 项目类别:
Standard Grant
Defining the biological boundaries to sustain extant life on Mars
定义维持火星现存生命的生物边界
- 批准号:
DP240102658 - 财政年份:2024
- 资助金额:
$ 51.41万 - 项目类别:
Discovery Projects
Advanced Multiscale Biological Imaging using European Infrastructures
利用欧洲基础设施进行先进的多尺度生物成像
- 批准号:
EP/Y036654/1 - 财政年份:2024
- 资助金额:
$ 51.41万 - 项目类别:
Research Grant