Data Discovery: Computational Methods for Searching Short-Read Sequencing Experiments - Administrative Supplement

数据发现:搜索短读测序实验的计算方法 - 行政补充

基本信息

  • 批准号:
    10393953
  • 负责人:
  • 金额:
    $ 0.82万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
  • 财政年份:
    2017
  • 资助国家:
    美国
  • 起止时间:
    2017-05-01 至 2022-04-30
  • 项目状态:
    已结题

项目摘要

PROJECT SUMMARY / ABSTRACT This proposal aims to solve the sequencing experiment discovery problem. The data from hundreds of thou- sands of short-read sequencing experiments are now publicly available, and private collections of sequencing experiments are also growing rapidly. These experiments include hundreds of thousands of whole genome sequencing experiments, and tens of thousands of RNA-seq, metagenomic, and tumor sequencing samples. However, these experiments are vastly underused, with few analyses making use of more than a handful of ex- periments at a time and most analyses ignoring this collection of raw data entirely. One crucial reason for this is that merely finding the appropriate experiments is a significant barrier to their use in downstream analyses. This is due to the lack of a computational platform that can search for relevant short-read sequencing data sets by the sequences they contain. It is not currently possible to find all the metagenomic experiments in which the genes that form a particular pathway are present or to find all experiments in which a novel lncRNA is observed. The experiment discovery problem is that of finding — on a global scale — those experiments that are relevant to an isoform, variant, or species under study. By building on our existing work in large-scale sequence search, we propose to develop a new distributed platform to index and search hundreds of thousands of raw short-read se- quencing data sets to enable researchers to quickly find experiments that contain their query sequences. We will apply this system to searching RNA-seq, metagenomic, and cancer tumor samples. The research questions we will solve include how to improve the computational scaling, increase the types of biologically meaningful queries that can be answered, and increase our ability to find relevant experiments in situations where muta- tions are common. We will produce a high-quality open-source implementation of the developed computational methods. The project will significantly expand the usefulness of large repositories of raw sequencing reads and enabled new approaches for large-scale reanalysis and reuse of short-read experiments. The system will unlock a rich source of biological information for gene function prediction, for understanding microbial communities, and for connecting genetic variation with disease progression.
项目总结/摘要 该方案旨在解决测序实验发现问题。数据来源于数十万- 大量的短读测序实验现在是公开的, 实验也在迅速增长。这些实验包括数十万个全基因组 测序实验,以及数以万计的RNA-seq、宏基因组和肿瘤测序样本。 然而,这些实验被大大低估,很少有分析使用超过少数的前- 大多数分析完全忽略了原始数据的收集。其中一个关键原因是 仅仅找到合适的实验是将其用于下游分析的重大障碍。这 这是由于缺乏可以通过测序技术搜索相关短读测序数据集的计算平台。 它们包含的序列。目前还不可能找到所有的宏基因组实验,其中基因 形成一个特定的途径,或找到所有实验中观察到一种新的lncRNA。的 实验发现问题是指在全球范围内发现那些与一个实验相关的实验。 同种型、变体或研究中的种。通过建立在我们现有的大规模序列搜索工作,我们 建议开发一个新的分布式平台来索引和搜索数十万个原始的短读序列, 对数据集进行测序,使研究人员能够快速找到包含查询序列的实验。我们将 将该系统应用于搜索RNA-seq、宏基因组和癌症肿瘤样本。研究问题 我们将解决的问题包括如何提高计算规模,增加生物学意义的类型, 可以回答的查询,并提高我们在穆塔的情况下找到相关实验的能力- 这些都是常见的。我们将产生一个高质量的开源实现开发的计算 方法.该项目将显著扩大原始测序读数的大型储存库的有用性, 为大规模再分析和重复使用短读实验提供了新的方法。系统会解锁 一个丰富的生物信息来源,用于基因功能预测,用于了解微生物群落, 将遗传变异与疾病进展联系起来

项目成果

期刊论文数量(20)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Maximum likelihood reconstruction of ancestral networks by integer linear programming
  • DOI:
    10.1093/bioinformatics/btaa931
  • 发表时间:
    2021-04-15
  • 期刊:
  • 影响因子:
    5.8
  • 作者:
    Rajan,Vaibhav;Zhang,Ziqi;Zhang,Xiuwei
  • 通讯作者:
    Zhang,Xiuwei
An Efficient, Scalable, and Exact Representation of High-Dimensional Color Information Enabled Using de Bruijn Graph Search
  • DOI:
    10.1089/cmb.2019.0322
  • 发表时间:
    2020-03-16
  • 期刊:
  • 影响因子:
    1.7
  • 作者:
    Almodaresi, Fatemeh;Pandey, Prashant;Patro, Rob
  • 通讯作者:
    Patro, Rob
Accurate assembly of multi-end RNA-seq data with Scallop2
  • DOI:
    10.1038/s43588-022-00216-1
  • 发表时间:
    2022-03
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Qimin Zhang;Qian Shi;Mingfu Shao
  • 通讯作者:
    Qimin Zhang;Qian Shi;Mingfu Shao
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Carleton Lee Kingsford其他文献

Carleton Lee Kingsford的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Carleton Lee Kingsford', 18)}}的其他基金

Improved genomic sketching for MUMmer and metagenomics
改进了 MUMmer 和宏基因组的基因组草图
  • 批准号:
    10453031
  • 财政年份:
    2022
  • 资助金额:
    $ 0.82万
  • 项目类别:
Improved genomic sketching for MUMmer and metagenomics
改进了 MUMmer 和宏基因组的基因组草图
  • 批准号:
    10670162
  • 财政年份:
    2022
  • 资助金额:
    $ 0.82万
  • 项目类别:
Data Discovery: Computational Methods for Searching Short-Read Sequencing Experiments
数据发现:搜索短读测序实验的计算方法
  • 批准号:
    9287168
  • 财政年份:
    2017
  • 资助金额:
    $ 0.82万
  • 项目类别:
Algorithms for Managing Uncertainty in Chromosome Conformation Capture Data
管理染色体构象捕获数据不确定性的算法
  • 批准号:
    8739540
  • 财政年份:
    2013
  • 资助金额:
    $ 0.82万
  • 项目类别:
Algorithms for Managing Uncertainty in Chromosome Conformation Capture Data
管理染色体构象捕获数据不确定性的算法
  • 批准号:
    8579049
  • 财政年份:
    2013
  • 资助金额:
    $ 0.82万
  • 项目类别:
Fast k-mer Counting to Quantify Gene Expression and Improve Genome Assembly
快速 k-mer 计数可量化基因表达并改善基因组组装
  • 批准号:
    8642468
  • 财政年份:
    2012
  • 资助金额:
    $ 0.82万
  • 项目类别:
Fast k-mer Counting to Quantify Gene Expression and Improve Genome Assembly
快速 k-mer 计数可量化基因表达并改善基因组组装
  • 批准号:
    8518438
  • 财政年份:
    2012
  • 资助金额:
    $ 0.82万
  • 项目类别:
Accurate Computational Detection of Influenza Reassortments
流感重组的准确计算检测
  • 批准号:
    8072578
  • 财政年份:
    2010
  • 资助金额:
    $ 0.82万
  • 项目类别:
Accurate Computational Detection of Influenza Reassortments
流感重组的准确计算检测
  • 批准号:
    7772829
  • 财政年份:
    2010
  • 资助金额:
    $ 0.82万
  • 项目类别:

相似海外基金

HNDS-R: Connectivity, Inclusiveness, and the Permeability of Basic Science
HNDS-R:基础科学的连通性、包容性和渗透性
  • 批准号:
    2318404
  • 财政年份:
    2023
  • 资助金额:
    $ 0.82万
  • 项目类别:
    Standard Grant
Advancing the basic science of membrane permeability in macrocyclic peptides
推进大环肽膜渗透性的基础科学
  • 批准号:
    10552484
  • 财政年份:
    2023
  • 资助金额:
    $ 0.82万
  • 项目类别:
Computer Vision for Malaria Microscopy: Automated Detection and Classification of Plasmodium for Basic Science and Pre-Clinical Applications
用于疟疾显微镜的计算机视觉:用于基础科学和临床前应用的疟原虫自动检测和分类
  • 批准号:
    10576701
  • 财政年份:
    2023
  • 资助金额:
    $ 0.82万
  • 项目类别:
Bringing together communities and basic science researchers to build stronger relationships
将社区和基础科学研究人员聚集在一起,建立更牢固的关系
  • 批准号:
    480914
  • 财政年份:
    2023
  • 资助金额:
    $ 0.82万
  • 项目类别:
    Miscellaneous Programs
“L-form” bacteria: basic science, antibiotics, evolution and biotechnology
L 型细菌:基础科学、抗生素、进化和生物技术
  • 批准号:
    FL210100071
  • 财政年份:
    2022
  • 资助金额:
    $ 0.82万
  • 项目类别:
    Australian Laureate Fellowships
Coordinating and Data Management Center for Translational and Basic Science Research in Early Lesions
早期病变转化和基础科学研究协调和数据管理中心
  • 批准号:
    10517004
  • 财政年份:
    2022
  • 资助金额:
    $ 0.82万
  • 项目类别:
Developing science communication on large scale basic science represented by accelerator science
发展以加速器科学为代表的大规模基础科学科学传播
  • 批准号:
    22K02974
  • 财政年份:
    2022
  • 资助金额:
    $ 0.82万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
UCSF - UCB TRAC Basic Science CORE
UCSF - UCB TRAC 基础科学核心
  • 批准号:
    10674711
  • 财政年份:
    2022
  • 资助金额:
    $ 0.82万
  • 项目类别:
Basic Science Core - Imaging
基础科学核心 - 成像
  • 批准号:
    10588228
  • 财政年份:
    2022
  • 资助金额:
    $ 0.82万
  • 项目类别:
Basic Science Core - Biosafety & Biocontainment Core (BBC)
基础科学核心 - 生物安全
  • 批准号:
    10431468
  • 财政年份:
    2022
  • 资助金额:
    $ 0.82万
  • 项目类别:
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了