Context-Sensitive Search of Human Expression Compendia

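Basic Information

  • Grant number:
    8464761
  • Principal investigator:
  • Amount:
    $365,200
  • Host institution:
  • Host institution country:
    United States
  • Project category:
  • Fiscal year:
    2011
  • Funding country:
    United States
  • Project period:
    2011-06-28 to 2015-04-30
  • Project status:
    Completed

Project Abstract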

DESCRIPTION (provided by applicant): Gene expression experiments are an abundant and robust source of functional genomics data, with thousands of microarray studies and a growing number of high-throughput RNA sequencing studies publicly available, most interrogating clinical and biological systems relevant to disease. They hold the promise of data-driven characterization of gene function and regulation, including in specific tissues, cell lines, and disease states, and can advance the understanding and modeling of regulatory changes that form the basis of human disease. However, these data remain largely underutilized, as biology researchers do not have effective tools to explore and analyze the entire data collection to generate novel hypotheses and direct experiments. The situation is similar to that of the Internet before search engines: a biology researcher has to know a priori which datasets pertain to the biological question she is asking, reflect the tissue/cell-lineage specific signals of interest to her, and accurately measure the expression of genes related to her pathways of interest. There is a clear need for methods that will enable biology researchers to use their domain-specific knowledge to direct their exploration of public human expression data, enabling them to generate hypotheses and direct experiments addressing challenging biomedical questions. Such a system should provide users with the ability to effectively explore automatically identified datasets relevant to their biological question of interest, leverage metazoan complexity including cell-lineage and disease-specific signals, and allow the researcher to securely include their unpublished data in the analysis. To address these challenges, this proposal describes a "Google-style" public search engine for large collections of gene expression data built using novel search algorithms and leveraging cloud-computing technologies.
This system implements a novel query-based context-sensitive algorithm for searching large expression compendia that exploits the complexity of metazoan organisms, including cell-lineage complexity and disease aspects inherent to human expression studies. Furthermore, the challenge of heterogeneity in human samples will be addressed by developing novel hierarchical learning methods to predict cell-lineage or tissue-specific gene expression based on the compendium and to identify these signals in each dataset. This will enable users to explore tissue-specific expression and also will be integrated with the search algorithm to improve search accuracy. The proposed algorithms, search engine, and user interface will be extensively evaluated in close collaboration with biology researchers, and top predictions will be tested experimentally. These methods will be implemented in a user-friendly public search system that will leverage cloud computing to provide robust interactive query response and will enable biology researchers to explore both published data collections and their own pre-publication datasets in a context-specific, integrated, and secure manner.

Project Outcomes

Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)

Other publications by OLGA G TROYANSKAYA

Other grants by OLGA G TROYANSKAYA

Context-Sensitive Search of Human Expression Compendia
  • Grant number:
    8290295
  • Fiscal year:
    2011
  • Funding amount:
    $365,200
  • Project category:
Context-Sensitive Search of Human Expression Compendia
  • Grant number:
    8024978
  • Fiscal year:
    2011
  • Funding amount:
    $365,200
  • Project category:
Integration and Visualization of Diverse Biological Data
  • Grant number:
    10393642
  • Fiscal year:
    2005
  • Funding amount:
    $365,200
  • Project category:
Integration and Visualization of Diverse Biological Data
  • Grant number:
    7036576
  • Fiscal year:
    2005
  • Funding amount:
    $365,200
  • Project category:
Integration and visualization of diverse biological data
  • Grant number:
    8041717
  • Fiscal year:
    2005
  • Funding amount:
    $365,200
  • Project category:
Integration and visualization of diverse biological data
  • Grant number:
    8209212
  • Fiscal year:
    2005
  • Funding amount:
    $365,200
  • Project category:
Integration and Visualization of Diverse Biological Data
  • Grant number:
    9266422
  • Fiscal year:
    2005
  • Funding amount:
    $365,200
  • Project category:
Integration and Visualization of Diverse Biological Data
  • Grant number:
    7404447
  • Fiscal year:
    2005
  • Funding amount:
    $365,200
  • Project category:
Integration and visualization of diverse biological data
  • Grant number:
    8601095
  • Fiscal year:
    2005
  • Funding amount:
    $365,200
  • Project category:
Integration and Visualization of Diverse Biological Data
  • Grant number:
    9902503
  • Fiscal year:
    2005
  • Funding amount:
    $365,200
  • Project category:

Similar Overseas Grants

Reconstruction algorithms for time-domain diffuse optical tomography imaging of small animals
  • Grant number:
    RGPIN-2015-05926
  • Fiscal year:
    2019
  • Funding amount:
    $365,200
  • Project category:
    Discovery Grants Program - Individual
Reconstruction algorithms for time-domain diffuse optical tomography imaging of small animals
  • Grant number:
    RGPIN-2015-05926
  • Fiscal year:
    2018
  • Funding amount:
    $365,200
  • Project category:
    Discovery Grants Program - Individual
Reconstruction algorithms for time-domain diffuse optical tomography imaging of small animals
  • Grant number:
    RGPIN-2015-05926
  • Fiscal year:
    2017
  • Funding amount:
    $365,200
  • Project category:
    Discovery Grants Program - Individual
Reconstruction algorithms for time-domain diffuse optical tomography imaging of small animals
  • Grant number:
    RGPIN-2015-05926
  • Fiscal year:
    2016
  • Funding amount:
    $365,200
  • Project category:
    Discovery Grants Program - Individual
Event detection algorithms in decision support for animals health surveillance
  • Grant number:
    385453-2009
  • Fiscal year:
    2015
  • Funding amount:
    $365,200
  • Project category:
    Collaborative Research and Development Grants
Algorithms to generate designs of potency experiments that use far fewer animals
  • Grant number:
    8810865
  • Fiscal year:
    2015
  • Funding amount:
    $365,200
  • Project category:
Reconstruction algorithms for time-domain diffuse optical tomography imaging of small animals
  • Grant number:
    RGPIN-2015-05926
  • Fiscal year:
    2015
  • Funding amount:
    $365,200
  • Project category:
    Discovery Grants Program - Individual
Event detection algorithms in decision support for animals health surveillance
  • Grant number:
    385453-2009
  • Fiscal year:
    2013
  • Funding amount:
    $365,200
  • Project category:
    Collaborative Research and Development Grants
Development of population-level algorithms for modelling genomic variation and its impact on cellular function in animals and plants
  • Grant number:
    FT110100972
  • Fiscal year:
    2012
  • Funding amount:
    $365,200
  • Project category:
    ARC Future Fellowships
Advanced computational algorithms for brain imaging studies of freely moving animals
  • Grant number:
    DP120103813
  • Fiscal year:
    2012
  • Funding amount:
    $365,200
  • Project category:
    Discovery Projects