Computational Tools for Mining Large Amounts of ChIP and Gene Expression Data

用于挖掘大量 ChIP 和基因表达数据的计算工具

基本信息

  • 批准号:
    8856618
  • 负责人:
  • 金额:
    $ 39.38万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
  • 财政年份:
    2012
  • 资助国家:
    美国
  • 起止时间:
    2012-07-25 至 2016-04-30
  • 项目状态:
    已结题

项目摘要

DESCRIPTION (provided by applicant): ChIP-seq and ChIP-chip, hereinafter referred to as ChIPx, are powerful technologies to map genome-wide protein-DNA interactions (PDIs). Microarray, exon array and RNA-seq, on the other hand, are widely used to measure gene expression. Integrating ChIPx and gene expression data provides a powerful approach to study gene regulation both during development and in diseases. Traditionally, ChIPx and gene expression experiments conducted by a single laboratory are mainly used to study a specific biological system. The collective efforts of many labs have resulted in a large volume of data representing diverse biological systems. Jointly, these data contain enormous amounts of information that have not been fully utilized by each individual lab. This proposal aims to develop a coordinated set of computational, statistical and software tools to allow scientists to synthesize information in 3000+ publicly available ChIPx samples and 60,000+ gene expression profiles in human and mouse to make new discoveries. The project will turn these heterogeneous data into a tool for high-throughput discovery of biological contexts (i.e., cell types, tissues and diseases) associated with gene regulatory pathway activities. First, a statistical method named Gene Set Context Analysis (GSCA) will be developed. GSCA utilizes large amounts of public gene expression data to infer biological contexts and diseases in which one or more gene sets (i.e., groups of genes) are coordinately activated or inactivated. Second, based on the GSCA, a method called Transcription Factor Context Analysis (TFCA) will be developed. TFCA discovers novel functional contexts of transcription factors (TFs) and gene regulatory pathways. This method first classifies target genes of a TF into different functional categories by integrating one's own ChIPx and gene expression data with public ChIPx and Gene Ontology data. It then uses GSCA to systematically discover biological contexts (including diseases) associated with the function of each category. Collectively, GSCA and TFCA will establish a new paradigm for analyzing ChIPx and gene expression data. The conventional approach analyzes data tied to a particular system. In the new approach, one also leverages the rich information in public ChIPx and gene expression data to extend findings in one system to other biological systems. By allowing one to make novel discoveries beyond the scope of the original experiments and connect gene regulatory pathways to diseases, the new approach will significantly increase the value of both new and existing data. Applying GSCA and TFCA, 3000+ ChIPx samples and 60,000+ gene expression samples in human and mouse will be analyzed together to systematically map TF functions and ChIPx defined regulatory pathway activ- ities to diseases. Some new predictions will be validated experimentally. In addition to creating new knowledge about a variety of diseases, this research will provide urgently needed data integration and data mining tools to help scientists to translate the rich information in the publicly available ChIPx and gene expression data into new discoveries, and identify promising new areas of biomedical research.
描述(申请人提供):CHIP-SEQ和CHIP-CHIP,以下简称ChIPx,是绘制全基因组蛋白质-DNA相互作用(PDI)的强大技术。另一方面,微阵列、外显子阵列和RNA-SEQ被广泛用于基因表达的检测。整合ChIPx和基因表达数据为研究发育和疾病中的基因调控提供了一种强有力的方法。传统上,ChIPx和基因表达实验由单个实验室进行,主要用于研究特定的生物系统。许多实验室的集体努力已经产生了代表不同生物系统的大量数据。总而言之,这些数据包含了大量的信息,每个实验室都没有充分利用这些信息。这项提议旨在开发一套协调的计算、统计和软件工具,使科学家能够在3000多个公开可用的ChIPx样本和60,000多个人类和老鼠的基因表达谱中合成信息,以做出新的发现。该项目将把这些异质数据转化为高通量发现与基因调控途径活动相关的生物学背景(即细胞类型、组织和疾病)的工具。首先,将开发一种名为基因集上下文分析(GSCA)的统计方法。GSCA利用大量公开的基因表达数据来推断生物学背景和疾病,在这些疾病中,一个或多个基因集(即,基因组)被协同激活或失活。其次,将在GSCA的基础上开发一种称为转录因子上下文分析(TFCA)的方法。TFCA发现了转录因子(TF)和基因调控途径的新功能背景。该方法首先通过将自己的ChIPx和基因表达数据与公开的ChIPx和基因本体论数据相结合,将转铁蛋白的目标基因分类为不同的功能类别。然后,它使用GSCA系统地发现与每个类别的功能相关的生物背景(包括疾病)。总而言之,GSCA和TFCA将建立一个分析ChIPx和基因表达数据的新范式。传统方法分析绑定到特定系统的数据。在新的方法中,人们还利用公共ChIPx和基因表达数据中的丰富信息来将一个系统中的发现扩展到其他生物系统。通过允许人们做出超出原始实验范围的新发现,并将基因调控途径与疾病联系起来,新方法将显著增加新数据和现有数据的价值。应用GSCA和TFCA,我们将对3000多个ChIPx样本和6万多个人和小鼠的基因表达样本进行分析,以系统地定位Tf的功能,并确定ChIPx对疾病的调控途径活性。一些新的预测将得到实验验证。除了创造关于各种疾病的新知识外,这项研究还将提供迫切需要的数据集成和数据挖掘工具,以帮助科学家将公开可用的ChIPx和基因表达数据中的丰富信息转化为新发现,并确定有前途的生物医学研究的新领域。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Hongkai Ji其他文献

Hongkai Ji的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Hongkai Ji', 18)}}的其他基金

Immune Development Across the Life Course: Integrating Exposures and Multi-Omics in the Boston Birth Cohort
整个生命过程中的免疫发展:在波士顿出生队列中整合暴露和多组学
  • 批准号:
    10418079
  • 财政年份:
    2022
  • 资助金额:
    $ 39.38万
  • 项目类别:
Immune Development Across the Life Course: Integrating Exposures and Multi-Omics in the Boston Birth Cohort
整个生命过程中的免疫发展:在波士顿出生队列中整合暴露和多组学
  • 批准号:
    10704536
  • 财政年份:
    2022
  • 资助金额:
    $ 39.38万
  • 项目类别:
Computational tools for regulome mapping using single-cell genomic data
使用单细胞基因组数据进行调节组图谱的计算工具
  • 批准号:
    10205134
  • 财政年份:
    2019
  • 资助金额:
    $ 39.38万
  • 项目类别:
Computational tools for regulome mapping using single-cell genomic data
使用单细胞基因组数据进行调节组图谱的计算工具
  • 批准号:
    10443743
  • 财政年份:
    2019
  • 资助金额:
    $ 39.38万
  • 项目类别:
Computational tools for regulome mapping using single-cell genomic data
使用单细胞基因组数据进行调节组图谱的计算工具
  • 批准号:
    10001077
  • 财政年份:
    2019
  • 资助金额:
    $ 39.38万
  • 项目类别:
Big Data Methods for Decoding Gene Regulation
解码基因调控的大数据方法
  • 批准号:
    10171879
  • 财政年份:
    2018
  • 资助金额:
    $ 39.38万
  • 项目类别:
Big Data Methods for Decoding Gene Regulation
解码基因调控的大数据方法
  • 批准号:
    9762143
  • 财政年份:
    2018
  • 资助金额:
    $ 39.38万
  • 项目类别:
Computational Tools for Mining Large Amounts of ChIP and Gene Expression Data
用于挖掘大量 ChIP 和基因表达数据的计算工具
  • 批准号:
    8516554
  • 财政年份:
    2012
  • 资助金额:
    $ 39.38万
  • 项目类别:
Computational Tools for Mining Large Amounts of ChIP and Gene Expression Data
用于挖掘大量 ChIP 和基因表达数据的计算工具
  • 批准号:
    8372529
  • 财政年份:
    2012
  • 资助金额:
    $ 39.38万
  • 项目类别:
Statistical and Computational Tools for Next-generation ChIP-seq Applications
用于下一代 ChIP-seq 应用的统计和计算工具
  • 批准号:
    8342445
  • 财政年份:
    2012
  • 资助金额:
    $ 39.38万
  • 项目类别:

相似国自然基金

层出镰刀菌氮代谢调控因子AreA 介导伏马菌素 FB1 生物合成的作用机理
  • 批准号:
    2021JJ40433
  • 批准年份:
    2021
  • 资助金额:
    0.0 万元
  • 项目类别:
    省市级项目
寄主诱导梢腐病菌AreA和CYP51基因沉默增强甘蔗抗病性机制解析
  • 批准号:
    32001603
  • 批准年份:
    2020
  • 资助金额:
    24.0 万元
  • 项目类别:
    青年科学基金项目
AREA国际经济模型的移植.改进和应用
  • 批准号:
    18870435
  • 批准年份:
    1988
  • 资助金额:
    2.0 万元
  • 项目类别:
    面上项目

相似海外基金

Tribal Intertidal Digital Ecological Surveys Project: Using Large-Area Imaging to Assess Intertidal Biological Response to Changing Oceanographic Conditions in Partnership with Indigenous Nations
部落潮间带数字生态调查项目:与土著民族合作,利用大面积成像评估潮间带生物对不断变化的海洋条件的反应
  • 批准号:
    532685-2019
  • 财政年份:
    2022
  • 资助金额:
    $ 39.38万
  • 项目类别:
    Postgraduate Scholarships - Doctoral
Tribal Intertidal Digital Ecological Surveys Project: Using Large-Area Imaging to Assess Intertidal Biological Response to Changing Oceanographic Conditions in Partnership with Indigenous Nations
部落潮间带数字生态调查项目:与土著民族合作,利用大面积成像评估潮间带生物对不断变化的海洋条件的反应
  • 批准号:
    532685-2019
  • 财政年份:
    2020
  • 资助金额:
    $ 39.38万
  • 项目类别:
    Postgraduate Scholarships - Doctoral
biological interactions among forest-dwelling fungus gnats and their natural enemies in shiitake mashroom production area
香菇产区森林真菌蚊与其天敌之间的生物相互作用
  • 批准号:
    19K06152
  • 财政年份:
    2019
  • 资助金额:
    $ 39.38万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Tribal Intertidal Digital Ecological Surveys Project: Using Large-Area Imaging to Assess Intertidal Biological Response to Changing Oceanographic Conditions in Partnership with Indigenous Nations
部落潮间带数字生态调查项目:与土著民族合作,利用大面积成像评估潮间带生物对不断变化的海洋条件的反应
  • 批准号:
    532685-2019
  • 财政年份:
    2019
  • 资助金额:
    $ 39.38万
  • 项目类别:
    Postgraduate Scholarships - Doctoral
To what extent does governance play a role in how effectively a marine protected area in the Irish Sea reaches its biological and socioeconomic goals?
治理在多大程度上对爱尔兰海海洋保护区如何有效实现其生物和社会经济目标发挥作用?
  • 批准号:
    2287487
  • 财政年份:
    2019
  • 资助金额:
    $ 39.38万
  • 项目类别:
    Studentship
War and Biological Ageing in Vietnam: A Planning Grant to Foster Collaboration on a Novel Area of Global Research in Health and Ageing
越南的战争与生物衰老:一项规划拨款,以促进全球健康与老龄化研究新领域的合作
  • 批准号:
    404425
  • 财政年份:
    2019
  • 资助金额:
    $ 39.38万
  • 项目类别:
    Miscellaneous Programs
Impact assessment of Noctiluca scintillans red tide on nutrient dynamics, biological processes in lower trophic levels and material cycle in the neritic area of Sagami Bay
夜光藻赤潮对相模湾浅海区营养动态、低营养层生物过程和物质循环的影响评估
  • 批准号:
    18K05794
  • 财政年份:
    2018
  • 资助金额:
    $ 39.38万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Large-area graphene based chemical and biological sensors
基于大面积石墨烯的化学和生物传感器
  • 批准号:
    355863-2011
  • 财政年份:
    2015
  • 资助金额:
    $ 39.38万
  • 项目类别:
    Discovery Grants Program - Individual
Large-area graphene based chemical and biological sensors
基于大面积石墨烯的化学和生物传感器
  • 批准号:
    355863-2011
  • 财政年份:
    2014
  • 资助金额:
    $ 39.38万
  • 项目类别:
    Discovery Grants Program - Individual
Theoretical simulation and experimental study on biological weathering mechanism of the rock around coastal area in Yaeyama Islands
八重山群岛沿岸岩石生物风化机制的理论模拟与实验研究
  • 批准号:
    26790079
  • 财政年份:
    2014
  • 资助金额:
    $ 39.38万
  • 项目类别:
    Grant-in-Aid for Young Scientists (B)
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了