Statistical Power Calculations for ChIP-seq experiments
ChIP-seq 实验的统计功效计算
基本信息
- 批准号:8284083
- 负责人:
- 金额:$ 18.41万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2012
- 资助国家:美国
- 起止时间:2012-05-01 至 2014-03-31
- 项目状态:已结题
- 来源:
- 关键词:AllelesBase SequenceBindingBinomial ModelBioconductorBiologic CharacteristicBiologicalCellsChIP-seqCommunitiesComputer AnalysisComputer softwareDNADataData AnalysesData SetDatabasesDetectionDevelopmentDiagnosisDiseaseFamilyGene ExpressionGeneticGenomeGenomicsGoalsGuanine + Cytosine CompositionImmune SeraImmunoglobulin GIndividualLeadLettersLocationMapsMethodsModelingPlayPublic HealthReadingResearchResearch PersonnelResourcesRoleSamplingSimulateStagingStatistical ModelsTechnologyTissuesTrainingUnited States National Institutes of HealthValidationVariantbasechromatin immunoprecipitationdesignepigenomicsgenome sequencinggenome wide association studygenome-widehuman diseasenext generationnovelprogramsresearch studysimulationsoftware developmenttooltranscription factor
项目摘要
DESCRIPTION (provided by applicant): The advent of high throughput next generation sequencing (NGS) technologies have revolutionized the fields of genetics and genomics by allowing rapid and inexpensive sequencing of billions of bases. Among the NGS applications, ChIP-seq (chromatin immunoprecipitation followed by NGS) is perhaps the most successful to date. ChIP-seq technology enables investigators to study genome-wide binding of transcription factors and mapping of epigenomic marks. Both of these play crucial roles in programming of cell specific gene expression; therefore their genome-wide mapping can significantly advance our ability to understand and diagnose human diseases. Although basic analysis tools for ChIP-seq data are rapidly increasing, there has not been much progress on the design problems regarding ChIP-seq experiments. A challenging question that the researchers planning a ChIP-seq experiment need to answer is: how deeply should the ChIP and the control samples be sequenced? The answer depends on multiple factors some of which can be set by the experimenter based on pilot/preliminary data. The sequencing depth of a ChIP-seq experiment is one of the key factors that determine whether or not all the underlying targets (e.g., binding locations or epigenomic profiles) can be identified with a targeted power. This is especially important when the goal is the analysis of individual-to-individual and allele specific variation o transcription factor binding and epigenomic profiles. Insufficient sequencing depths may lead to spurious differences in binding or epigenome profiles. In this proposal, we aim to develop a general framework for power calculations in ChIP-seq experiments with three specific aims and by considering statistical models commonly used in ChIP-seq analysis: (1) Power calculations based on the conditional Binomial model; (2) Power calculations based on the Poisson and Negative Binomial regression models; (3) A power calculation tool for GALAXY and Bioconductor. This project will be accomplished through a combination of theoretical/methodological development, simulation, computational analysis, and experimental validation. Methods will be developed and evaluated using datasets from the ENCODE, modENCODE, and the RoadMap Epigenomics consortiums as well as novel datasets from collaborators. Statistical resources generated from the project, which will be disseminated in publicly available software, will provide essential tools for the efficient design of ChIP-seq experiments.
PUBLIC HEALTH RELEVANCE: The proposed research is relevant to public health because capturing genome-wide binding of transcription factors and epigenomic information by ChIP-seq technology is invaluable for comprehensively understanding development, differentiation, and disease. Design of ChIP-seq experiments present unprecedented challenges. We will develop a statistical framework for power calculations in designing ChIP-seq experiments and disseminate results and software to the research community.
描述(由申请人提供):高通量下一代测序(NGS)技术的出现使数十亿个碱基的快速和廉价测序成为遗传学和基因组学领域的革命性变革。在NGS的应用中,CHIP-SEQ(染色质免疫沉淀后NGS)可能是迄今为止最成功的。ChIP-SEQ技术使研究人员能够研究转录因子的全基因组结合和表观基因组标记的映射。这两种基因在细胞特异性基因表达的编程中都起着至关重要的作用;因此,它们的全基因组图谱可以显著提高我们理解和诊断人类疾病的能力。尽管用于CHIP-SEQ数据的基本分析工具正在迅速增加,但是关于CHIP-SEQ实验的设计问题并没有太大的进展。计划进行芯片序列实验的研究人员需要回答的一个具有挑战性的问题是:芯片和对照样本的测序应该有多深?答案取决于多个因素,其中一些因素可以由实验者根据试点/初步数据设定。芯片序列实验的测序深度是决定所有潜在靶点(例如结合位置或表观基因组图谱)是否都能用靶向能力识别的关键因素之一。当目标是分析转录因子结合和表观基因组图谱的个体对个体和等位基因特异性变异时,这一点尤其重要。测序深度不足可能会导致结合或表观基因组图谱中的虚假差异。在这项建议中,我们旨在开发一个通用的框架,在CHIP-SEQ实验中有三个特定的目标,并通过考虑在CHIP-SEQ分析中常用的统计模型:(1)基于条件二项模型的功率计算;(2)基于泊松和负二项回归模型的功率计算;(3)银河和生物导体的功率计算工具。这个项目将通过理论/方法开发、模拟、计算分析和实验验证相结合的方式完成。将使用ENCODE、modENCODE和路线图表观基因组学联盟的数据集以及来自合作者的新数据集来开发和评估方法。该项目产生的统计资源将通过公开可用的软件传播,将为有效设计芯片序列实验提供必要的工具。
公共卫生相关性:拟议的研究与公共健康相关,因为通过芯片序列技术捕获转录因子和表观基因组信息的全基因组结合对于全面了解发育、分化和疾病是非常宝贵的。芯片序列实验的设计提出了前所未有的挑战。我们将开发一个统计框架,用于在设计芯片序列实验时进行功率计算,并将结果和软件传播给研究界。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Sunduz Keles其他文献
Sunduz Keles的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Sunduz Keles', 18)}}的其他基金
Statistical methods for co-expression network analysis of population-scale scRNA-seq data
群体规模 scRNA-seq 数据共表达网络分析的统计方法
- 批准号:
10740240 - 财政年份:2023
- 资助金额:
$ 18.41万 - 项目类别:
Functionally relevant mapping of human GWAS SNPs on model organisms
人类 GWAS SNP 在模式生物上的功能相关图谱
- 批准号:
10056966 - 财政年份:2020
- 资助金额:
$ 18.41万 - 项目类别:
High dimensional statistical data modeling and integration for studying regulatory variation
用于研究监管变化的高维统计数据建模和集成
- 批准号:
10413927 - 财政年份:2007
- 资助金额:
$ 18.41万 - 项目类别:
Statistical Analysis Methods and Software for ChIP-seq Data
ChIP-seq 数据的统计分析方法和软件
- 批准号:
8605900 - 财政年份:2007
- 资助金额:
$ 18.41万 - 项目类别:
Statistical Analysis Methods and Software for ChIP-seq Data
ChIP-seq 数据的统计分析方法和软件
- 批准号:
8785690 - 财政年份:2007
- 资助金额:
$ 18.41万 - 项目类别:
Statistical Methods for the Analysis of ChlP-chip Data
ChlP 芯片数据分析的统计方法
- 批准号:
7253510 - 财政年份:2007
- 资助金额:
$ 18.41万 - 项目类别:
Statistical Analysis Methods and Software for ChIP-seq Data
ChIP-seq 数据的统计分析方法和软件
- 批准号:
8370723 - 财政年份:2007
- 资助金额:
$ 18.41万 - 项目类别:
Statistical Methods for the Analysis of ChlP-chip Data
ChlP 芯片数据分析的统计方法
- 批准号:
7799293 - 财政年份:2007
- 资助金额:
$ 18.41万 - 项目类别:
High dimensional statistical data integration for studying regulatory variation
用于研究监管变化的高维统计数据集成
- 批准号:
9344668 - 财政年份:2007
- 资助金额:
$ 18.41万 - 项目类别:
High dimensional statistical data modeling and integration for studying regulatory variation
用于研究监管变化的高维统计数据建模和集成
- 批准号:
10610872 - 财政年份:2007
- 资助金额:
$ 18.41万 - 项目类别:
相似海外基金
Quantum chemical challenge to elucidate the functional mechanism of base sequence specificity deciding removal of the DNA damage
量子化学挑战阐明碱基序列特异性决定去除 DNA 损伤的功能机制
- 批准号:
19K22903 - 财政年份:2019
- 资助金额:
$ 18.41万 - 项目类别:
Grant-in-Aid for Challenging Research (Exploratory)
Theoretical Study on Relation of Base sequence and Electronic Structures toward Elucidation of Mechanism of DNA Electric Conductivity.
碱基序列与电子结构关系的理论研究,阐明DNA导电机制。
- 批准号:
16K05666 - 财政年份:2016
- 资助金额:
$ 18.41万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Prediction and control of base sequence recognition ability for nucleic acid binding proteins by using computer experiments.
利用计算机实验预测和控制核酸结合蛋白的碱基序列识别能力。
- 批准号:
14598001 - 财政年份:2002
- 资助金额:
$ 18.41万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
FLANKING BASE SEQUENCE ON MUTAGENICITY OF 8 OXOGUANINE
8 氧鸟嘌呤致突变性的侧翼碱基序列
- 批准号:
6362773 - 财政年份:2001
- 资助金额:
$ 18.41万 - 项目类别:
FLANKING BASE SEQUENCE ON MUTAGENICITY OF 8 OXOGUANINE
8 氧鸟嘌呤致突变性的侧翼碱基序列
- 批准号:
6137753 - 财政年份:2000
- 资助金额:
$ 18.41万 - 项目类别:
GROWTH HOROMON LOCALIZATION AND ITS BASE SEQUENCE IN BOVINE PANCREATIC
牛胰腺生长激素定位及其碱基序列
- 批准号:
10460134 - 财政年份:1998
- 资助金额:
$ 18.41万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
DNA BASE SEQUENCE EFFECTS IN CHEMICAL CARCINOGENESIS
DNA 碱基序列在化学致癌作用中的作用
- 批准号:
2488608 - 财政年份:1997
- 资助金额:
$ 18.41万 - 项目类别:
DNA BASE SEQUENCE EFFECTS IN CHEMICAL CARCINOGENESIS
DNA 碱基序列在化学致癌作用中的作用
- 批准号:
6475917 - 财政年份:1997
- 资助金额:
$ 18.41万 - 项目类别:
DNA BASE SEQUENCE EFFECTS IN CHEMICAL CARCINOGENESIS
DNA 碱基序列在化学致癌作用中的作用
- 批准号:
6329024 - 财政年份:1997
- 资助金额:
$ 18.41万 - 项目类别:
DNA BASE SEQUENCE EFFECTS IN CHEMICAL CARCINOGENESIS
DNA 碱基序列在化学致癌作用中的作用
- 批准号:
6124462 - 财政年份:1997
- 资助金额:
$ 18.41万 - 项目类别: