Computational modeling of DNA methylation-mediated gene regulation
DNA甲基化介导的基因调控的计算模型
基本信息
- 批准号:9896942
- 负责人:
- 金额:$ 36.24万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2019
- 资助国家:美国
- 起止时间:2019-09-16 至 2023-05-31
- 项目状态:已结题
- 来源:
- 关键词:AffectBindingBinding ProteinsBinding SitesBiological AssayBrainCatalogingCatalogsCellsCharacteristicsChemicalsClinicClinicalClustered Regularly Interspaced Short Palindromic RepeatsCommunitiesComputer SimulationComputer softwareCytosineDNADNA MethylationDNA SequenceDNA sequencingDataData SetDevelopmentDiagnosticDiseaseDistalElementsEnhancersEpigenetic ProcessEtiologyFutureGene ExpressionGene Expression RegulationGenerationsGenesGenetic TranscriptionGenetic VariationGenomeGliomaGoalsIndividualInternationalLabelLaboratoriesMachine LearningMalignant NeoplasmsMapsMeasuresMediatingMethodsMethylationModelingModificationMusNatural Language ProcessingNeuronsNucleic Acid Regulatory SequencesPaperPathologicPatientsPatternPlayPublishingRecurrenceRegulationRegulatory ElementReporterResolutionRetrievalRoleSamplingSeriesSignal TransductionSiteSoftware ToolsTestingThe Cancer Genome AtlasTrainingTranslatingbasecancer genomecancer typeclinical sequencingcofactordemethylationembryonic stem cellepigenetic therapyepigenomeepigenome editingepigenomicsexperimental studygenome sequencinggenome wide methylationgenome-widehistone modificationhuman diseaseindividualized medicinelong short term memory networkmethylation patternmethylomemutantnanoporenetwork architecturepredictive modelingpredictive toolsprognosticprognostic assayspromoterrecruitrelating to nervous systemsuccesstooltranscription factorwhole genome
项目摘要
Abstract
Large numbers of complete methylomes are being acquired through clinical sequencing projects, such as
through The Cancer Genome Atlas, Blueprint Epigenome Project, and International Cancer Genome
Consortium. Furthermore, third-generation nanopore sequencers, which detect DNA methylation and genetic
variation in a single experiment, are nearly ready for routine clinical sequencing and will provide complete
methylomes for all patients where whole-genome sequencing is indicated. Current analysis tools however only
perform preliminary methylome processing and catalogue differentially methylated regions (DMRs). In order to
transform methylome analysis into a clinically useful diagnostic/prognostic test, we need to develop predictive
tools to interpret the functional and pathological consequences of identified methylation changes. Towards this
goal, we have published a series of papers demonstrating that machine-learning based models utilizing high-
resolution signatures of all methylation changes around a promoter vastly outperform conventional DMR
methods. Our models accurately predict expression states at genes potentially regulated by methylation and
reveal predictive methylation signatures that facilitate mechanistic interpretation. Nonetheless, several
challenges remain before we can achieve our goals of translating genome-wide methylation data for routine
clinical use: (1) To our knowledge, no current models integrate distal enhancers, whose activation is affected by
DNA methylation. Such integrative analysis is necessary to understand consequences of methylation changes
in cancers, whose genomes frequently undergo wide-spread methylation changes. In addition, such modelling
will be essential to understand the role of 5-hydroxymethylcytosine (5hmC), which may play both repressive and
activating roles in neurons depending on whether it is found at promoters or enhancers. (2) Our current models
(and conventional approaches) represent methylation data independent of DNA sequence despite mechanistic
studies demonstrating that methylation changes can have different functional effects depending on which
sequences change and depending on the context of the local regulatory grammar. In this proposal, we will meet
these challenges by first developing a predictive model that incorporates 5-methylcytosine and 5hmC at
promoters and enhancers to determine how these marks act in concert. In particular, we will examine the
hypothesized dual role of 5hmC as a repressor at promoters and as an activator at enhancers in cortical neurons.
We will then use new advances in natural language processing to model DNA sequence and methylation to
predict expression states. Our results will reveal which regulatory elements and transcription factors binding sites
are affected by DNA methylation and how changes at different sites collaborate to affect expression changes.
We will experimentally validate our in silico predictions using a combination of reporter assays and CRISPR-
based epigenome-editing tools. Thus, the software tools we develop will form an important toolkit for the analysis
and mechanistic interpretation of whole-genome methylation studies, both in the laboratory and clinic.
摘要
大量完整的甲基组正在通过临床测序项目获得,例如
通过癌症基因组图谱、蓝图表观基因组计划和国际癌症基因组
财团。此外,第三代纳米孔测序仪,它检测DNA甲基化和基因
单个实验中的变异,几乎准备好进行常规临床测序,并将提供完整的
适用于所有需要进行全基因组测序的患者。然而,仅限当前分析工具
执行初步的甲基组处理和编目差异甲基化区域(DMRS)。为了
将甲基组分析转化为临床有用的诊断/预后测试,我们需要开发预测性
用于解释已识别的甲基化变化的功能和病理后果的工具。朝向这个方向
Goal,我们已经发表了一系列的论文,证明了基于机器学习的模型利用高效率的
启动子周围所有甲基化变化的分辨率签名远远优于传统的DMR
方法:研究方法。我们的模型准确地预测了可能受甲基化和
揭示便于机械性解释的预测性甲基化特征。尽管如此,有几个
在我们能够实现将全基因组甲基化数据转换为常规数据的目标之前,仍然存在挑战
临床应用:(1)据我们所知,目前还没有模型整合远端增强子,其激活受以下因素的影响
DNA甲基化。这种综合分析对于理解甲基化变化的后果是必要的
在癌症中,其基因组经常经历广泛的甲基化变化。另外,这样的造型
对于理解5-羟甲基胞嘧啶(5HmC)的作用是至关重要的,它可能同时发挥抑制和
激活神经元的作用取决于它是在启动子还是在增强子中发现的。(2)我们目前的模式
(和传统方法)表示独立于DNA序列的甲基化数据,尽管是机械的
研究表明,甲基化的变化可以有不同的功能影响,这取决于
序列会根据当地调控语法的上下文而变化。在这个提案中,我们将相遇
这些挑战是通过首先开发一个包含5-甲基胞嘧啶和5-HmC的预测模型来实现的
推动者和增强者,以确定这些标记如何协同起作用。特别是,我们将研究
假设5HmC在皮质神经元中作为启动子的抑制者和增强子的激活者的双重作用。
然后,我们将使用自然语言处理的新进展来模拟DNA序列和甲基化,以
预测表达式状态。我们的结果将揭示哪些调控元件和转录因子结合位点
受到DNA甲基化的影响,以及不同位点的变化如何协作影响表达变化。
我们将使用报告分析和CRISPR相结合的方法来实验验证我们的计算机预测-
基于表观基因组的编辑工具。因此,我们开发的软件工具将形成分析的重要工具包
以及在实验室和临床上对全基因组甲基化研究的机械性解释。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
John R Edwards其他文献
John R Edwards的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('John R Edwards', 18)}}的其他基金
Single-cell approaches to probe the function of the unique neuronal epigenome
单细胞方法探测独特神经元表观基因组的功能
- 批准号:
10440762 - 财政年份:2022
- 资助金额:
$ 36.24万 - 项目类别:
Single-cell approaches to probe the function of the unique neuronal epigenome
单细胞方法探测独特神经元表观基因组的功能
- 批准号:
10578749 - 财政年份:2022
- 资助金额:
$ 36.24万 - 项目类别:
Computational modeling of DNA methylation-mediated gene regulation
DNA甲基化介导的基因调控的计算模型
- 批准号:
10018936 - 财政年份:2019
- 资助金额:
$ 36.24万 - 项目类别:
Computational modeling of DNA methylation-mediated gene regulation
DNA甲基化介导的基因调控的计算模型
- 批准号:
10405488 - 财政年份:2019
- 资助金额:
$ 36.24万 - 项目类别:
MODELING DNA METHYLATION'S ROLE IN GENE REGULATION
模拟 DNA 甲基化在基因调控中的作用
- 批准号:
8759963 - 财政年份:2014
- 资助金额:
$ 36.24万 - 项目类别:
MODELING DNA METHYLATION'S ROLE IN GENE REGULATION
模拟 DNA 甲基化在基因调控中的作用
- 批准号:
8899611 - 财政年份:2014
- 资助金额:
$ 36.24万 - 项目类别:
A MACHINE LEARNING APPROACH FOR FINE-SCALE GENOME WIDE DNA METHYLATION ANALYSIS
用于精细规模全基因组 DNA 甲基化分析的机器学习方法
- 批准号:
8229567 - 财政年份:2012
- 资助金额:
$ 36.24万 - 项目类别:
Novel approach to whole genome methylation profiling of breast cancer
乳腺癌全基因组甲基化分析的新方法
- 批准号:
8013458 - 财政年份:2008
- 资助金额:
$ 36.24万 - 项目类别:
Novel approach to whole genome methylation profiling of breast cancer
乳腺癌全基因组甲基化分析的新方法
- 批准号:
7471745 - 财政年份:2008
- 资助金额:
$ 36.24万 - 项目类别:
Novel approach to whole genome methylation profiling of breast cancer
乳腺癌全基因组甲基化分析的新方法
- 批准号:
8239536 - 财政年份:2008
- 资助金额:
$ 36.24万 - 项目类别:
相似国自然基金
帽结合蛋白(cap binding protein)调控乙烯信号转导的分子机制
- 批准号:32170319
- 批准年份:2021
- 资助金额:58.00 万元
- 项目类别:面上项目
帽结合蛋白(cap binding protein)调控乙烯信号转导的分子机制
- 批准号:
- 批准年份:2021
- 资助金额:58 万元
- 项目类别:
ID1 (Inhibitor of DNA binding 1) 在口蹄疫病毒感染中作用机制的研究
- 批准号:31672538
- 批准年份:2016
- 资助金额:62.0 万元
- 项目类别:面上项目
番茄EIN3-binding F-box蛋白2超表达诱导单性结实和果实成熟异常的机制研究
- 批准号:31372080
- 批准年份:2013
- 资助金额:80.0 万元
- 项目类别:面上项目
P53 binding protein 1 调控乳腺癌进展转移及化疗敏感性的机制研究
- 批准号:81172529
- 批准年份:2011
- 资助金额:58.0 万元
- 项目类别:面上项目
DBP(Vitamin D Binding Protein)在多发性硬化中的作用和相关机制的蛋白质组学研究
- 批准号:81070952
- 批准年份:2010
- 资助金额:35.0 万元
- 项目类别:面上项目
研究EB1(End-Binding protein 1)的癌基因特性及作用机制
- 批准号:30672361
- 批准年份:2006
- 资助金额:24.0 万元
- 项目类别:面上项目
相似海外基金
How lipid binding proteins shape the activity of nuclear hormone receptors
脂质结合蛋白如何影响核激素受体的活性
- 批准号:
DP240103141 - 财政年份:2024
- 资助金额:
$ 36.24万 - 项目类别:
Discovery Projects
Structural classification of NHEJ pathways; unravelling the role of Ku-binding proteins
NHEJ通路的结构分类;
- 批准号:
MR/X00029X/1 - 财政年份:2023
- 资助金额:
$ 36.24万 - 项目类别:
Research Grant
BRC-BIO: Evolutionary Patterns of Ice-Binding Proteins in North Pacific Intertidal Invertebrates
BRC-BIO:北太平洋潮间带无脊椎动物冰结合蛋白的进化模式
- 批准号:
2312378 - 财政年份:2023
- 资助金额:
$ 36.24万 - 项目类别:
Standard Grant
Exploring the roles and functions of sex steroid hormone receptor-associated RNA binding proteins in the development of geriatric diseases.
探索性类固醇激素受体相关 RNA 结合蛋白在老年疾病发展中的作用和功能。
- 批准号:
23K06408 - 财政年份:2023
- 资助金额:
$ 36.24万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
UV Plasmon-Enhanced Chiroptical Spectroscopy of Membrane-Binding Proteins
膜结合蛋白的紫外等离子增强手性光谱
- 批准号:
10680969 - 财政年份:2023
- 资助金额:
$ 36.24万 - 项目类别:
Investigating physiologic and pathophysiologic connections between the Parkinson's disease protein alpha-synuclein and RNA binding proteins
研究帕金森病蛋白 α-突触核蛋白和 RNA 结合蛋白之间的生理和病理生理联系
- 批准号:
10744556 - 财政年份:2023
- 资助金额:
$ 36.24万 - 项目类别:
Structural and computational analysis of immune-related RNA-binding proteins
免疫相关 RNA 结合蛋白的结构和计算分析
- 批准号:
23K06597 - 财政年份:2023
- 资助金额:
$ 36.24万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Characterization of carbohydrate-binding proteins and their applications
碳水化合物结合蛋白的表征及其应用
- 批准号:
23K05034 - 财政年份:2023
- 资助金额:
$ 36.24万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
A machine learning approach to identify carbon dioxide-binding proteins for sustainability and health
一种机器学习方法来识别二氧化碳结合蛋白以实现可持续发展和健康
- 批准号:
2838427 - 财政年份:2023
- 资助金额:
$ 36.24万 - 项目类别:
Studentship
The role of RNA binding proteins in heart development and congenital heart defects
RNA结合蛋白在心脏发育和先天性心脏缺陷中的作用
- 批准号:
10827567 - 财政年份:2023
- 资助金额:
$ 36.24万 - 项目类别: