Quantitative Modeling of Transcription Factor-DNA Binding
转录因子-DNA 结合的定量建模
基本信息
- 批准号:9975181
- 负责人:
- 金额:$ 52.37万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2019
- 资助国家:美国
- 起止时间:2019-07-09 至 2024-06-30
- 项目状态:已结题
- 来源:
- 关键词:3-DimensionalAmino AcidsBase PairingBase SequenceBindingBinding ProteinsBinding SitesBiological AssayBiophysicsChromatinChromatin StructureComputing MethodologiesDNADNA BindingDNA MethylationDNA SequenceDNA StructureDataDevelopmentEpitopesFamilyGene Expression RegulationGenerationsGenesGenomeGenomicsHydrogen BondingHydrophobicityIn VitroIndividualKnowledgeMajor GrooveMentorsMethodsModelingMolecularMutationNatureNucleosomesProtein FamilyProteinsRegulator GenesScientistShapesSiteSpecificityStructureTechnologyTimeTrainingTranscriptional Regulationbasecell typecofactorconvolutional neural networkdeep learningdimerexperimental studyfeature selectionfunctional grouphistone modificationin vivoinnovationinsightlearning strategytranscription factor
项目摘要
Title: Quantitative Modeling of Transcription Factor–DNA Binding
PI: Rohs, Remo
PROJECT SUMMARY
Genes are regulated through transcription factor (TF) binding to specific DNA target sites in the genome.
These target sites are recognized through several layers of specificity determinants. The most extensively
studied layer of binding specificity are hydrogen bonds and hydrophobic contacts between protein amino acids
and functional groups of the base pairs mainly in the major groove. Base readout recognizes nucleotide
sequence within a short core-binding site of only a few base pairs. However, these distinct sequence
combinations in a TF binding motif occur many times in the genome and only a very small fraction of putative
binding sites are functional. It is still unknown how a TF locates and identifies its in vivo binding sites in the
plethora of possible genomic target sites. Recognition of three-dimensional DNA structure is an additional layer
that refines base readout. While the latter is restricted to direct contacts with the core motif, shape readout is a
mechanism through which flanking regions of the core motif or spacer regions between half-sites of dimeric
TFs contribute to binding specificity. Other layers of in vivo TF binding determinants are chromatin structure,
DNA accessibility, histone modifications, DNA methylation, cofactors and cooperative binding, and cell type.
Given this multi-layer nature of TF recognition, we will develop quantitative models to predict TF binding with
high accuracy. More important, however, is that our models will reveal recognition mechanisms in the absence
of experiment-based structural information. We will build models where each distinct layer of TF binding
specificity determinants is added to a base-line model combining DNA sequence and shape. Since it is
expected that the importance of each of these TF binding specificity determinants will vary dramatically across
protein families, we will use feature selection to identify relative contributions of each feature group as a
function of TF or TF family. We will also develop a deep learning framework where individual feature modules
can be added or removed from the input layer of convolutional neural networks. This approach will leverage
the advantages of deep learning while circumventing the “black box” nature of standard deep learning
methods. We will also generate experimental data for specific TFs using the SELEX-seq technology. This
approach is currently able to probe the effect of cofactors, cooperative binding, and protein mutations on the
binding specificity of a TF. We will add nucleosomes to the SELEX-seq binding assay and, thereby, probe
chromatin effects on TF binding using an in vitro experiment in the absence of other cellular contributions. This
project will result in a better mechanistic understanding of TF-DNA binding and reveal the impact of various
specificity determinants across multiple scales. The new insights will describe different combinations of readout
mechanisms on a protein-family specific basis. Our new methods will yield progress in biomedical innovation
that is based on transcription and gene regulation. The generated knowledge will better integrate genomics
and biophysics, and the project will contribute to the training and mentoring of a new generation of scientists.
转录因子-DNA结合的定量模型
圆周率:RoHS,Remo
项目总结
基因通过转录因子(TF)与基因组中特定的DNA靶点结合来调节。
这些靶点是通过几层特异性决定因素识别的。最广泛的
研究的结合特异层是蛋白质氨基酸之间的氢键和疏水接触
碱基对的官能团主要在主槽中。碱基读数识别核苷酸
只有几个碱基对的短核心结合位点内的序列。然而,这些不同序列
Tf结合基序中的结合在基因组中出现多次,并且只有很小一部分推定
结合位点具功能性。目前还不清楚转铁蛋白如何定位和识别其在体内的结合部位。
过多的可能的基因组靶点。识别三维DNA结构是额外的一层
这改进了基本读数。虽然后者仅限于与核心主题的直接接触,但形状读出是一种
核心基序侧翼区域或二聚体半位点间隔区的作用机制
TFS有助于结合特异性。体内转铁蛋白结合决定簇的其他层是染色质结构,
DNA可及性、组蛋白修饰、DNA甲基化、辅因子和协同结合,以及细胞类型。
鉴于转铁蛋白识别的这种多层性质,我们将开发定量模型来预测转铁蛋白与
精确度高。然而,更重要的是,我们的模型将揭示在没有识别机制的情况下
基于实验的结构信息。我们将构建模型,其中每个不同的TF绑定层
将特异性决定因素添加到结合DNA序列和形状的基线模型中。因为它是
预计这些转铁蛋白结合特异性决定因素中的每一个的重要性将在
蛋白质家族,我们将使用特征选择来识别每个特征组的相对贡献
Tf或Tf家族的功能。我们还将开发一个深度学习框架,其中各个功能模块
可以从卷积神经网络的输入层中添加或删除。这一方法将利用
深度学习的优势,同时绕过了标准深度学习的黑匣子性质
方法:研究方法。我们还将使用SELEX-SEQ技术为特定的TF生成实验数据。这
方法目前能够探索辅因子、协同结合和蛋白质突变对
转铁蛋白的结合特异性。我们将在SELEX-SEQ结合分析中加入核小体,从而探测
在没有其他细胞贡献的情况下,用体外实验研究染色质对转铁蛋白结合的影响。这
该项目将导致对TF-DNA结合的更好的机制理解,并揭示各种
多个尺度上的特异性决定因素。新的见解将描述读数的不同组合
在蛋白质家族特定基础上的机制。我们的新方法将在生物医学创新方面取得进展
这是基于转录和基因调控。产生的知识将更好地整合基因组学
和生物物理学,该项目将有助于培训和指导新一代科学家。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
                item.title }}
{{ item.translation_title }}
- DOI:{{ item.doi }} 
- 发表时间:{{ item.publish_year }} 
- 期刊:
- 影响因子:{{ item.factor }}
- 作者:{{ item.authors }} 
- 通讯作者:{{ item.author }} 
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:{{ item.author }} 
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:{{ item.author }} 
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:{{ item.author }} 
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:{{ item.author }} 
数据更新时间:{{ patent.updateTime }}
Remo Rohs其他文献
Remo Rohs的其他文献
{{
              item.title }}
{{ item.translation_title }}
- DOI:{{ item.doi }} 
- 发表时间:{{ item.publish_year }} 
- 期刊:
- 影响因子:{{ item.factor }}
- 作者:{{ item.authors }} 
- 通讯作者:{{ item.author }} 
{{ truncateString('Remo Rohs', 18)}}的其他基金
Quantitative Modeling of Transcription Factor-DNA Binding
转录因子-DNA 结合的定量建模
- 批准号:10431863 
- 财政年份:2019
- 资助金额:$ 52.37万 
- 项目类别:
Quantitative Modeling of Transcription Factor-DNA Binding
转录因子-DNA 结合的定量建模
- 批准号:10650775 
- 财政年份:2019
- 资助金额:$ 52.37万 
- 项目类别:
Quantitative Modeling of Transcription Factor-DNA Binding
转录因子-DNA 结合的定量建模
- 批准号:10189652 
- 财政年份:2019
- 资助金额:$ 52.37万 
- 项目类别:
Genome analysis based on the integration of DNA sequence and shape
基于DNA序列和形状整合的基因组分析
- 批准号:8795204 
- 财政年份:2014
- 资助金额:$ 52.37万 
- 项目类别:
Genome analysis based on the integration of DNA sequence and shape
基于DNA序列和形状整合的基因组分析
- 批准号:8632246 
- 财政年份:2014
- 资助金额:$ 52.37万 
- 项目类别:
相似海外基金
Double Incorporation of Non-Canonical Amino Acids in an Animal and its Application for Precise and Independent Optical Control of Two Target Genes
动物体内非规范氨基酸的双重掺入及其在两个靶基因精确独立光学控制中的应用
- 批准号:BB/Y006380/1 
- 财政年份:2024
- 资助金额:$ 52.37万 
- 项目类别:Research Grant 
Quantifying L-amino acids in Ryugu to constrain the source of L-amino acids in life on Earth
量化 Ryugu 中的 L-氨基酸以限制地球生命中 L-氨基酸的来源
- 批准号:24K17112 
- 财政年份:2024
- 资助金额:$ 52.37万 
- 项目类别:Grant-in-Aid for Early-Career Scientists 
Collaborative Research: RUI: Elucidating Design Rules for non-NRPS Incorporation of Amino Acids on Polyketide Scaffolds
合作研究:RUI:阐明聚酮化合物支架上非 NRPS 氨基酸掺入的设计规则
- 批准号:2300890 
- 财政年份:2023
- 资助金额:$ 52.37万 
- 项目类别:Continuing Grant 
Basic research toward therapeutic strategies for stress-induced chronic pain with non-natural amino acids
非天然氨基酸治疗应激性慢性疼痛策略的基础研究
- 批准号:23K06918 
- 财政年份:2023
- 资助金额:$ 52.37万 
- 项目类别:Grant-in-Aid for Scientific Research (C) 
Molecular mechanisms how arrestins that modulate localization of glucose transporters are phosphorylated in response to amino acids
调节葡萄糖转运蛋白定位的抑制蛋白如何响应氨基酸而被磷酸化的分子机制
- 批准号:23K05758 
- 财政年份:2023
- 资助金额:$ 52.37万 
- 项目类别:Grant-in-Aid for Scientific Research (C) 
Molecular recognition and enantioselective reaction of amino acids
氨基酸的分子识别和对映选择性反应
- 批准号:23K04668 
- 财政年份:2023
- 资助金额:$ 52.37万 
- 项目类别:Grant-in-Aid for Scientific Research (C) 
Design and Synthesis of Fluorescent Amino Acids: Novel Tools for Biological Imaging
荧光氨基酸的设计与合成:生物成像的新工具
- 批准号:2888395 
- 财政年份:2023
- 资助金额:$ 52.37万 
- 项目类别:Studentship 
Structurally engineered N-acyl amino acids for the treatment of NASH
用于治疗 NASH 的结构工程 N-酰基氨基酸
- 批准号:10761044 
- 财政年份:2023
- 资助金额:$ 52.37万 
- 项目类别:
Lifestyle, branched-chain amino acids, and cardiovascular risk factors: a randomized trial
生活方式、支链氨基酸和心血管危险因素:一项随机试验
- 批准号:10728925 
- 财政年份:2023
- 资助金额:$ 52.37万 
- 项目类别:
Single-molecule protein sequencing by barcoding of N-terminal amino acids
通过 N 端氨基酸条形码进行单分子蛋白质测序
- 批准号:10757309 
- 财政年份:2023
- 资助金额:$ 52.37万 
- 项目类别:

 刷新
              刷新
            
















 {{item.name}}会员
              {{item.name}}会员
            



