Modeling gene expression in yeast using large degenerate libraries

使用大型简并文库模拟酵母中的基因表达

基本信息

  • 批准号:
    10172925
  • 负责人:
  • 金额:
    $ 35.09万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
  • 财政年份:
    2018
  • 资助国家:
    美国
  • 起止时间:
    2018-08-01 至 2023-05-31
  • 项目状态:
    已结题

项目摘要

PROJECT SUMMARY Short sequence elements in DNA and RNA determine the levels and composition of mRNAs and proteins, making it critical that we can accurately model how any given sequence will affect transcription, splicing or translation. Such models of cis-regulation will fill in gaps in our knowledge of these core gene expression processes. Additionally, as large numbers of human genomes are sequenced, the ability to predict the effects of sequence variation on the ultimate levels of proteins will be integral to the interpretation of variation in regulatory sequences. Similarly, the construction of metabolic pathways with defined levels of expression and the engineering of synthetic gene networks require accurate knowledge of how regulatory sequences affect expression. This application seeks to use the yeast Saccharomyces cerevisiae as a test case for learning how any short regulatory sequence affects protein levels. A predictive model will be trained on a set of libraries two orders of magnitude more complex than have been characterized to date. Libraries will be generated of a growth reporter gene with a million random sequences of 50 nucleotides that comprise either a DNA element that regulates transcription or an RNA element that regulates splicing or translation. The libraries will be transformed into yeast, and the yeast will be placed under selection such that they grow according to the ability of each random sequence to contribute to protein expression. A convolution neural network approach will be used to learn the relationship between these “fitness” phenotypes and their associated genotypes. Although yeast is a single-celled eukaryote, it has been the source of most of the original findings on gene expression, and these findings form the basis for much of our knowledge of more complex eukaryotes. Furthermore, the short sequences in yeast that comprise the DNA- and RNA-binding sites of regulatory proteins tend to be comparable in size to those of other organisms. Yeast is used often in synthetic biology and metabolic engineering, and the work proposed here will result in novel tools for quantitatively controlling its gene expression. Initial results with a library of 5' untranslated regions (UTRs) indicate that we can construct a model to account for a large fraction of the observed variability in expression, and that the model extends to native sequence elements. The model allowed us to forward engineer 5' UTRs to have increased activity. Specific aims of this application are to assess the effects of random sequences targeted to upstream regulatory elements, core promoter elements, 5' UTRs, introns and 3' UTRs; to learn predictive and interpretable models using convolutional neural networks and to identify novel functional cis-regulatory elements; and to validate our models on native sequences and combinatorial libraries, and by engineering synthetic sequence elements with user-specified properties. In sum, the proposal seeks to construct a comprehensive and predictive model of regulatory sequence–function relationships for a well-studied single- celled eukaryote, providing a basis for similar studies on other organisms.
项目摘要 DNA和RNA中的短序列元件决定mRNA和蛋白质的水平和组成, 这使得我们能够准确地模拟任何给定序列如何影响转录、剪接或 翻译.这种顺式调控模型将填补我们对这些核心基因表达的知识空白 流程.此外,随着大量人类基因组的测序, 在蛋白质的最终水平上的序列变异的解释将是不可分割的, 调节序列类似地,构建具有确定表达水平的代谢途径, 合成基因网络的工程设计需要精确了解调控序列如何影响 表情这个应用程序试图使用酵母酿酒酵母作为一个测试案例,学习如何 任何短的调节序列都会影响蛋白质水平。预测模型将在一组库上训练, 数量级比迄今为止所描述的更为复杂。库将由一个 具有50个核苷酸的一百万个随机序列的生长报告基因,所述随机序列包含DNA元件 调节转录的RNA元件或调节剪接或翻译的RNA元件。图书馆将 转化成酵母,酵母将被置于选择之下,使它们根据能力生长。 每个随机序列的组合有助于蛋白质表达。卷积神经网络方法将是 用于了解这些“适应性”表型与其相关基因型之间的关系。虽然 酵母是一种单细胞真核生物,它是大多数关于基因表达的原始发现的来源, 这些发现为我们了解更复杂的真核生物奠定了基础。而且 酵母中包含调节蛋白的DNA和RNA结合位点的短序列往往是 在大小上与其他生物体相当。酵母通常用于合成生物学和代谢生物学。 工程,这里提出的工作将导致定量控制其基因的新工具 表情利用5'非翻译区(UTR)文库的初步结果表明,我们可以构建一个 该模型解释了观察到的表达变化的很大一部分,并且该模型扩展到 天然序列元件。该模型允许我们向前工程化5'UTR以具有增加的活性。 本申请的具体目的是评估靶向上游的随机序列的影响, 调控元件、核心启动子元件、5'UTR、内含子和3' UTR;以学习预测和 使用卷积神经网络的可解释模型,并确定新的功能性顺式调节 元素;并验证我们的模型对天然序列和组合库,并通过工程 具有用户指定属性的合成序列元素。总括而言,这项建议旨在建立一个 全面和预测模型的调控序列功能关系,为一个良好的研究单一的, 细胞真核生物,为其他生物的类似研究提供了基础。

项目成果

期刊论文数量(1)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Effects of sequence motifs in the yeast 3' untranslated region determined from massively parallel assays of random sequences.
  • DOI:
    10.1186/s13059-021-02509-6
  • 发表时间:
    2021-10-18
  • 期刊:
  • 影响因子:
    12.3
  • 作者:
    Savinov A;Brandsen BM;Angell BE;Cuperus JT;Fields S
  • 通讯作者:
    Fields S
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

STANLEY FIELDS其他文献

STANLEY FIELDS的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('STANLEY FIELDS', 18)}}的其他基金

INTERROGATION OF E3 UBIQUITIN LIGASE CATALYSIS BY DEEP MUTATIONAL SCANNING
通过深度突变扫描研究 E3 泛素连接酶催化作用
  • 批准号:
    8365800
  • 财政年份:
    2011
  • 资助金额:
    $ 35.09万
  • 项目类别:
CHARACTERIZATION OF SMALL MOLECULE METABOLITES
小分子代谢物的表征
  • 批准号:
    8365852
  • 财政年份:
    2011
  • 资助金额:
    $ 35.09万
  • 项目类别:
A STRATEGY TO QUANTIFY PROTEIN STABILITY
量化蛋白质稳定性的策略
  • 批准号:
    8365801
  • 财政年份:
    2011
  • 资助金额:
    $ 35.09万
  • 项目类别:
GENOME-WIDE ANALYSIS OF NASCENT TRANSCRIPTION IN SACCHAROMYCES CEREVISIAE
酿酒酵母新生转录的全基因组分析
  • 批准号:
    8365819
  • 财政年份:
    2011
  • 资助金额:
    $ 35.09万
  • 项目类别:
MASSIVELY PARALLEL MEASUREMENT OF SRC KINASE ACTIVITY AND DRUG RESISTANCE IN VIV
VIV 中 SRC 激酶活性和耐药性的大规模并行测量
  • 批准号:
    8365921
  • 财政年份:
    2011
  • 资助金额:
    $ 35.09万
  • 项目类别:
UNDERSTANDING THE MOLECULAR BASIS OF SELECTIVITY IN AKAP
了解 AKAP 选择性的分子基础
  • 批准号:
    8365785
  • 财政年份:
    2011
  • 资助金额:
    $ 35.09万
  • 项目类别:
HIGH-RESOLUTION MAPPING OF PROTEIN SEQUENCE-FUNCTION RELATIONSHIPS
蛋白质序列-功能关系的高分辨率绘图
  • 批准号:
    8365920
  • 财政年份:
    2011
  • 资助金额:
    $ 35.09万
  • 项目类别:
LARGE SCALE MEASUREMENT OF EPISTASIS TO IDENTIFY MUTATIONS THAT STABILIZE PROTEI
大规模测量上位性以鉴定稳定蛋白质的突变
  • 批准号:
    8365793
  • 财政年份:
    2011
  • 资助金额:
    $ 35.09万
  • 项目类别:
WIDE VARIATION IN ANTIBIOTIC RESISTANCE PROTEINS IDENTIFIED BY FUNCTIONAL METAGE
通过功能计量鉴定的抗生素抗性蛋白的广泛变异
  • 批准号:
    8365808
  • 财政年份:
    2011
  • 资助金额:
    $ 35.09万
  • 项目类别:
SEMINARS GIVEN BY STANLEY FIELDS
斯坦利·菲尔兹举办的研讨会
  • 批准号:
    8365853
  • 财政年份:
    2011
  • 资助金额:
    $ 35.09万
  • 项目类别:

相似海外基金

Impact of alternative polyadenylation of 3'-untranslated regions in the PI3K/AKT cascade on microRNA
PI3K/AKT 级联中 3-非翻译区的替代多聚腺苷酸化对 microRNA 的影响
  • 批准号:
    573541-2022
  • 财政年份:
    2022
  • 资助金额:
    $ 35.09万
  • 项目类别:
    University Undergraduate Student Research Awards
How do untranslated regions of cannabinoid receptor type 1 mRNA determine receptor subcellular localisation and function?
1 型大麻素受体 mRNA 的非翻译区如何决定受体亚细胞定位和功能?
  • 批准号:
    2744317
  • 财政年份:
    2022
  • 资助金额:
    $ 35.09万
  • 项目类别:
    Studentship
MICA:Synthetic untranslated regions for direct delivery of therapeutic mRNAs
MICA:用于直接递送治疗性 mRNA 的合成非翻译区
  • 批准号:
    MR/V010948/1
  • 财政年份:
    2021
  • 资助金额:
    $ 35.09万
  • 项目类别:
    Research Grant
Translational Control by 5'-untranslated regions
5-非翻译区域的翻译控制
  • 批准号:
    10019570
  • 财政年份:
    2019
  • 资助金额:
    $ 35.09万
  • 项目类别:
Translational Control by 5'-untranslated regions
5-非翻译区域的翻译控制
  • 批准号:
    10223370
  • 财政年份:
    2019
  • 资助金额:
    $ 35.09万
  • 项目类别:
Translational Control by 5'-untranslated regions
5-非翻译区域的翻译控制
  • 批准号:
    10455108
  • 财政年份:
    2019
  • 资助金额:
    $ 35.09万
  • 项目类别:
Synergistic microRNA-binding sites, and 3' untranslated regions: a dialogue of silence
协同的 microRNA 结合位点和 3 非翻译区:沉默的对话
  • 批准号:
    255762
  • 财政年份:
    2012
  • 资助金额:
    $ 35.09万
  • 项目类别:
    Operating Grants
Analysis of long untranslated regions in Nipah virus genome
尼帕病毒基因组长非翻译区分析
  • 批准号:
    20790351
  • 财政年份:
    2008
  • 资助金额:
    $ 35.09万
  • 项目类别:
    Grant-in-Aid for Young Scientists (B)
Search for mRNA elements involved in the compatibility between 5' untranslated regions and coding regions in chloroplast translation
寻找参与叶绿体翻译中 5 非翻译区和编码区之间兼容性的 mRNA 元件
  • 批准号:
    19370021
  • 财政年份:
    2007
  • 资助金额:
    $ 35.09万
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
Post-transcriptional Regulation of PPAR-g Expression by 5'-Untranslated Regions
5-非翻译区对 PPAR-g 表达的转录后调控
  • 批准号:
    7131841
  • 财政年份:
    2006
  • 资助金额:
    $ 35.09万
  • 项目类别:
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了