Harmony AI: Natural Language Processing Enabling Advanced Biomanufacturing
Harmony AI:自然语言处理实现先进生物制造
基本信息
- 批准号:10761082
- 负责人:
- 金额:$ 22.92万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2023
- 资助国家:美国
- 起止时间:2023-09-01 至 2024-08-31
- 项目状态:已结题
- 来源:
- 关键词:AffectAmino Acid SequenceAntibodiesArtificial IntelligenceBiologicalBiological AssayBiomanufacturingCell ProliferationCellsCodeCodon NucleotidesComputational TechniqueComputer softwareCustomDNA SequenceDependenceDrug Delivery SystemsEngineered GeneEnsureEnzymesEscherichia coliEscherichia coli ProteinsEvaluationFoodFood productionFrequenciesGenesGoalsGrowthHeadImmunosorbentsLearningLinkLiteratureMeasuresModelingNatural Language ProcessingOrganismPharmacologic SubstancePlasmidsPolymersPositioning AttributeProductionProtein ConformationProteinsRecombinant ProteinsReportingResearch PersonnelRiskRunningSignal TransductionSystemTechniquesTechnologyTextilesTrainingTranslationsTreatment EfficacyVariantVegan Dietcostdrug discoveryexperimental studyfeedinghands-on learningimprovedin vitro Assaymeetingspreventprocess optimizationprotein expressionprotein foldingprotein functionprotein misfoldingr-hGH-Mscale uptoolvector
项目摘要
Project Summary/Abstract:
Recombinant proteins have a wide range of applications, from pharmaceutical products, drug
discovery, protein-based polymers for drug delivery, antibodies enzymes, and sustainable
technologies such as textiles and vegan food production such as the impossible burger. Towards
meeting these increased demands by replacing batch culture with continuous culture reduces
overhead costs, batch-to-batch variation, and increases protein production. Further, towards
maximizing recombinant protein, yield computational techniques for gene engineering, such as
codon optimization, use synonymous codon changes to increase protein production. Although
codon optimization increases protein production in specific systems, synonymous changes to a
gene sequence can cause unexpected detrimental results to the protein, such as protein
misfolding, decreased protein yield, and vector loss. Therefore, codon optimization may not
provide an optimal strategy for increasing protein production in batch culture and may introduce
risk when scaling to continuous culture. CFDRC has utilized state-of-the-art natural language
processing techniques to learn how synonymous codons are used by a target organism and apply
this learning to gene engineering. We demonstrated that our AI-based codon harmonization
model could predict the E. Coli synonymous codon usage with 73% accuracy, significantly above
prior reports. Using this AI-based approach to gene engineering will provide an optimal strategy
for increasing protein production in batch culture and continuous culture in addition to de-risk
scaling up to continuous culture.
项目概要/摘要:
重组蛋白具有广泛的应用,从医药产品、药物
发现,用于药物递送的蛋白质聚合物,抗体酶,以及可持续的
比如纺织品和素食食品生产,比如不可能的汉堡。朝向
通过用连续培养代替分批培养来满足这些增加的需求,
间接成本、批次间差异,并增加蛋白质产量。此外,向
最大化重组蛋白,产生用于基因工程的计算技术,例如
密码子优化,使用同义密码子改变来增加蛋白质产量。虽然
密码子优化增加了特定系统中的蛋白质产量,
基因序列可对蛋白质造成意想不到的有害结果,例如蛋白质
错误折叠、蛋白质产量降低和载体损失。因此,密码子优化可能不
提供了在分批培养中增加蛋白质产量的最佳策略,
风险时扩展到连续培养。CFDRC利用最先进的自然语言
处理技术,以了解同义密码子如何被目标生物体使用,并应用
这种学习到基因工程。我们证明了我们基于人工智能的密码子协调
模型可以预测E.大肠杆菌同义密码子使用的准确率为73%,显著高于
先前的报告。将这种基于人工智能的方法用于基因工程将提供一种最佳策略
除了降低风险外,还用于增加分批培养和连续培养中的蛋白质产量
扩大到连续培养。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
David Gaddes其他文献
David Gaddes的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('David Gaddes', 18)}}的其他基金
Harmony AI: State of the Art Natural Language Processing for Genetic Engineering
Harmony AI:用于基因工程的最先进的自然语言处理
- 批准号:
10698805 - 财政年份:2023
- 资助金额:
$ 22.92万 - 项目类别:
相似海外基金
Cerebral infarction treatment strategy using collagen-like "triple helix peptide" containing functional amino acid sequence
含功能氨基酸序列的类胶原“三螺旋肽”治疗脑梗塞策略
- 批准号:
23K06972 - 财政年份:2023
- 资助金额:
$ 22.92万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Establishment of a screening method for functional microproteins independent of amino acid sequence conservation
不依赖氨基酸序列保守性的功能性微生物蛋白筛选方法的建立
- 批准号:
23KJ0939 - 财政年份:2023
- 资助金额:
$ 22.92万 - 项目类别:
Grant-in-Aid for JSPS Fellows
Effects of amino acid sequence and lipids on the structure and self-association of transmembrane helices
氨基酸序列和脂质对跨膜螺旋结构和自缔合的影响
- 批准号:
19K07013 - 财政年份:2019
- 资助金额:
$ 22.92万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Construction of electron-transfer amino acid sequence probe with an interaction for protein and cell
蛋白质与细胞相互作用的电子转移氨基酸序列探针的构建
- 批准号:
16K05820 - 财政年份:2016
- 资助金额:
$ 22.92万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Development of artificial antibody of anti-bitter taste receptor using random amino acid sequence library
利用随机氨基酸序列库开发抗苦味受体人工抗体
- 批准号:
16K08426 - 财政年份:2016
- 资助金额:
$ 22.92万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
The aa15-17 amino acid sequence in the terminal protein domain of HBV polymerase as a viral factor affect-ing in vivo as well as in vitro replication activity of the virus.
HBV聚合酶末端蛋白结构域中的aa15-17氨基酸序列作为影响病毒体内和体外复制活性的病毒因子。
- 批准号:
25461010 - 财政年份:2013
- 资助金额:
$ 22.92万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Amino acid sequence analysis of fossil proteins using mass spectrometry
使用质谱法分析化石蛋白质的氨基酸序列
- 批准号:
23654177 - 财政年份:2011
- 资助金额:
$ 22.92万 - 项目类别:
Grant-in-Aid for Challenging Exploratory Research
Precise hybrid synthesis of glycoprotein through amino acid sequence-specific introduction of oligosaccharide followed by enzymatic transglycosylation reaction
通过氨基酸序列特异性引入寡糖,然后进行酶促糖基转移反应,精确杂合合成糖蛋白
- 批准号:
22550105 - 财政年份:2010
- 资助金额:
$ 22.92万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Estimating selection on amino-acid sequence polymorphisms in Drosophila
果蝇氨基酸序列多态性选择的估计
- 批准号:
NE/D00232X/1 - 财政年份:2006
- 资助金额:
$ 22.92万 - 项目类别:
Research Grant
Construction of a neural network for detecting novel domains from amino acid sequence information only
构建仅从氨基酸序列信息检测新结构域的神经网络
- 批准号:
16500189 - 财政年份:2004
- 资助金额:
$ 22.92万 - 项目类别:
Grant-in-Aid for Scientific Research (C)