Virtual Approaches to New Chemistries
新化学的虚拟方法
基本信息
- 批准号:10447249
- 负责人:
- 金额:$ 44万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2022
- 资助国家:美国
- 起止时间:2022-06-06 至 2024-05-31
- 项目状态:已结题
- 来源:
- 关键词:AbbreviationsAddressAlgorithmsAutomationBackBiologicalBiological AssayCategoriesCharacteristicsChemical StructureChemicalsChemistryCollectionDataDatabasesDescriptorDrug DesignEvaluationFAIR principlesGenerationsGoalsHumanInformaticsInternetLearning ModuleMachine LearningMeasuresMethodologyModelingNational Center for Advancing Translational SciencesNatural Language ProcessingNatural regenerationNatureOntologyProcessProgram DevelopmentProtocols documentationQuantitative Structure-Activity RelationshipReactionReadabilityReagentRecipeResearch PersonnelRunningSchemeScientistSemanticsSolventsSorting - Cell MovementStructureSystemTechnologyTextUpdateVendorVisualWorkbasechemical reactiondeep learningdesigndrug developmentexperienceinstrumentinteractive toolknowledge basenatural languagenew technologynovelpreferencesmall moleculestoichiometrysuccesstoolvectorvirtual
项目摘要
Project Summary/Abstract
Two new virtual chemistry technologies will be added to the NCATS ASPIRE project as separate modules. The
first module will enable new chemistries to be modelled and selected from cutting edge (deep) machine
learning technology using the latest structure/activity data taken directly from instruments. The second module
will be a novel informatics system for capturing chemistry-rich data in a semantic template as
machine-readable reactions which will increase the utility of chemical reactions in electronic lab notebooks and
allow more precise interrogation and automation of reaction analyses (and their corresponding reaction
products).
The deep learning technology in module 1 is based on our new chemically rich vector (CRV) methodology,
which is able to compress information about chemical structures into a vector of 64 numbers with an efficiency
that allows the encoding process to be reversed: not only can a CRV be converted back into its original
structure with high success (>90% exact match), but a modified CRV can be converted into a structure that is
representative of that point in chemical space. CRVs make excellent descriptors for SAR/QSAR iteration
because they contain much more chemical information in a small space, allowing the automation of
structure-activity models to be more streamlined, relative to conventional descriptors. The resulting models will
explore the multi-dimensional space via an interactive visual interface (human-directed) or a back-end
algorithm to constantly search for new and better structures (machine-directed). Both interactive and
automated processes will be connected back into the ASPIRE automation cycle so that they can be
synthesized and measured (hypothesis evaluation and iterative optimization).
The second module, machine-readable reactions, draws from our extensive experience developing the
BioHarmony Annotator (formerly: BioAssay Express) which uses natural language models to assign semantic
ontology terms to biological assay protocols, turning them from unstructured text into machine-readable data.
Extracting the full content of reactions from protocols and chemical structure diagrams is remarkably difficult
given the unstructured nature of text, abbreviations, shortcuts and assumptions that go into diagrams. It is
further complicated by the need to connect the materials in the scheme with the reaction text description (e.g.
reagents, solvents, the sequences involved in the recipe, reaction workup, and product characterization). As an
alternative, we will modularize the CDD stoichiometric sketcher, which will allow us to extract this data. We will
work with NCATS to identify important fields to capture, creating a machine readable chemical reaction
template.
项目总结/摘要
两项新的虚拟化学技术将作为单独的模块加入NCATS ASPIRE项目。的
第一个模块将使新的化学品能够建模,并从尖端(深)机中选择
使用直接从仪器获取的最新结构/活动数据学习技术。第二模块
将是一个新的信息系统,用于捕获语义模板中的化学丰富的数据,
机器可读的反应,这将增加电子实验室笔记本中化学反应的实用性,
允许反应分析(及其相应的反应)的更精确的询问和自动化
产品)。
模块1中的深度学习技术基于我们新的化学丰富向量(CRV)方法,
它能够高效地将有关化学结构的信息压缩到64个数字的向量中
这使得编码过程可以逆转:不仅可以将CRV转换回其原始状态,
结构具有高成功率(>90%精确匹配),但修改的CRV可以转换为
在化学空间中的代表点。CRV是SAR/QSAR迭代的良好描述符
因为它们在一个小空间里包含了更多的化学信息,
结构-活性模型相对于常规描述符更加精简。由此产生的模型将
通过交互式视觉界面(人工指导)或后端探索多维空间
算法不断搜索新的和更好的结构(机器指导)。交互和
自动化流程将重新连接到ASPIRE自动化周期中,
合成和测量(假设评估和迭代优化)。
第二个模块是机器可读的反应,它借鉴了我们开发
BioHarmony Annotator(前身为BioAssay Express),使用自然语言模型分配语义
将本体术语转换为生物测定协议,将它们从非结构化文本转换为机器可读数据。
从协议和化学结构图中提取反应的全部内容非常困难
考虑到图表中的文本、缩写、快捷方式和假设的非结构化性质。是
由于需要将方案中的材料与反应文本描述(例如,
试剂、溶剂、配方中涉及的序列、反应后处理和产物表征)。作为
或者,我们将模块化的CDD化学计量素描,这将使我们能够提取这些数据。我们将
与NCATS合作,确定要捕获的重要字段,创建机器可读的化学反应
template.
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
BARRY A BUNIN其他文献
BARRY A BUNIN的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('BARRY A BUNIN', 18)}}的其他基金
Automated Molecular Identity Disambiguator (AutoMID)
自动分子身份消歧器 (AutoMID)
- 批准号:
10357906 - 财政年份:2020
- 资助金额:
$ 44万 - 项目类别:
Automated Molecular Identity Disambiguator (AutoMID)
自动分子身份消歧器 (AutoMID)
- 批准号:
10569639 - 财政年份:2020
- 资助金额:
$ 44万 - 项目类别:
Intelligent Chemical Structure Browser for Drug Discovery and Optimization
用于药物发现和优化的智能化学结构浏览器
- 批准号:
10241834 - 财政年份:2019
- 资助金额:
$ 44万 - 项目类别:
A Robust, Secure Framework to Effortlessly Bind Distributed Databases and Analysis Tools into Tightly Integrated Translational Drug Discovery Computational Platforms
一个强大、安全的框架,可以轻松地将分布式数据库和分析工具绑定到紧密集成的转化药物发现计算平台中
- 批准号:
10484172 - 财政年份:2019
- 资助金额:
$ 44万 - 项目类别:
Digital representation of chemical mixtures to aid drug discovery and formulation
化学混合物的数字表示以帮助药物发现和配制
- 批准号:
9902210 - 财政年份:2019
- 资助金额:
$ 44万 - 项目类别:
A Robust, Secure Framework to Effortlessly Bind Distributed Databases and Analysis Tools into Tightly Integrated Translational Drug Discovery Computational Platforms
一个强大、安全的框架,可以轻松地将分布式数据库和分析工具绑定到紧密集成的转化药物发现计算平台中
- 批准号:
10685358 - 财政年份:2019
- 资助金额:
$ 44万 - 项目类别:
Intelligent Chemical Structure Browser for Drug Discovery and Optimization
用于药物发现和优化的智能化学结构浏览器
- 批准号:
10386918 - 财政年份:2019
- 资助金额:
$ 44万 - 项目类别:
Novel deep learning strategy to better predict pharmacological properties of candidate drugs and focus discovery efforts
新颖的深度学习策略可以更好地预测候选药物的药理学特性并集中发现工作
- 批准号:
10133177 - 财政年份:2018
- 资助金额:
$ 44万 - 项目类别:
Novel deep learning strategy to better predict pharmacological properties of candidate drugs and focus discovery efforts
新颖的深度学习策略可以更好地预测候选药物的药理学特性并集中发现工作
- 批准号:
10004481 - 财政年份:2018
- 资助金额:
$ 44万 - 项目类别:
相似海外基金
Rational design of rapidly translatable, highly antigenic and novel recombinant immunogens to address deficiencies of current snakebite treatments
合理设计可快速翻译、高抗原性和新型重组免疫原,以解决当前蛇咬伤治疗的缺陷
- 批准号:
MR/S03398X/2 - 财政年份:2024
- 资助金额:
$ 44万 - 项目类别:
Fellowship
Re-thinking drug nanocrystals as highly loaded vectors to address key unmet therapeutic challenges
重新思考药物纳米晶体作为高负载载体以解决关键的未满足的治疗挑战
- 批准号:
EP/Y001486/1 - 财政年份:2024
- 资助金额:
$ 44万 - 项目类别:
Research Grant
CAREER: FEAST (Food Ecosystems And circularity for Sustainable Transformation) framework to address Hidden Hunger
职业:FEAST(食品生态系统和可持续转型循环)框架解决隐性饥饿
- 批准号:
2338423 - 财政年份:2024
- 资助金额:
$ 44万 - 项目类别:
Continuing Grant
Metrology to address ion suppression in multimodal mass spectrometry imaging with application in oncology
计量学解决多模态质谱成像中的离子抑制问题及其在肿瘤学中的应用
- 批准号:
MR/X03657X/1 - 财政年份:2024
- 资助金额:
$ 44万 - 项目类别:
Fellowship
CRII: SHF: A Novel Address Translation Architecture for Virtualized Clouds
CRII:SHF:一种用于虚拟化云的新型地址转换架构
- 批准号:
2348066 - 财政年份:2024
- 资助金额:
$ 44万 - 项目类别:
Standard Grant
The Abundance Project: Enhancing Cultural & Green Inclusion in Social Prescribing in Southwest London to Address Ethnic Inequalities in Mental Health
丰富项目:增强文化
- 批准号:
AH/Z505481/1 - 财政年份:2024
- 资助金额:
$ 44万 - 项目类别:
Research Grant
ERAMET - Ecosystem for rapid adoption of modelling and simulation METhods to address regulatory needs in the development of orphan and paediatric medicines
ERAMET - 快速采用建模和模拟方法的生态系统,以满足孤儿药和儿科药物开发中的监管需求
- 批准号:
10107647 - 财政年份:2024
- 资助金额:
$ 44万 - 项目类别:
EU-Funded
BIORETS: Convergence Research Experiences for Teachers in Synthetic and Systems Biology to Address Challenges in Food, Health, Energy, and Environment
BIORETS:合成和系统生物学教师的融合研究经验,以应对食品、健康、能源和环境方面的挑战
- 批准号:
2341402 - 财政年份:2024
- 资助金额:
$ 44万 - 项目类别:
Standard Grant
Ecosystem for rapid adoption of modelling and simulation METhods to address regulatory needs in the development of orphan and paediatric medicines
快速采用建模和模拟方法的生态系统,以满足孤儿药和儿科药物开发中的监管需求
- 批准号:
10106221 - 财政年份:2024
- 资助金额:
$ 44万 - 项目类别:
EU-Funded
Recite: Building Research by Communities to Address Inequities through Expression
背诵:社区开展研究,通过表达解决不平等问题
- 批准号:
AH/Z505341/1 - 财政年份:2024
- 资助金额:
$ 44万 - 项目类别:
Research Grant