Continued development of the ChEBI database and ontology for improved interoperability with biomedical resources

持续开发 ChEBI 数据库和本体,以提高与生物医学资源的互操作性

基本信息

  • 批准号:
    BB/G022747/1
  • 负责人:
  • 金额:
    $ 69.1万
  • 依托单位:
  • 依托单位国家:
    英国
  • 项目类别:
    Research Grant
  • 财政年份:
    2009
  • 资助国家:
    英国
  • 起止时间:
    2009 至 无数据
  • 项目状态:
    已结题

项目摘要

Today, the biological sciences are generating an enormous amount of data aimed at tackling fundamental questions such as 'What is the molecular basis for life?', 'How do organisms work?' and 'How does disease arise and how can it be treated?'. While research in the past has often followed a reductionist approach - studying the parts to understand the whole - today it increasingly follows a systems approach, integrating insights from the past into a holistic model of an organism (systems biology). The goal is to use such a model to perform a computer simulation of the organism and to use this to answer questions such as 'What effect will the addition of compound X (for example a drug) have on the organism?'. Isolated approaches to organizing the data in particular fields in the molecular sciences - information on genes (the code of life), proteins (an organism's chemical factories) and small molecules such as sugars or drugs - will hamper synergistic insights. Instead, databases in the biological sciences, which are used to parametrize such simulations, need to be interlinked and interoperable, allowing seamless movement amongst them. Because biological databases are generated on a worldwide basis by diverse communities, their integration creates obvious challenges. Besides simple technical questions of interoperability, the bioscience community is therefore working on common data models and standards. Scientists create rules on how to name and encode scientific information in a computer (semantics) and how a particular piece of such information relates to the scientific concepts in its surroundings (ontologies). This results in ontological chains such as 'A fox is_a mammal, which again is_a animal', from which the computer automatically reasons that a fox is an animal, even if this is not explicitly stated. Besides the so-called 'is_a' relationship between entities in ontologies, there exists a whole range of other relationships such as 'is_part_of', but which may be relevant only in certain fields of knowledge. Since ontologies can be complex and those of neighbouring fields may be interlinked, they allow machines to reason about the world. The database Chemical Entities of Biological Interest (ChEBI) provides for the bioscientific community semantic and ontological information as well as stable identifiers for small chemical compounds (as are most drugs). Areas such as drug discovery or systems biology bring together information about the morphology of cells, genes and proteins, as well as the small molecules that act on these. The interlinking between these bits of information in databases is typically performed through stable identifiers assigned to entities such as single genes, proteins or small molecules by standardization bodies and database providers. In addition, formal and so-called 'trivial' names are assigned and associated with both the entity and the stable identifier in the database. ChEBI acts as a resource for such names and stable identifiers in the area of small molecules of biological interest. For this purpose it is widely used in the bioscience community, who send formal requests for the assignment of identifiers for particular small molecule entities to the ChEBI team, who then perform the assignment, publish the information into the public domain and inform the requesting party that the request has been fulfilled. Also acting as an ontology, ChEBI puts small molecule structures and their structural properties into an ontological context. It makes statements such as 'D-Glucose is_a D-aldohexose, which is_a ... [various is_a relationships omitted] ... which is_a monosaccharide, which is_a sugar.' Again, ontological chains such as the one above allow computers to make statements about the world (of chemistry in this case), which have not been explicitly coded elsewhere. This is useful, for example, in the field of text mining, the computer-based re-discovery of knowledge in the printed literature.
今天,生物科学正在产生大量的数据,旨在解决基本问题,如“生命的分子基础是什么?”生物体是如何工作的?疾病是如何产生的,如何治疗?'.虽然过去的研究经常遵循还原论的方法-研究部分以了解整体-但今天它越来越多地遵循系统方法,将过去的见解整合到有机体的整体模型(系统生物学)中。目标是使用这样一个模型来执行生物体的计算机模拟,并使用它来回答诸如“添加化合物X(例如药物)对生物体有什么影响?'.在分子科学的特定领域-基因(生命的密码)、蛋白质(有机体的化学工厂)和糖或药物等小分子的信息-组织数据的孤立方法将妨碍协同洞察力。相反,生物科学中用于参数化此类模拟的数据库需要相互关联和互操作,允许它们之间的无缝移动。由于生物数据库是由不同的社区在世界范围内建立的,它们的整合带来了明显的挑战。因此,除了互操作性的简单技术问题外,生物科学界正在研究共同的数据模型和标准。科学家们创建了关于如何在计算机中命名和编码科学信息的规则(语义学),以及这些信息的特定片段如何与其周围的科学概念(本体论)相关。这导致了本体链,例如“狐狸是哺乳动物,又是动物”,计算机自动推理狐狸是动物,即使没有明确说明。除了本体中实体之间所谓的“is_a”关系之外,还存在一系列其他关系,例如“is_part_of”,但这些关系可能仅在某些知识领域中相关。由于本体可以是复杂的,并且相邻领域的本体可能是相互关联的,因此它们允许机器对世界进行推理。生物学感兴趣的化学实体数据库(ChEBI)为生物科学界提供语义和本体信息以及小化学化合物(如大多数药物)的稳定标识符。药物发现或系统生物学等领域汇集了有关细胞,基因和蛋白质形态的信息,以及作用于这些的小分子。数据库中这些信息位之间的互连通常通过标准化机构和数据库提供商分配给单基因、蛋白质或小分子等实体的稳定标识符来执行。此外,正式的和所谓的“平凡”的名称被分配并与数据库中的实体和稳定标识符相关联。ChEBI作为生物学感兴趣的小分子领域的此类名称和稳定标识符的资源。为此,它在生物科学界广泛使用,他们向ChEBI团队发送分配特定小分子实体标识符的正式请求,然后ChEBI团队执行分配,将信息发布到公共领域并通知请求方请求已得到满足。ChEBI还作为一个本体论,将小分子结构及其结构特性置于本体论背景下。它作出声明,如'D-葡萄糖是一种D-己醛糖,这是一种... [各种关系省略].它是一种单糖,也就是糖。再一次,像上面这样的本体链允许计算机对世界(在这种情况下是化学)做出陈述,这些陈述在其他地方没有被明确编码。例如,在文本挖掘领域,这是有用的,基于计算机的印刷文献中知识的重新发现。

项目成果

期刊论文数量(8)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Structure-based classification and ontology in chemistry
化学中基于结构的分类和本体论
  • DOI:
    10.3929/ethz-b-000049483
  • 发表时间:
    2012
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Hastings, Janna
  • 通讯作者:
    Hastings, Janna
The chemical information ontology: provenance and disambiguation for chemical data on the biological semantic web.
  • DOI:
    10.1371/journal.pone.0025513
  • 发表时间:
    2011
  • 期刊:
  • 影响因子:
    3.7
  • 作者:
    Hastings J;Chepelev L;Willighagen E;Adams N;Steinbeck C;Dumontier M
  • 通讯作者:
    Dumontier M
OntoQuery: easy-to-use web-based OWL querying.
OntoQuery:易于使用的基于 Web 的 OWL 查询。
PIDO: the primary immunodeficiency disease ontology.
PIDO:初级免疫缺陷疾病本体。
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Christoph Steinbeck其他文献

Userscripts for the Life Sciences
  • DOI:
    10.1186/1471-2105-8-487
  • 发表时间:
    2007-12-21
  • 期刊:
  • 影响因子:
    3.300
  • 作者:
    Egon L Willighagen;Noel M O'Boyle;Harini Gopalakrishnan;Dazhi Jiao;Rajarshi Guha;Christoph Steinbeck;David J Wild
  • 通讯作者:
    David J Wild
So what have data standards ever done for us? The view from metabolomics
  • DOI:
    10.1186/gm159
  • 发表时间:
    2010-01-01
  • 期刊:
  • 影响因子:
    11.200
  • 作者:
    Julian L Griffin;Christoph Steinbeck
  • 通讯作者:
    Christoph Steinbeck
Molecular fragments chemoinformatics
  • DOI:
    10.1186/1758-2946-2-s1-p14
  • 发表时间:
    2010-05-01
  • 期刊:
  • 影响因子:
    5.700
  • 作者:
    Hubert Kuhn;Stefan Neumann;Christoph Steinbeck;Carsten Wittekindt;Achim Zielesny
  • 通讯作者:
    Achim Zielesny
STOUT V2.0: SMILES to IUPAC name conversion using transformer models
  • DOI:
    10.1186/s13321-024-00941-x
  • 发表时间:
    2024-12-27
  • 期刊:
  • 影响因子:
    5.700
  • 作者:
    Kohulan Rajan;Achim Zielesny;Christoph Steinbeck
  • 通讯作者:
    Christoph Steinbeck
Twenty years of nmrshiftdb2: A case study of an open database for analytical chemistry.
nmrshiftdb2 二十年:分析化学开放数据库的案例研究。
  • DOI:
  • 发表时间:
    2023
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Stefan Kuhn;Heinz Kolshorn;Christoph Steinbeck;Nils E Schlörer
  • 通讯作者:
    Nils E Schlörer

Christoph Steinbeck的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Christoph Steinbeck', 18)}}的其他基金

Next Generation Tools For The Identification Of Metabolites In Global Metabolomic Studies (Lead application: BB/N023013/1)
全球代谢组学研究中代谢物鉴定的下一代工具(主导申请:BB/N023013/1)
  • 批准号:
    BB/N023242/1
  • 财政年份:
    2016
  • 资助金额:
    $ 69.1万
  • 项目类别:
    Research Grant
Sharing of metabolomics data and their analyses as Galaxy workflows through a UK-China collaboration
通过英中合作,以 Galaxy 工作流程共享代谢组学数据及其分析
  • 批准号:
    BB/M027635/1
  • 财政年份:
    2015
  • 资助金额:
    $ 69.1万
  • 项目类别:
    Research Grant
A comprehensive online spectra analysis and visualisation tool for the OMICS sciences
适用于 OMICS 科学的综合在线光谱分析和可视化工具
  • 批准号:
    BB/L018721/1
  • 财政年份:
    2014
  • 资助金额:
    $ 69.1万
  • 项目类别:
    Research Grant
Metabo
麦太保
  • 批准号:
    BB/L024152/1
  • 财政年份:
    2014
  • 资助金额:
    $ 69.1万
  • 项目类别:
    Research Grant
Closing the gaps in metabolomics - Identifying unknown metabolites and mapping onto biochemical pathways
缩小代谢组学的差距 - 识别未知代谢物并绘制生化途径
  • 批准号:
    BB/K004301/1
  • 财政年份:
    2012
  • 资助金额:
    $ 69.1万
  • 项目类别:
    Research Grant
BBSRC Industrial CASE Partnership Grant
BBSRC 工业案例合作伙伴资助
  • 批准号:
    BB/I532153/1
  • 财政年份:
    2010
  • 资助金额:
    $ 69.1万
  • 项目类别:
    Training Grant
MetaboLights: Creating the missing Metabolomics community resource
MetaboLights:创建缺失的代谢组学社区资源
  • 批准号:
    BB/I000933/1
  • 财政年份:
    2010
  • 资助金额:
    $ 69.1万
  • 项目类别:
    Research Grant

相似国自然基金

损伤线粒体传递机制介导成纤维细胞/II型肺泡上皮细胞对话在支气管肺发育不良肺泡发育阻滞中的作用
  • 批准号:
    82371721
  • 批准年份:
    2023
  • 资助金额:
    49.00 万元
  • 项目类别:
    面上项目
增强子在小鼠早期胚胎细胞命运决定中的功能和调控机制研究
  • 批准号:
    82371668
  • 批准年份:
    2023
  • 资助金额:
    52.00 万元
  • 项目类别:
    面上项目
MAP2的m6A甲基化在七氟烷引起SST神经元树突发育异常及精细运动损伤中的作用机制研究
  • 批准号:
    82371276
  • 批准年份:
    2023
  • 资助金额:
    47.00 万元
  • 项目类别:
    面上项目
"胚胎/生殖细胞发育特性激活”促进“神经胶质瘤恶变”的机制及其临床价值研究
  • 批准号:
    82372327
  • 批准年份:
    2023
  • 资助金额:
    49.00 万元
  • 项目类别:
    面上项目
Irisin通过整合素调控黄河鲤肌纤维发育的分子机制研究
  • 批准号:
    32303019
  • 批准年份:
    2023
  • 资助金额:
    30.00 万元
  • 项目类别:
    青年科学基金项目
TMEM30A介导的磷脂酰丝氨酸外翻促进毛细胞-SGN突触发育成熟的机制研究
  • 批准号:
    82371172
  • 批准年份:
    2023
  • 资助金额:
    49.00 万元
  • 项目类别:
    面上项目
HER2特异性双抗原表位识别诊疗一体化探针研制与临床前诊疗效能研究
  • 批准号:
    82372014
  • 批准年份:
    2023
  • 资助金额:
    48.00 万元
  • 项目类别:
    面上项目
水稻边界发育缺陷突变体abnormal boundary development(abd)的基因克隆与功能分析
  • 批准号:
    32070202
  • 批准年份:
    2020
  • 资助金额:
    58 万元
  • 项目类别:
    面上项目
Development of a Linear Stochastic Model for Wind Field Reconstruction from Limited Measurement Data
  • 批准号:
  • 批准年份:
    2020
  • 资助金额:
    40 万元
  • 项目类别:
细胞核分布基因NudCL2在细胞迁移及小鼠胚胎发育过程中的作用及机制研究
  • 批准号:
    31701214
  • 批准年份:
    2017
  • 资助金额:
    25.0 万元
  • 项目类别:
    青年科学基金项目

相似海外基金

Development of a new solid tritium breeder blanket
新型固体氚增殖毯的研制
  • 批准号:
    2908923
  • 财政年份:
    2027
  • 资助金额:
    $ 69.1万
  • 项目类别:
    Studentship
Optimal utility-based design of oncology clinical development programmes
基于效用的肿瘤学临床开发项目的优化设计
  • 批准号:
    2734768
  • 财政年份:
    2026
  • 资助金额:
    $ 69.1万
  • 项目类别:
    Studentship
REU Site: Microbial Biofilm Development, Resistance, & Community Structure
REU 网站:微生物生物膜的发展、耐药性、
  • 批准号:
    2349311
  • 财政年份:
    2025
  • 资助金额:
    $ 69.1万
  • 项目类别:
    Continuing Grant
SoundDecisions - Musical Listening, Decision Making, And Equitable Development In The Mekong Delta
SoundDecisions - 湄公河三角洲的音乐聆听、决策和公平发展
  • 批准号:
    EP/Z000424/1
  • 财政年份:
    2025
  • 资助金额:
    $ 69.1万
  • 项目类别:
    Research Grant
Bio-MATSUPER: Development of high-performance supercapacitors based on bio-based carbon materials
Bio-MATSUPER:开发基于生物基碳材料的高性能超级电容器
  • 批准号:
    EP/Z001013/1
  • 财政年份:
    2025
  • 资助金额:
    $ 69.1万
  • 项目类别:
    Fellowship
NSF Engines Development Award: Utilizing space research, development and manufacturing to improve the human condition (OH)
NSF 发动机发展奖:利用太空研究、开发和制造来改善人类状况(OH)
  • 批准号:
    2314750
  • 财政年份:
    2024
  • 资助金额:
    $ 69.1万
  • 项目类别:
    Cooperative Agreement
NSF Engines Development Award: Building an sustainable plastics innovation ecosystem in the Midwest (MN, IL)
NSF 引擎发展奖:在中西部(明尼苏达州、伊利诺伊州)建立可持续塑料创新生态系统
  • 批准号:
    2315247
  • 财政年份:
    2024
  • 资助金额:
    $ 69.1万
  • 项目类别:
    Cooperative Agreement
CAREER: Development of New Gas-Releasing Molecules Using a Thiol Carrier
职业:利用硫醇载体开发新型气体释放分子
  • 批准号:
    2338835
  • 财政年份:
    2024
  • 资助金额:
    $ 69.1万
  • 项目类别:
    Continuing Grant
FDSS Track 1: A New Paradigm for Faculty Development in Geospace Science at Georgia Tech
FDSS Track 1:佐治亚理工学院地球空间科学教师发展的新范式
  • 批准号:
    2347873
  • 财政年份:
    2024
  • 资助金额:
    $ 69.1万
  • 项目类别:
    Continuing Grant
Cultivating Persistence and Leadership Development in Science Master Teachers
培养科学名师的毅力和领导力发展
  • 批准号:
    2345144
  • 财政年份:
    2024
  • 资助金额:
    $ 69.1万
  • 项目类别:
    Continuing Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了