Linking data with Identifiers.org

将数据与 Identifiers.org 链接

基本信息

  • 批准号:
    BB/K016946/1
  • 负责人:
  • 金额:
    $ 15.22万
  • 依托单位:
  • 依托单位国家:
    英国
  • 项目类别:
    Research Grant
  • 财政年份:
    2013
  • 资助国家:
    英国
  • 起止时间:
    2013 至 无数据
  • 项目状态:
    已结题

项目摘要

Annotating data, life science datasets with cross-references to other sources of knowledge has always be very important. These metadata are often what separate valuable information from heaps of unusable data. With the advent of systems biology, the size and complexity of datasets shifted the balance from direct human interaction to automated computer processing. Such operations are greatly facilitated if the metadata is encoded following standard procedures and using controlled vocabularies. If those procedures and vocabularies are shared between different types of data, it becomes possible to align, compare and integrate different datasets. A key part of any cross-reference is the identifier of the resource it points to. This identifier must be unique, perennial, resolvable and free. Most data providers create identifiers for their own records; for example '9606' identifies 'Homo sapiens' in the Taxonomy, and '22140103' identifies the latest publication about Identifiers.org in PubMed. However, those identifiers are only unique within a given dataset so their usefulness is limited when considering records in a wider context.Identifiers.org provides such global identifiers, and resolves them to the relevant dataset. In order to achieve this purpose, it uses the information recorded in the MIRIAM Registry (http://www.ebi.ac.uk/miriam/). Therefore both projects provide a distinct part of the final technical solution. Identifiers generated with the Registry make use of the accession numbers supplied by data providers, but also contain information about the collection they come from. All identifiers are unique, resolvable and robust. They allow persons or software tools to directly access the identified pieces of data on the web, via alternative providers. Although a prototype, Identifiers.org has been adopted by a number of communities and projects, as it fulfils their need for perennial cross-references and removes their previous need for maintaining and keeping up to date long lists of ever changing web links (or URLs).As more and more communities realise the benefits of using Identifiers.org URIs, new needs and use cases have appeared. This proposal seeks to strengthen and extend the services provided by the resource in order to respond to those new user requests. We will make the resource easier to use in automated procedures, specially for semantic web applications. This involves providing the content of the Registry in Resource Description Framework (RDF) format and supply a SPARQL endpoint for query purposes. Users will be able to fine-tuned the way identifiers are resolved, via the creation of 'profiles', that will record their preferences. The resource will allow the communities (more specially the data providers themselves) to get involved in the maintenance of the Registry. This will take place via a system of "ownership" by data providers of their record in the Registry. Although we currently have automatic systems in place to detect obsolete information, having the actual data providers contributing to the maintenance would ensure better quality of the recorded information, meaning a better quality of the services provided. Finally we will improve and extend the underlying computing infrastructure. By deploying it in more more data centres, we will provide more reliable services to an ever growing number of users.The resulting resource will provide a way to seamlessly link all data annotated with the same URI to represent the same concept, a key step towards data integration. By providing a semantic glue between those datasets, Identifiers.org will facilitate data retrieval, comparison, integration, locally or through the semantic web. It will also facilitate the reasoning on the integrated datasets and lead to new, possibly automated discovery in the biomedical domain.
注释数据,生命科学数据集与其他知识来源的交叉引用一直非常重要。这些元数据通常是将有价值的信息与大量无用的数据分开的东西。随着系统生物学的出现,数据集的规模和复杂性将平衡从直接的人类交互转移到自动化计算机处理。如果元数据遵循标准过程并使用受控词汇表进行编码,则会极大地促进此类操作。如果这些程序和词汇表在不同类型的数据之间共享,就有可能调整、比较和整合不同的数据集。任何交叉引用的关键部分都是它所指向的资源的标识符。这个标识符必须是唯一的、永久的、可解析的和自由的。大多数数据提供商都会为自己的记录创建标识符;例如,“9606”标识分类中的“智人”,“22140103”标识PubMed中有关Identifiers.org的最新出版物。然而,这些标识符仅在给定的数据集中是唯一的,因此当考虑更广泛的www.example.com中的记录时,它们的有用性是有限context.Identifiers.org提供了这样的全局标识符,并将它们解析为相关的数据集。为了实现这一目的,它使用MIRIAM登记处(http://www.ebi.ac.uk/miriam/)中记录的信息。因此,这两个项目提供了最终技术解决方案的不同部分。登记处生成的标识符使用数据提供者提供的登录号,但也包含有关其来源的集合的信息。所有标识符都是唯一的、可解析的和鲁棒的。它们允许个人或软件工具通过替代提供商直接访问网络上已识别的数据。虽然只是一个原型,但Identifiers.org已经被许多社区和项目所采用,因为它满足了他们对常年交叉引用的需求,并消除了他们以前对维护和保持不断变化的Web链接(或URL)的最新长列表的需求。随着越来越多的社区意识到使用Identifiers.org URI的好处,新的需求和用例已经出现。这项提议旨在加强和扩大该资源提供的服务,以满足这些新用户的要求。我们将使资源更容易在自动化过程中使用,特别是语义Web应用程序。这涉及到以资源描述框架(RDF)格式提供注册表的内容,并提供一个SPARQL端点用于查询目的。用户将能够通过创建“配置文件”来微调标识符的解析方式,该配置文件将记录他们的偏好。该资源将使各社区(更具体地说是数据提供者本身)能够参与登记册的维护工作。这将通过数据提供者对其在登记册中的记录的“所有权”制度来实现。虽然我们目前有自动系统来检测过时的信息,但让实际的数据提供者参与维护将确保记录的信息质量更高,这意味着提供的服务质量更高。最后,我们将改进和扩展底层计算基础设施。通过在更多的数据中心部署它,我们将为越来越多的用户提供更可靠的服务。由此产生的资源将提供一种无缝链接所有使用相同URI注释的数据的方法,以表示相同的概念,这是迈向数据集成的关键一步。通过在这些数据集之间提供语义粘合剂,Identifiers.org将促进本地或通过语义网的数据检索、比较、集成。它还将促进对综合数据集的推理,并在生物医学领域带来新的、可能是自动化的发现。

项目成果

期刊论文数量(5)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
BioHackathon series in 2011 and 2012: penetration of ontology and linked data in life science domains.
  • DOI:
    10.1186/2041-1480-5-5
  • 发表时间:
    2014-02-05
  • 期刊:
  • 影响因子:
    1.9
  • 作者:
    Katayama T;Wilkinson MD;Aoki-Kinoshita KF;Kawashima S;Yamamoto Y;Yamaguchi A;Okamoto S;Kawano S;Kim JD;Wang Y;Wu H;Kano Y;Ono H;Bono H;Kocbek S;Aerts J;Akune Y;Antezana E;Arakawa K;Aranda B;Baran J;Bolleman J;Bonnal RJ;Buttigieg PL;Campbell MP;Chen YA;Chiba H;Cock PJ;Cohen KB;Constantin A;Duck G;Dumontier M;Fujisawa T;Fujiwara T;Goto N;Hoehndorf R;Igarashi Y;Itaya H;Ito M;Iwasaki W;Kalaš M;Katoda T;Kim T;Kokubu A;Komiyama Y;Kotera M;Laibe C;Lapp H;Lütteke T;Marshall MS;Mori T;Mori H;Morita M;Murakami K;Nakao M;Narimatsu H;Nishide H;Nishimura Y;Nystrom-Persson J;Ogishima S;Okamura Y;Okuda S;Oshita K;Packer NH;Prins P;Ranzinger R;Rocca-Serra P;Sansone S;Sawaki H;Shin SH;Splendiani A;Strozzi F;Tadaka S;Toukach P;Uchiyama I;Umezaki M;Vos R;Whetzel PL;Yamada I;Yamasaki C;Yamashita R;York WS;Zmasek CM;Kawamoto S;Takagi T
  • 通讯作者:
    Takagi T
BioModels: Content, Features, Functionality, and Use.
  • DOI:
    10.1002/psp4.3
  • 发表时间:
    2015-02
  • 期刊:
  • 影响因子:
    3.5
  • 作者:
    Juty, N;Ali, R;Glont, M;Keating, S;Rodriguez, N;Swat, M J;Wimalaratne, S M;Hermjakob, H;Le Novere, N;Laibe, C;Chelliah, V
  • 通讯作者:
    Chelliah, V
BioModels: ten-year anniversary.
  • DOI:
    10.1093/nar/gku1181
  • 发表时间:
    2015-01
  • 期刊:
  • 影响因子:
    14.9
  • 作者:
    Chelliah V;Juty N;Ajmera I;Ali R;Dumousseau M;Glont M;Hucka M;Jalowicki G;Keating S;Knight-Schrijver V;Lloret-Villas A;Natarajan KN;Pettit JB;Rodriguez N;Schubert M;Wimalaratne SM;Zhao Y;Hermjakob H;Le Novère N;Laibe C
  • 通讯作者:
    Laibe C
SPARQL-enabled identifier conversion with Identifiers.org.
  • DOI:
    10.1093/bioinformatics/btv064
  • 发表时间:
    2015-06-01
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Wimalaratne SM;Bolleman J;Juty N;Katayama T;Dumontier M;Redaschi N;Le Novère N;Hermjakob H;Laibe C
  • 通讯作者:
    Laibe C
BioModels linked dataset.
  • DOI:
    10.1186/s12918-014-0091-5
  • 发表时间:
    2014-08-15
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Wimalaratne SM;Grenon P;Hermjakob H;Le Novère N;Laibe C
  • 通讯作者:
    Laibe C
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Henning Hermjakob其他文献

Minimum information about a bioactive entity (MIABE)
生物活性实体的最小信息(MIABE)
  • DOI:
    10.1038/nrd3503
  • 发表时间:
    2011-08-31
  • 期刊:
  • 影响因子:
    101.800
  • 作者:
    Sandra Orchard;Bissan Al-Lazikani;Steve Bryant;Dominic Clark;Elizabeth Calder;Ian Dix;Ola Engkvist;Mark Forster;Anna Gaulton;Michael Gilson;Robert Glen;Martin Grigorov;Kim Hammond-Kosack;Lee Harland;Andrew Hopkins;Christopher Larminie;Nick Lynch;Romeena K. Mann;Peter Murray-Rust;Elena Lo Piparo;Christopher Southan;Christoph Steinbeck;David Wishart;Henning Hermjakob;John Overington;Janet Thornton
  • 通讯作者:
    Janet Thornton
Reactome - Pathway Context and Visualisation for Omics Data
  • DOI:
    10.1016/j.bpj.2018.11.1784
  • 发表时间:
    2019-02-15
  • 期刊:
  • 影响因子:
  • 作者:
    Henning Hermjakob
  • 通讯作者:
    Henning Hermjakob
An informatic pipeline for the data capture and submission of quantitative proteomic data using iTRAQ TM
  • DOI:
    10.1186/1477-5956-5-4
  • 发表时间:
    2007-02-01
  • 期刊:
  • 影响因子:
    1.600
  • 作者:
    Jennifer A Siepen;Neil Swainston;Andrew R Jones;Sarah R Hart;Henning Hermjakob;Philip Jones;Simon J Hubbard
  • 通讯作者:
    Simon J Hubbard
DAS Writeback: A Collaborative Annotation System
  • DOI:
    10.1186/1471-2105-12-143
  • 发表时间:
    2011-05-10
  • 期刊:
  • 影响因子:
    3.300
  • 作者:
    Gustavo A Salazar;Rafael C Jimenez;Alexander Garcia;Henning Hermjakob;Nicola Mulder;Edwin Blake
  • 通讯作者:
    Edwin Blake
Broadening the horizon – level 2.5 of the HUPO-PSI format for molecular interactions
  • DOI:
    10.1186/1741-7007-5-44
  • 发表时间:
    2007-10-09
  • 期刊:
  • 影响因子:
    4.500
  • 作者:
    Samuel Kerrien;Sandra Orchard;Luisa Montecchi-Palazzi;Bruno Aranda;Antony F Quinn;Nisha Vinod;Gary D Bader;Ioannis Xenarios;Jérôme Wojcik;David Sherman;Mike Tyers;John J Salama;Susan Moore;Arnaud Ceol;Andrew Chatr-aryamontri;Matthias Oesterheld;Volker Stümpflen;Lukasz Salwinski;Jason Nerothin;Ethan Cerami;Michael E Cusick;Marc Vidal;Michael Gilson;John Armstrong;Peter Woollard;Christopher Hogue;David Eisenberg;Gianni Cesareni;Rolf Apweiler;Henning Hermjakob
  • 通讯作者:
    Henning Hermjakob

Henning Hermjakob的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Henning Hermjakob', 18)}}的其他基金

2021BBSRC-NSF/BIO UniPlex - Genome-Wide Protein Complex Prediction and Validation
2021BBSRC-NSF/BIO UniPlex - 全基因组蛋白质复合物预测和验证
  • 批准号:
    BB/X002179/1
  • 财政年份:
    2023
  • 资助金额:
    $ 15.22万
  • 项目类别:
    Research Grant
Japan Partnering Award: Establishment of an Integrative proteomics bioinformatics platform to enable novel analysis approaches
日本合作奖:建立综合蛋白质组生物信息学平台以实现新颖的分析方法
  • 批准号:
    BB/N022440/1
  • 财政年份:
    2016
  • 资助金额:
    $ 15.22万
  • 项目类别:
    Research Grant
China Partnering Award: Proteomics Data Exchange
中国合作奖:蛋白质组学数据交换
  • 批准号:
    BB/N022432/1
  • 财政年份:
    2016
  • 资助金额:
    $ 15.22万
  • 项目类别:
    Research Grant
MultiMod, flexible management for multi-scale multi-approach models in biology
MultiMod,生物学中多尺度多方法模型的灵活管理
  • 批准号:
    BB/N019482/1
  • 财政年份:
    2016
  • 资助金额:
    $ 15.22万
  • 项目类别:
    Research Grant
MIDAS - Molecular Interaction Data Availability Standards
MIDAS - 分子相互作用数据可用性标准
  • 批准号:
    BB/L024179/1
  • 财政年份:
    2014
  • 资助金额:
    $ 15.22万
  • 项目类别:
    Research Grant
ProteoGenomics: Dynamic Linkage of Genomes and Proteomes through Ensembl and ProteomeXchange
ProteoGenomics:通过 Ensembl 和 ProteomeXchange 动态链接基因组和蛋白质组
  • 批准号:
    BB/L024225/1
  • 财政年份:
    2014
  • 资助金额:
    $ 15.22万
  • 项目类别:
    Research Grant
PROCESS - Proteomics data Collection, Software and Standards to support open access and long term management of data
PROCESS - 蛋白质组学数据收集、软件和标准,支持数据的开放获取和长期管理
  • 批准号:
    BB/K020145/1
  • 财政年份:
    2013
  • 资助金额:
    $ 15.22万
  • 项目类别:
    Research Grant
BioModels Database, the comprehensive resource for computational models in biology
BioModels 数据库,生物学计算模型的综合资源
  • 批准号:
    BB/J019305/1
  • 财政年份:
    2012
  • 资助金额:
    $ 15.22万
  • 项目类别:
    Research Grant
PRIDE Converter - Efficient Database Deposition of Mass Spectrometry Data
PRIDE Con​​verter - 质谱数据的高效数据库沉积
  • 批准号:
    BB/I024204/1
  • 财政年份:
    2012
  • 资助金额:
    $ 15.22万
  • 项目类别:
    Research Grant
An Integrated Open Source Software Resource for Quantitative Proteomics
用于定量蛋白质组学的集成开源软件资源
  • 批准号:
    BB/I000909/1
  • 财政年份:
    2010
  • 资助金额:
    $ 15.22万
  • 项目类别:
    Research Grant

相似国自然基金

Scalable Learning and Optimization: High-dimensional Models and Online Decision-Making Strategies for Big Data Analysis
  • 批准号:
  • 批准年份:
    2024
  • 资助金额:
    万元
  • 项目类别:
    合作创新研究团队
Data-driven Recommendation System Construction of an Online Medical Platform Based on the Fusion of Information
  • 批准号:
  • 批准年份:
    2024
  • 资助金额:
    万元
  • 项目类别:
    外国青年学者研究基金项目
Development of a Linear Stochastic Model for Wind Field Reconstruction from Limited Measurement Data
  • 批准号:
  • 批准年份:
    2020
  • 资助金额:
    40 万元
  • 项目类别:
基于高频信息下高维波动率矩阵估计及应用
  • 批准号:
    71901118
  • 批准年份:
    2019
  • 资助金额:
    18.0 万元
  • 项目类别:
    青年科学基金项目
半参数空间自回归面板模型的有效估计与应用研究
  • 批准号:
    71961011
  • 批准年份:
    2019
  • 资助金额:
    16.0 万元
  • 项目类别:
    地区科学基金项目
高频数据波动率统计推断、预测与应用
  • 批准号:
    71971118
  • 批准年份:
    2019
  • 资助金额:
    50.0 万元
  • 项目类别:
    面上项目
基于个体分析的投影式非线性非负张量分解在高维非结构化数据模式分析中的研究
  • 批准号:
    61502059
  • 批准年份:
    2015
  • 资助金额:
    19.0 万元
  • 项目类别:
    青年科学基金项目
基于Linked Open Data的Web服务语义互操作关键技术
  • 批准号:
    61373035
  • 批准年份:
    2013
  • 资助金额:
    77.0 万元
  • 项目类别:
    面上项目
体数据表达与绘制的新方法研究
  • 批准号:
    61170206
  • 批准年份:
    2011
  • 资助金额:
    55.0 万元
  • 项目类别:
    面上项目
一类新Regime-Switching模型及其在金融建模中的应用研究
  • 批准号:
    11061041
  • 批准年份:
    2010
  • 资助金额:
    24.0 万元
  • 项目类别:
    地区科学基金项目

相似海外基金

Dark Data from the White Continent: New Light on Five Decades of Vertebrate Paleontology Collections from the Triassic Fremouw Formation of Antarctica
来自白色大陆的暗数据:对南极洲三叠纪 Fremouw 组的五个十年的脊椎动物古生物学收藏的新认识
  • 批准号:
    2313242
  • 财政年份:
    2024
  • 资助金额:
    $ 15.22万
  • 项目类别:
    Standard Grant
CAREER: Data-Enabled Neural Multi-Step Predictive Control (DeMuSPc): a Learning-Based Predictive and Adaptive Control Approach for Complex Nonlinear Systems
职业:数据支持的神经多步预测控制(DeMuSPc):一种用于复杂非线性系统的基于学习的预测和自适应控制方法
  • 批准号:
    2338749
  • 财政年份:
    2024
  • 资助金额:
    $ 15.22万
  • 项目类别:
    Standard Grant
Collaborative Research: Constraining next generation Cascadia earthquake and tsunami hazard scenarios through integration of high-resolution field data and geophysical models
合作研究:通过集成高分辨率现场数据和地球物理模型来限制下一代卡斯卡迪亚地震和海啸灾害情景
  • 批准号:
    2325311
  • 财政年份:
    2024
  • 资助金额:
    $ 15.22万
  • 项目类别:
    Standard Grant
RII Track-4:@NASA: Wind-induced noise in the prospective seismic data measured in the Venusian surface environment
RII Track-4:@NASA:金星表面环境中测量的预期地震数据中的风致噪声
  • 批准号:
    2327422
  • 财政年份:
    2024
  • 资助金额:
    $ 15.22万
  • 项目类别:
    Standard Grant
RII Track-4:NSF: Physics-Informed Machine Learning with Organ-on-a-Chip Data for an In-Depth Understanding of Disease Progression and Drug Delivery Dynamics
RII Track-4:NSF:利用器官芯片数据进行物理信息机器学习,深入了解疾病进展和药物输送动力学
  • 批准号:
    2327473
  • 财政年份:
    2024
  • 资助金额:
    $ 15.22万
  • 项目类别:
    Standard Grant
Collaborative Research: EAGER: IMPRESS-U: Groundwater Resilience Assessment through iNtegrated Data Exploration for Ukraine (GRANDE-U)
合作研究:EAGER:IMPRESS-U:通过乌克兰综合数据探索进行地下水恢复力评估 (GRANDE-U)
  • 批准号:
    2409395
  • 财政年份:
    2024
  • 资助金额:
    $ 15.22万
  • 项目类别:
    Standard Grant
I-Corps: Translation Potential of a Secure Data Platform Empowering Artificial Intelligence Assisted Digital Pathology
I-Corps:安全数据平台的翻译潜力,赋能人工智能辅助数字病理学
  • 批准号:
    2409130
  • 财政年份:
    2024
  • 资助金额:
    $ 15.22万
  • 项目类别:
    Standard Grant
EAGER: Integrating Pathological Image and Biomedical Text Data for Clinical Outcome Prediction
EAGER:整合病理图像和生物医学文本数据进行临床结果预测
  • 批准号:
    2412195
  • 财政年份:
    2024
  • 资助金额:
    $ 15.22万
  • 项目类别:
    Standard Grant
Research Infrastructure: CC* Data Storage: Foundational Campus Research Storage for Digital Transformation
研究基础设施:CC* 数据存储:数字化转型的基础校园研究存储
  • 批准号:
    2346636
  • 财政年份:
    2024
  • 资助金额:
    $ 15.22万
  • 项目类别:
    Standard Grant
CC* Networking Infrastructure: YinzerNet: A Multi-Site Data and AI Driven Research Network
CC* 网络基础设施:YinzerNet:多站点数据和人工智能驱动的研究网络
  • 批准号:
    2346707
  • 财政年份:
    2024
  • 资助金额:
    $ 15.22万
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了