EAGER: Collaborative Research: Evaluating Identifier Services for the Life Cycle of Biological Data
EAGER:协作研究:评估生物数据生命周期的标识符服务
基本信息
- 批准号:1555458
- 负责人:
- 金额:$ 23.46万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2015
- 资助国家:美国
- 起止时间:2015-09-01 至 2018-02-28
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Unique identifiers are key to current and future access to and use of research data, which are often distributed across a landscape of storage and analysis resources, publishing platforms, and repository services. In the biology domain, researchers and data managers have expressed the need to use identifiers from the moment of data creation and throughout the research lifecycle. A wide range of methods are used in the biological sciences to produce many different kinds of data, which may require the application of different types of identifiers to make connections between physical samples, digital data, analysis, and publications. This project will develop and evaluate proof-of-concept and prototype services with particular focus on DNA/RNA sequence data. This will expand on data modeling work done as part of the iPlant Data Commons, using real world biology datasets from iPlant, the Texas Advanced Computing Center (TACC), and the National Ecological Observatory Network (NEON). Project results will be disseminated across the biology and information science communities. Software generated during this project will be maintained in an open source software repository for further development by the community. Some of these products will benefit smaller organizations that provide repository services but have limited software development staffing. This research will inform the development of similar services for different data types and in other domains dealing with issues identification through a project's lifecycle.There is a growing need for services to verify, track, and report events (i.e. provenance) in relation to identified datasets over time. Such services should start as early as possible in the life of a research project and be as much as possible automated. Much of the current research and development around digital identifiers focuses on facilitating data citation and discovery post-publication. This project will address problems arising for large, dispersed, biology datasets and changing events. Instead of assigning identifiers only at the last stage for curated datasets, usage of different identifiers are assessed throughout the continuum of data management, publication, archiving, and reuse. A prototype identifier infrastructure for identifiers management, permutation, and data validation/authentication across time. The implementation and evaluation of these services will test the use of identifiers beyond the "data publication stage," to connect dispersed data objects as they transition through the continuum of data management, publication, and archiving. This project will develop and evaluate a set of proof of concepts/prototypes to 1) model identifiers to the lifecycle management of bio data including their transition into global, unique and persistent identifiers; 2) conduct automated verification of the data linked to those identifiers to track presence at registered locations and integrity and identity over time; and 3) assess how collection creators use identifiers and respond to identifier services.
唯一标识符是当前和未来访问和使用研究数据的关键,这些数据通常分布在存储和分析资源、发布平台和存储库服务的环境中。在生物学领域,研究人员和数据管理人员表示需要从数据创建的那一刻起以及在整个研究生命周期中使用标识符。在生物科学中使用各种方法来产生许多不同类型的数据,这可能需要应用不同类型的标识符来建立物理样本,数字数据,分析和出版物之间的联系。该项目将开发和评估概念验证和原型服务,特别关注DNA/RNA序列数据。这将扩展作为iPlant Data Commons的一部分所做的数据建模工作,使用来自iPlant,德克萨斯州高级计算中心(TACC)和国家生态观测网络(氖)的真实的世界生物数据集。项目成果将在生物学和信息科学界传播。该项目期间生成的软件将保存在一个开放源码软件库中,供社区进一步开发。其中一些产品将使提供存储库服务但软件开发人员有限的小型组织受益。这项研究将为不同数据类型的类似服务的开发提供信息,并在其他领域处理项目生命周期中的问题识别。随着时间的推移,越来越需要服务来验证,跟踪和报告与已识别数据集相关的事件(即出处)。这种服务应在研究项目的生命周期中尽早开始,并尽可能自动化。目前围绕数字标识符的大部分研究和开发都集中在促进数据引用和出版后发现。该项目将解决大型、分散的生物数据集和不断变化的事件所产生的问题。不是仅在最后阶段为策展数据集分配标识符,而是在数据管理、发布、存档和重用的整个过程中评估不同标识符的使用。用于跨时间的标识符管理、置换和数据验证/认证的原型标识符基础设施。这些服务的实施和评价将测试在“数据公布阶段”之后使用标识符的情况,以便在分散的数据对象通过数据管理、公布和存档的连续体过渡时将其连接起来。该项目将开发和评估一套概念验证/原型,以:1)为生物数据的生命周期管理建立标识符模型,包括将其转换为全球、唯一和持久的标识符; 2)对与这些标识符相关联的数据进行自动验证,以跟踪在注册地点的存在情况以及随着时间的推移的完整性和身份;以及3)评估集合创建者如何使用标识符并响应标识符服务。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Maria Esteva其他文献
Special Issue on Cyberinfrastructure, Machine Learning, and Digital Library
- DOI:
10.2478/dim-2019-0007 - 发表时间:
2019-03-01 - 期刊:
- 影响因子:
- 作者:
Weijia Xu;Maria Esteva;Jessica Trelogan;Dan Wu - 通讯作者:
Dan Wu
Identifier Services: Modeling and Implementing Distributed Data Management in Cyberinfrastructure
- DOI:
10.2478/dim-2019-0002 - 发表时间:
2019-03-01 - 期刊:
- 影响因子:
- 作者:
Maria Esteva;Ramona L. Walls;Andrew B. Magill;Weijia Xu;Ruizhu Huang;James Carson;Jawon Song - 通讯作者:
Jawon Song
Maria Esteva的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
相似海外基金
Collaborative Research: EAGER: IMPRESS-U: Groundwater Resilience Assessment through iNtegrated Data Exploration for Ukraine (GRANDE-U)
合作研究:EAGER:IMPRESS-U:通过乌克兰综合数据探索进行地下水恢复力评估 (GRANDE-U)
- 批准号:
2409395 - 财政年份:2024
- 资助金额:
$ 23.46万 - 项目类别:
Standard Grant
EAGER/Collaborative Research: An LLM-Powered Framework for G-Code Comprehension and Retrieval
EAGER/协作研究:LLM 支持的 G 代码理解和检索框架
- 批准号:
2347624 - 财政年份:2024
- 资助金额:
$ 23.46万 - 项目类别:
Standard Grant
EAGER/Collaborative Research: Revealing the Physical Mechanisms Underlying the Extraordinary Stability of Flying Insects
EAGER/合作研究:揭示飞行昆虫非凡稳定性的物理机制
- 批准号:
2344215 - 财政年份:2024
- 资助金额:
$ 23.46万 - 项目类别:
Standard Grant
Collaborative Research: EAGER: Designing Nanomaterials to Reveal the Mechanism of Single Nanoparticle Photoemission Intermittency
合作研究:EAGER:设计纳米材料揭示单纳米粒子光电发射间歇性机制
- 批准号:
2345581 - 财政年份:2024
- 资助金额:
$ 23.46万 - 项目类别:
Standard Grant
Collaborative Research: EAGER: Designing Nanomaterials to Reveal the Mechanism of Single Nanoparticle Photoemission Intermittency
合作研究:EAGER:设计纳米材料揭示单纳米粒子光电发射间歇性机制
- 批准号:
2345582 - 财政年份:2024
- 资助金额:
$ 23.46万 - 项目类别:
Standard Grant
Collaborative Research: EAGER: Designing Nanomaterials to Reveal the Mechanism of Single Nanoparticle Photoemission Intermittency
合作研究:EAGER:设计纳米材料揭示单纳米粒子光电发射间歇性机制
- 批准号:
2345583 - 财政年份:2024
- 资助金额:
$ 23.46万 - 项目类别:
Standard Grant
Collaborative Research: EAGER: The next crisis for coral reefs is how to study vanishing coral species; AUVs equipped with AI may be the only tool for the job
合作研究:EAGER:珊瑚礁的下一个危机是如何研究正在消失的珊瑚物种;
- 批准号:
2333604 - 财政年份:2024
- 资助金额:
$ 23.46万 - 项目类别:
Standard Grant
Collaborative Research: EAGER: Energy for persistent sensing of carbon dioxide under near shore waves.
合作研究:EAGER:近岸波浪下持续感知二氧化碳的能量。
- 批准号:
2339062 - 财政年份:2024
- 资助金额:
$ 23.46万 - 项目类别:
Standard Grant
Collaborative Research: EAGER: The next crisis for coral reefs is how to study vanishing coral species; AUVs equipped with AI may be the only tool for the job
合作研究:EAGER:珊瑚礁的下一个危机是如何研究正在消失的珊瑚物种;
- 批准号:
2333603 - 财政年份:2024
- 资助金额:
$ 23.46万 - 项目类别:
Standard Grant
EAGER/Collaborative Research: An LLM-Powered Framework for G-Code Comprehension and Retrieval
EAGER/协作研究:LLM 支持的 G 代码理解和检索框架
- 批准号:
2347623 - 财政年份:2024
- 资助金额:
$ 23.46万 - 项目类别:
Standard Grant