Improved metadata authoring to enhance AI/ML readiness of associated datasets
改进元数据创作,以增强相关数据集的 AI/ML 准备情况
基本信息
- 批准号:10592638
- 负责人:
- 金额:$ 27.45万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2021
- 资助国家:美国
- 起止时间:2021-05-01 至 2025-01-31
- 项目状态:未结题
- 来源:
- 关键词:7 year oldArchitectureAreaArtificial IntelligenceAwardBig Data to KnowledgeCOVID diagnosticCOVID-19 testingCollaborationsCommon Data ElementComputer softwareComputersDataData SetDevelopmentDiagnostic testsElementsEnsureFAIR principlesFundingGoalsGrantGuidelinesInfrastructureInstitutesInvestigationLibrariesMetadataMethodsModernizationNorth CarolinaOnline SystemsOntologyPlayProcessRADxReadinessRenaissanceReportingResearchResearch PersonnelResourcesRestRoleScientistSecureStandardizationSupplementationSystemTechniquesTechnologyUnited States National Institutes of HealthUniversitiesWorkcloud baseddata archivedata hubdata managementdata repositorydesignexperimental studyimprovedmetadata standardsnovel strategiesonline repositoryopen sourceparent grantprogramsrepositoryspellingsystem architecturetoolweb-accessible
项目摘要
PROJECT SUMMARY/ABSTRACT
This proposal is submitted to supplement grant R01 LM013498-01, “The Metadata Powerwash—Integrated
tools to make biomedical data FAIR.” The parent grant proposes to study AI methods to standardize the
metadata in online datasets to make the corresponding data findable, accessible, interoperable, and reusable,
and thus “AI-ready.” The goal of the parent grant is to transform the metadata that annotate experimental
datasets online to a form that adheres to formal reporting guidelines and that uses terms from standard
ontologies and common data elements from NIH repositories. The research depends on technology known as
CEDAR, which manages a library of metadata templates that correspond to reporting guidelines that define the
expected attribute–value pairs in standard metadata descriptions. The Metadata Powerwash uses these
CEDAR metadata templates to suggest what elements from standard reporting guidelines might have been
intended by the idiosyncratic entries that scientists often use when they author metadata. The CEDAR
technology, while widely used and extremely successful, is already 7 years old and in need of modernization.
Enhancements to CEDAR will have obvious benefits to the parent grant.
CEDAR uses its library of metadata templates to assist scientists when they author new metadata to describe
the datasets that result from their experiments. The system ensures that the new metadata are adherent to
appropriate standards whenever possible. CEDAR is slated to be included as part of the cloud-based Data
Hub for the NIH RADx program, which supports a wide range of studies in the area of diagnostic testing for
COVID-19. Unfortunately, CEDAR is not cloud-ready. Thus, if CEDAR is to play an optimal role in enhancing
the AI-readiness of NIH RADx data, then ideally additional work is necessary. To advance the role of CEDAR
in the creation of AI-ready datasets, (1) we will make CEDAR cloud-native by containerizing all CEDAR
microservices, by making these microservices discoverable and observable, and by migrating the entire
system to the cloud, and (2) we will make CEDAR a highly available system that is easy to maintain and
evolve; we will simplify and enhance the system’s architecture, taking advantage of new approaches and
components that were not available to us when the system was first designed. As a result, CEDAR will be
much more scalable, maintainable, and deployable. The new architecture will advance the application of AI
techniques not only to RADx data, but also to a wide range of datasets of importance to the NIH.
项目总结/文摘
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Mark A Musen其他文献
Mark A Musen的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Mark A Musen', 18)}}的其他基金
Enhanced ontology engineering through a Web-based, Cloud-based software architecture
通过基于网络、云的软件架构增强本体工程
- 批准号:
10405968 - 财政年份:2021
- 资助金额:
$ 27.45万 - 项目类别:
The Metadata Powerwash - Integrated tools to make biomedical data FAIR
Metadata Powerwash - 使生物医学数据公平的集成工具
- 批准号:
10397981 - 财政年份:2021
- 资助金额:
$ 27.45万 - 项目类别:
Enhancing the RADx Data Hub for Data FAIRness
增强 RADx 数据中心以实现数据公平
- 批准号:
10433797 - 财政年份:2021
- 资助金额:
$ 27.45万 - 项目类别:
Enhancing the RADx Data Hub for Data FAIRness
增强 RADx 数据中心以实现数据公平
- 批准号:
10794704 - 财政年份:2021
- 资助金额:
$ 27.45万 - 项目类别:
The Metadata Powerwash - Integrated tools to make biomedical data FAIR
Metadata Powerwash - 使生物医学数据公平的集成工具
- 批准号:
10551273 - 财政年份:2021
- 资助金额:
$ 27.45万 - 项目类别:
BioPortal: An Expansive Knowledgebase of Biomedical Entities and Relations
BioPortal:生物医学实体和关系的广泛知识库
- 批准号:
10494104 - 财政年份:2021
- 资助金额:
$ 27.45万 - 项目类别:
BioPortal: An Expansive Knowledgebase of Biomedical Entities and Relations
BioPortal:生物医学实体和关系的广泛知识库
- 批准号:
10271048 - 财政年份:2021
- 资助金额:
$ 27.45万 - 项目类别:
Enhancing the RADx Data Hub for Data FAIRness
增强 RADx 数据中心以实现数据公平
- 批准号:
10699372 - 财政年份:2021
- 资助金额:
$ 27.45万 - 项目类别:
The Metadata Powerwash - Integrated tools to make biomedical data FAIR
Metadata Powerwash - 使生物医学数据公平的集成工具
- 批准号:
10093841 - 财政年份:2021
- 资助金额:
$ 27.45万 - 项目类别:
Enhancing the RADx Data Hub for Data FAIRness
增强 RADx 数据中心以实现数据公平
- 批准号:
10850055 - 财政年份:2021
- 资助金额:
$ 27.45万 - 项目类别:
相似海外基金
Practical Study on Disaster Countermeasure Architecture Model by Sustainable Design in Asian Flood Area
亚洲洪泛区可持续设计防灾建筑模型实践研究
- 批准号:
17K00727 - 财政年份:2017
- 资助金额:
$ 27.45万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Functional architecture of a face processing area in the common marmoset
普通狨猴面部处理区域的功能架构
- 批准号:
9764503 - 财政年份:2016
- 资助金额:
$ 27.45万 - 项目类别:
Heating and airconditioning by hypocausts in residential and representative architecture in Rome and Latium studies of a phenomenon of luxury in a favoured climatic area of the Roman Empire on the basis of selected examples.
罗马和拉齐奥的住宅和代表性建筑中的火烧供暖和空调根据选定的例子,研究了罗马帝国有利的气候地区的奢华现象。
- 批准号:
317469425 - 财政年份:2016
- 资助金额:
$ 27.45万 - 项目类别:
Research Grants
SBIR Phase II: Area and Energy Efficient Error Floor Free Low-Density Parity-Check Codes Decoder Architecture for Flash Based Storage
SBIR 第二阶段:用于基于闪存的存储的面积和能源效率高、无错误层的低密度奇偶校验码解码器架构
- 批准号:
1632562 - 财政年份:2016
- 资助金额:
$ 27.45万 - 项目类别:
Standard Grant
SBIR Phase I: Area and Energy Efficient Error Floor Free Low-Density Parity-Check Codes Decoder Architecture for Flash Based Storage
SBIR 第一阶段:用于基于闪存的存储的面积和能源效率高、无错误层低密度奇偶校验码解码器架构
- 批准号:
1520137 - 财政年份:2015
- 资助金额:
$ 27.45万 - 项目类别:
Standard Grant
A Study on The Spatial Setting and The Inhavitant's of The Flood Prevention Architecture in The Flood Area
洪泛区防洪建筑空间设置及居民生活研究
- 批准号:
26420620 - 财政年份:2014
- 资助金额:
$ 27.45万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Area and power efficient interconnect architecture for multi-bit processing on FPGAs
用于 FPGA 上多位处理的面积和功率高效互连架构
- 批准号:
327691-2007 - 财政年份:2011
- 资助金额:
$ 27.45万 - 项目类别:
Discovery Grants Program - Individual
A FUNDAMENTAL STUDY ON UTILIZATION OF THE POST-WAR ARCHITECTURE AS URBAN REGENERATION METHOD, A case of the central area of Osaka city
战后建筑作为城市更新方法的基础研究——以大阪市中心区为例
- 批准号:
22760469 - 财政年份:2010
- 资助金额:
$ 27.45万 - 项目类别:
Grant-in-Aid for Young Scientists (B)
Area and power efficient interconnect architecture for multi-bit processing on FPGAs
用于 FPGA 上多位处理的面积和功率高效互连架构
- 批准号:
327691-2007 - 财政年份:2010
- 资助金额:
$ 27.45万 - 项目类别:
Discovery Grants Program - Individual
Area and power efficient interconnect architecture for multi-bit processing on FPGAs
用于 FPGA 上多位处理的面积和功率高效互连架构
- 批准号:
327691-2007 - 财政年份:2009
- 资助金额:
$ 27.45万 - 项目类别:
Discovery Grants Program - Individual