EarthCube Data Capabilities: Collaborative Research: Integration of Reproducibility into Community Cyberinfrastructure


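Basic information

  • Award number:
    1928288
  • Principal investigator:
  • Amount:
    $331,900
  • Host institution:
  • Host institution country:
    United States
  • Award type:
    Standard Grant
  • Fiscal year:
    2019
  • Funding country:
    United States
  • Project period:
    2019-09-01 to 2024-08-31
  • Status:
    Completed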

Project abstract

For science to reliably support new discoveries, its results must be reproducible. This has proven to be a challenge in many fields, most notably those that rely on computational studies to support new discoveries. Reproducibility in these studies is particularly difficult because it requires open sharing of data and models and careful control by the original researcher, so that products can be run on later generations of hardware and software and still produce consistent results. This project will develop software that supports computational reproducibility and makes it easier and more efficient for geoscientists to preserve, share, repeat, and replicate scientific computations. The Broader Impacts of this project include a collaboration between computer scientists, hydrologists, and the Consortium of Universities for the Advancement of Hydrologic Science Inc. (CUAHSI) for the hydrology research community. With over 3500 users and over 8000 model and data resources, this collaboration will bring improved tools and best practices to a broad and diverse community of geoscientists. Beyond hydrology, the methods and tools developed as part of this project have the potential to be extended to the solid Earth and space science geoscience domains. They also have the potential to inform the reproducibility evaluation process as currently undertaken by journals and publishers. The project will also conduct workshops to train researchers, and its results will be used in the classroom at Utah State University, DePaul University, and the University of Virginia.
Emphasis on the importance of research reproducibility is steadily rising; however, many studies are still not reproducible. Reproducibility in computational studies is particularly difficult because of the challenges involved in completely documenting the data, models, and procedures used, together with the underlying hardware and software dependencies. The reproducibility workbench software (ReproBench) developed in this project will address reproducibility questions by establishing a container-based reproducible workflow that will make it easy and efficient for geoscientists to verify scientific results. Automation and documentation are two key methods for improving verification and, in general, the conduct of reproducible science. This project will build on two past investments: (I) automated containerization methods, through the Sciunit project, and (II) well-documented, community-adopted interfaces, through HydroShare. It will bring these investments together to establish a novel, robust, and reproducible workflow. By applying this workflow to water-related science use cases, this project will demonstrate how to preserve, share, repeat, and replicate scientific results. The interfaces can become an exemplar for other community cyberinfrastructure that, like that of hydrology, aims to share data and models at a large scale. In establishing this workflow, the ReproBench project team combines expertise in cyberinfrastructure, domain science, and reproducible computational data science. By leveraging Sciunit, ReproBench brings formal methods for the conduct of reproducible computational science into the geosciences.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

Project outcomes

Journal articles (7)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Provenance-based Workflow Diagnostics Using Program Specification
  • DOI:
  • Published:
    2022
  • Journal:
  • Impact factor:
    0
  • Authors:
    Nakamura, Y. Malik
  • Corresponding author:
    Nakamura, Y. Malik
Efficient Provenance Alignment in Reproduced Executions
A taxonomy for reproducible and replicable research in environmental modelling
  • DOI:
    10.1016/j.envsoft.2020.104753
  • Published:
    2020
  • Journal:
  • Impact factor:
    4.9
  • Authors:
    Essawy, Bakinam T.;Goodall, Jonathan L.;Voce, Daniel;Morsy, Mohamed M.;Sadler, Jeffrey M.;Choi, Young Don;Tarboton, David G.;Malik, Tanu
  • Corresponding author:
    Malik, Tanu
Artifact Description/Artifact Evaluation: A Reproducibility Bane or a Boon
Comparing containerization-based approaches for reproducible computational modeling of environmental systems
  • DOI:
    10.1016/j.envsoft.2023.105760
  • Published:
    2023
  • Journal:
  • Impact factor:
    4.9
  • Authors:
    Choi, Young-Don;Roy, Binata;Nguyen, Jared;Ahmad, Raza;Maghami, Iman;Nassar, Ayman;Li, Zhiyu;Castronova, Anthony M.;Malik, Tanu;Wang, Shaowen
  • Corresponding author:
    Wang, Shaowen

Other publications by Tanu Malik

Genistein and daidzein
  • DOI:
  • Published:
    2022
  • Journal:
  • Impact factor:
    0
  • Authors:
    L. Sarao;S. Kaur;Tanu Malik;Ashutosh Kumar Singh
  • Corresponding author:
    Ashutosh Kumar Singh
Special issue on Data-driven Science
Reproducible eScience: The Data Containerization Challenge
Advances in understanding wheat-related disorders: A comprehensive review on gluten-free products with emphasis on wheat allergy, celiac and non-celiac gluten sensitivity
  • DOI:
    10.1016/j.focha.2024.100627
  • Published:
    2024-06-01
  • Journal:
  • Impact factor:
  • Authors:
    Deepika Singla;Tanu Malik;Ajay Singh;Sheetal Thakur;Pradyuman Kumar
  • Corresponding author:
    Pradyuman Kumar

Other grants held by Tanu Malik

CAREER: Advanced Containers for Reproducibility in Computational and Data Science
  • Award number:
    1846418
  • Fiscal year:
    2019
  • Amount:
    $331,900
  • Award type:
    Continuing Grant
EarthCube Building Block: GeoDataspace: Simplifying Data Management for Geoscience Models
  • Award number:
    1722152
  • Fiscal year:
    2016
  • Amount:
    $331,900
  • Award type:
    Standard Grant
EarthCube Building Blocks: Collaborative Proposal: GeoTrust: Improving Sharing and Reproducibility of Geoscience Applications
  • Award number:
    1639759
  • Fiscal year:
    2016
  • Amount:
    $331,900
  • Award type:
    Standard Grant
EarthCube IA: Collaborative Proposal: Integrated GeoScience Observatory
  • Award number:
    1661918
  • Fiscal year:
    2016
  • Amount:
    $331,900
  • Award type:
    Standard Grant
EarthCube IA: Collaborative Proposal: Integrated GeoScience Observatory
  • Award number:
    1540901
  • Fiscal year:
    2015
  • Amount:
    $331,900
  • Award type:
    Standard Grant
EarthCube Building Block: GeoDataspace: Simplifying Data Management for Geoscience Models
  • Award number:
    1440327
  • Fiscal year:
    2014
  • Amount:
    $331,900
  • Award type:
    Standard Grant

Similar NSFC grants

Scalable Learning and Optimization: High-dimensional Models and Online Decision-Making Strategies for Big Data Analysis
  • Award number:
  • Approval year:
    2024
  • Amount:
  • Award type:
    Collaborative Innovation Research Team
Data-driven Recommendation System Construction of an Online Medical Platform Based on the Fusion of Information
  • Award number:
  • Approval year:
    2024
  • Amount:
  • Award type:
    Research Fund for International Young Scientists
Development of a Linear Stochastic Model for Wind Field Reconstruction from Limited Measurement Data
  • Award number:
  • Approval year:
    2020
  • Amount:
    CNY 400,000
  • Award type:
Key Technologies for Semantic Interoperability of Web Services Based on Linked Open Data
  • Award number:
    61373035
  • Approval year:
    2013
  • Amount:
    CNY 770,000
  • Award type:
    General Program
Molecular Interaction Reconstruction of Rheumatoid Arthritis Therapies Using Clinical Data
  • Award number:
    31070748
  • Approval year:
    2010
  • Amount:
    CNY 340,000
  • Award type:
    General Program
Functional Data Analysis Methods for High-dimensional Data
  • Award number:
    11001084
  • Approval year:
    2010
  • Amount:
    CNY 160,000
  • Award type:
    Young Scientists Fund
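The Role of datA, a Negative Regulator of Chromosome Replication, in the Cell Cycle
  • Award number:
    31060015
  • Approval year:
    2010
  • Amount:
    CNY 250,000
  • Award type:
    Regional Science Fund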
Computational Methods for Analyzing Toponome Data
  • Award number:
    60601030
  • Approval year:
    2006
  • Amount:
    CNY 170,000
  • Award type:
    Young Scientists Fund

Similar overseas grants

EarthCube Data Capabilities: Collaborative Proposal: Reducing Time-To-Science in the Earth Sciences: Annotations to foster convergence, inclusion, and credit
  • Award number:
    2246427
  • Fiscal year:
    2022
  • Amount:
    $331,900
  • Award type:
    Standard Grant
Collaborative Research: EarthCube Data Capabilities: Volcanology hub for Interdisciplinary Collaboration, Tools and Resources (VICTOR)
  • Award number:
    2125974
  • Fiscal year:
    2021
  • Amount:
    $331,900
  • Award type:
    Standard Grant
EarthCube Capabilities: CloudDrift: a platform for accelerating research with Lagrangian climate data
  • Award number:
    2126413
  • Fiscal year:
    2021
  • Amount:
    $331,900
  • Award type:
    Standard Grant
EarthCube Capabilities: Reducing Time-to-science for Terrestrial Sensor Networks by Integrating Field Notes, Management, and QA/QC into Data Curation
  • Award number:
    2126386
  • Fiscal year:
    2021
  • Amount:
    $331,900
  • Award type:
    Standard Grant
Collaborative Research: EarthCube Capabilities: Repurposing FAIR-Compliant Earth Science Data Repositories
  • Award number:
    2126427
  • Fiscal year:
    2021
  • Amount:
    $331,900
  • Award type:
    Standard Grant
Collaborative Research: EarthCube Data Capabilities: Volcanology hub for Interdisciplinary Collaboration, Tools and Resources (VICTOR)
  • Award number:
    2126268
  • Fiscal year:
    2021
  • Amount:
    $331,900
  • Award type:
    Standard Grant
Collaborative Research: EarthCube Data Capabilities: Volcanology hub for Interdisciplinary Collaboration, Tools and Resources (VICTOR)
  • Award number:
    2126435
  • Fiscal year:
    2021
  • Amount:
    $331,900
  • Award type:
    Standard Grant
Collaborative Research: EarthCube Capabilities: Raijin: Community Geoscience Analysis Tools for Unstructured Mesh Data
  • Award number:
    2126459
  • Fiscal year:
    2021
  • Amount:
    $331,900
  • Award type:
    Standard Grant
Collaborative Research: EarthCube Capabilities: ICESpark: An Open-Source Big Data Platform for Science Discoveries in the New Arctic and Beyond
  • Award number:
    2126474
  • Fiscal year:
    2021
  • Amount:
    $331,900
  • Award type:
    Standard Grant
Collaborative Research: EarthCube Capabilities: Repurposing FAIR-Compliant Earth Science Data Repositories
  • Award number:
    2126298
  • Fiscal year:
    2021
  • Amount:
    $331,900
  • Award type:
    Standard Grant