CCRI: Collaborative Research: Planning for World Of Code (WoC): An Infrastructure for Open Source Software Census

CCRI:协作研究:规划代码世界(WoC):开源软件普查的基础设施

基本信息

  • 批准号:
    1925615
  • 负责人:
  • 金额:
    $ 6.61万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2019
  • 资助国家:
    美国
  • 起止时间:
    2019-10-01 至 2021-09-30
  • 项目状态:
    已结题

项目摘要

While software engineering research has made great progress over the decades, the unavailability of data has long been a major limitation. Open source software (OSS) makes large quantities of data available, creating the promise of using modern quantitative techniques on very large data sets to understand how to create better software and improve productivity. However, the data exists in different formats, has many errors and omissions, is located in millions of repositories around the world, and requires extensive processing to render it in a form researchers can use. The planning grant aims to develop the requirements for creating World of Code (WoC), an infrastructure for enhancing software engineering research, and to build a research community centered around massive data gathered from all open source repositories around the world. This planning process will ensure that WoC, if realized, will enable researchers to easily access all open source data, resolve issues of replication, avoid drawing samples that are not representative, and reduce the risk of inappropriate conclusions based on erroneous data. The planning grant will establish a Steering Committee and an Advisory Board to guide the development and evolution of the resource in a way that provides the greatest value to the community of researchers. Hackathons will give both experienced and novice researchers an opportunity to learn how to use WoC's prototype capabilities, add to the tooling, and concretely express the desires and priorities of the research community WoC seeks to serve.Resources needed to mine and maintain data on the entire OSS code base and version histories far exceed what individual research groups can accomplish in the scope of conventional hypothesis-driven research. The pressing need to investigate the entirety of OSS will be addressed by creating requirements and building a community for an infrastructure that should contain: 1) A complete, rapidly updated source code data for the entirety of OSS; 2) Tooling for data correction, augmentation, and curation essential for such data: e.g., identity disambiguation and extraction of dependencies; 3) Services supporting research tasks that rely on the entirety of the collection; 4) A self-governing community of researchers, developers, and companies who maintain, enhance, and operate this infrastructure. To elicit the precise requirements for this OSS-wide infrastructure a wide audience of software researchers will be engaged.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
虽然软件工程研究在过去的几十年里取得了很大的进步,但数据的不可用性一直是一个主要的限制。开源软件(OSS)使大量数据可用,创造了在非常大的数据集上使用现代定量技术的承诺,以了解如何创建更好的软件并提高生产力。然而,这些数据以不同的格式存在,有许多错误和遗漏,位于世界各地的数百万个存储库中,并且需要进行大量处理才能将其呈现为研究人员可以使用的形式。该计划拨款旨在开发创建代码世界(WoC)的需求,这是一个增强软件工程研究的基础设施,并建立一个以从世界各地所有开源存储库收集的大量数据为中心的研究社区。这一规划过程将确保如果实现WoC,将使研究人员能够轻松访问所有开源数据,解决复制问题,避免绘制不具代表性的样本,并降低基于错误数据得出不适当结论的风险。规划赠款将建立一个指导委员会和一个咨询委员会,以向研究人员群体提供最大价值的方式指导资源的开发和演变。黑客马拉松将为有经验的和新手研究人员提供学习如何使用WoC原型功能的机会,添加工具,并具体表达WoC寻求服务的研究社区的愿望和优先事项。挖掘和维护整个OSS代码库和版本历史数据所需的资源远远超过单个研究小组在传统假设驱动的研究范围内所能完成的工作。调查整个OSS的迫切需要将通过创建需求和建立一个基础设施社区来解决,该基础设施应该包含:1)整个OSS的完整,快速更新的源代码数据;2)用于数据校正、增强和管理的工具,例如,身份消歧和依赖性提取;3)支持依赖于整个馆藏的研究任务的服务;4)一个由维护、增强和操作该基础设施的研究人员、开发人员和公司组成的自治社区。为了引出这个oss范围内的基础设施的精确需求,将会有大量的软件研究人员参与其中。该奖项反映了美国国家科学基金会的法定使命,并通过使用基金会的知识价值和更广泛的影响审查标准进行评估,被认为值得支持。

项目成果

期刊论文数量(11)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
World of code: enabling a research workflow for mining and analyzing the universe of open source VCS data
  • DOI:
    10.1007/s10664-020-09905-9
  • 发表时间:
    2020-10
  • 期刊:
  • 影响因子:
    4.1
  • 作者:
    Yuxing Ma;Tapajit Dey;Chris Bogart;Sadika Amreen;Marat Valiev;Adam Tutko;David Kennard;R. Zaretzki;A. Mockus
  • 通讯作者:
    Yuxing Ma;Tapajit Dey;Chris Bogart;Sadika Amreen;Marat Valiev;Adam Tutko;David Kennard;R. Zaretzki;A. Mockus
A Complete Set of Related Git Repositories Identified via Community Detection Approaches Based on Shared Commits
基于共享提交的社区检测方法识别出的一整套相关Git存储库
An Exploratory Study of Project Activity Changepoints in Open Source Software Evolution
开源软件演化中项目活动变点的探索性研究
Effect of Technical and Social Factors on Pull Request Quality for the NPM Ecosystem
Detecting and Characterizing Bots that Commit Code
检测和表征提交代码的机器人
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Audris Mockus其他文献

箏曲・地歌のXML 記述とその応用
古筝九塔的XML描述及其应用
古楽譜及び未解読楽譜のデータベース化のためのソフトウェアの設計
用于创建旧乐谱和未破译乐谱数据库的软件设计
A Web laboratory for software data analysis
Inflow and Retention in OSS Communities with Commercial Involvement: A Case Study of Three Hybrid Projects.
商业参与的 OSS 社区的流入和保留:三个混合项目的案例研究。
Bonobos in forest-savanna mosaic environment: development and perspectives of our newly launched wild bonobo research site
森林-稀树草原镶嵌环境中的倭黑猩猩:我们新启动的野生倭黑猩猩研究基地的发展和前景
  • DOI:
  • 发表时间:
    2018
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Kazuhiro Yamashita;Changyun Huang;Meiyappan Nagappan;Yasutaka Kamei;Audris Mockus;Ahmed E. Hassan and Naoyasu Ubayashi;Yamamoto S.
  • 通讯作者:
    Yamamoto S.

Audris Mockus的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Audris Mockus', 18)}}的其他基金

Collaborative Research: CCRI: New: World Of Code (WoC): The development of curated code resource to support research in software engineering
合作研究:CCRI:新:代码世界 (WoC):开发精选代码资源以支持软件工程研究
  • 批准号:
    2120429
  • 财政年份:
    2021
  • 资助金额:
    $ 6.61万
  • 项目类别:
    Standard Grant
CHS: Medium: Collaborative Research: SDI-CPR: Sustaining Digital Infrastructure as a Common Pool Resource
CHS:中:协作研究:SDI-CPR:将数字基础设施维持为公共池资源
  • 批准号:
    1901102
  • 财政年份:
    2019
  • 资助金额:
    $ 6.61万
  • 项目类别:
    Continuing Grant
BIGDATA: Collaborative Research: IA: OSCAR - Open Source Supply Chains and Avoidance of Risk: An Evidence Based Approach to Improve FLOSS Supply Chains
BIGDATA:协作研究:IA:OSCAR - 开源供应链和风险规避:改进 FLOSS 供应链的基于证据的方法
  • 批准号:
    1633437
  • 财政年份:
    2016
  • 资助金额:
    $ 6.61万
  • 项目类别:
    Standard Grant

相似海外基金

Collaborative Research: CISE-MSI: RCBP-ED: CCRI: TechHouse Partnership to Increase the Computer Engineering Research Expansion at Morehouse College
合作研究:CISE-MSI:RCBP-ED:CCRI:TechHouse 合作伙伴关系,以促进莫尔豪斯学院计算机工程研究扩展
  • 批准号:
    2318703
  • 财政年份:
    2023
  • 资助金额:
    $ 6.61万
  • 项目类别:
    Standard Grant
Collaborative Research: CCRI: New: A Scalable Hardware and Software Environment Enabling Secure Multi-party Learning
协作研究:CCRI:新:可扩展的硬件和软件环境支持安全的多方学习
  • 批准号:
    2347617
  • 财政年份:
    2023
  • 资助金额:
    $ 6.61万
  • 项目类别:
    Standard Grant
Collaborative Research: CCRI: NEW: Building a Batteryless Computing Community through Access to Education, Testbeds, and Tools
合作研究:CCRI:新:通过获得教育、测试平台和工具构建无电池计算社区
  • 批准号:
    2235002
  • 财政年份:
    2023
  • 资助金额:
    $ 6.61万
  • 项目类别:
    Standard Grant
Collaborative Research: Research Infrastructure: CCRI: ENS: Enhanced Open Networked Airborne Computing Platform
合作研究:研究基础设施:CCRI:ENS:增强型开放网络机载计算平台
  • 批准号:
    2235160
  • 财政年份:
    2023
  • 资助金额:
    $ 6.61万
  • 项目类别:
    Standard Grant
Collaborative Research: CCRI: New: Syntactic Differencing Infrastructure for Software Evolution Research
合作研究:CCRI:新:软件进化研究的句法差异基础设施
  • 批准号:
    2232594
  • 财政年份:
    2023
  • 资助金额:
    $ 6.61万
  • 项目类别:
    Standard Grant
Collaborative Research: CCRI: New: CoMIC: A Collaborative Mobile Immersive Computing Research Infrastructure for Multi-user XR
协作研究:CCRI:新:CoMIC:用于多用户 XR 的协作移动沉浸式计算研究基础设施
  • 批准号:
    2235050
  • 财政年份:
    2023
  • 资助金额:
    $ 6.61万
  • 项目类别:
    Standard Grant
Collaborative Research: Research Infrastructure: CCRI: New: Distributed Space and Terrestrial Networking Infrastructure for Multi-Constellation Coexistence
合作研究:研究基础设施:CCRI:新:用于多星座共存的分布式空间和地面网络基础设施
  • 批准号:
    2235140
  • 财政年份:
    2023
  • 资助金额:
    $ 6.61万
  • 项目类别:
    Standard Grant
Collaborative Research: CISE-MSI: RCBP-ED: CCRI: TechHouse Partnership to Increase the Computer Engineering Research Expansion at Morehouse College
合作研究:CISE-MSI:RCBP-ED:CCRI:TechHouse 合作伙伴关系,以促进莫尔豪斯学院计算机工程研究扩展
  • 批准号:
    2318704
  • 财政年份:
    2023
  • 资助金额:
    $ 6.61万
  • 项目类别:
    Standard Grant
Collaborative Research: CCRI: Grand: Quori 2.0: Uniting, Broadening, and Sustaining a Research Community Around a Modular Social Robot Platform
协作研究:CCRI:盛大:Quori 2.0:围绕模块化社交机器人平台联合、扩大和维持研究社区
  • 批准号:
    2235042
  • 财政年份:
    2023
  • 资助金额:
    $ 6.61万
  • 项目类别:
    Continuing Grant
Collaborative Research: CCRI: Planning-C: A Community for Configurability Open Research and Development (ACCORD)
合作研究:CCRI:Planning-C:可配置性开放研究与开发社区 (ACCORD)
  • 批准号:
    2234909
  • 财政年份:
    2023
  • 资助金额:
    $ 6.61万
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了