DEFECTS - Comparable and Externally Valid Software Defect Prediction

DEFECTS - 可比较且外部有效的软件缺陷预测

基本信息

项目摘要

The comparability and reproducibility of empirical software engineering research is, for the most part, an open problem. This statement holds true for the field of software defect prediction. Current research shows that this leads to actual problems regarding the external validity of defect prediction research. Multiple replications conducted by different groups of researchers led to different findings than prior research. Moreover, problems with the currently used data sets were discovered and it was demonstrated that these problems may change conclusions. Thus, defect prediction research faces a replication crisis if these problems are ignored. Within this project, we plan to create a solid foundation for comparable and externally valid defect prediction research. Our approach rests on three pillars. The first pillar is the quality of the data we use for defect prediction experiments. The current studies on data quality do not cover the impact of mislabeled data. This kind of noise affects not only the creation of defect prediction models, but also their evaluation. We will statistically evaluate the noise in current data sets. Based on our findings, we will improve the state of the art of defect labeling and generate large data set with less noise. The quality of our data will be statistically validated. The collected body will be larger than the available defect prediction data sets and thereby facilitate a better generalizability and external validity of results. The second pillar is the replication of the current state of the art. Since prior replications were already contradictory to the original experiments, we believe that a broader replication effort is necessary. Current replications consider only parts of the state of the art, e.g., classifier impact or cross-project defect prediction. Most of the state of the art still was never replicated and diligently compared to other approaches or naïve baselines. Most experiments only used small data sets, which is a key factor for the problems with external validity. We will conduct a conceptual replication of the state of the art of defect prediction. Through this, we will improve the external validity of the defect prediction state of the art and lay the groundwork for a better external validity of future work. The third pillar are guidelines for defect prediction research. In case we cannot get researchers to avoid anti-patterns that led to bad validity of results, our efforts to combat the replication crisis of defect prediction research will only have a short-term effect. To make our results sustainable, we will work together with the defect prediction community to define guidelines that allow researchers to conduct their defect prediction experiments in such a way that we hopefully never face such problems with replicability again.
在大多数情况下,经验软件工程研究的可比性和可重复性是一个开放的问题。这句话适用于软件缺陷预测领域。目前的研究表明,这导致了缺陷预测研究的外部有效性的实际问题。不同研究小组进行的多次重复实验得出了与之前研究不同的结果。此外,发现了当前使用的数据集的问题,并证明这些问题可能会改变结论。因此,如果忽视这些问题,缺陷预测研究将面临复制危机。在这个项目中,我们计划为可比较的和外部有效的缺陷预测研究创建一个坚实的基础。我们的做法基于三个支柱。第一个支柱是我们用于缺陷预测实验的数据质量。目前对数据质量的研究并没有涵盖错标数据的影响。这种噪声不仅影响缺陷预测模型的建立,而且影响缺陷预测模型的评价。我们将对当前数据集中的噪声进行统计评估。基于我们的发现,我们将改进缺陷标记技术的现状,并生成具有更少噪声的大数据集。我们的数据质量将经过统计验证。收集的主体将比可用的缺陷预测数据集更大,从而有助于更好的推广和结果的外部有效性。第二个支柱是对当前技术状态的复制。由于先前的重复实验已经与最初的实验相矛盾,我们认为有必要进行更广泛的重复实验。当前的复制只考虑技术状态的一部分,例如,分类器影响或跨项目缺陷预测。大多数最新的技术仍然没有被复制,也没有与其他方法或naïve基线进行比较。大多数实验只使用小数据集,这是外部效度问题的关键因素。我们将对缺陷预测技术的现状进行概念性的复制。通过此,我们将提高现有缺陷预测现状的外部有效性,为今后工作更好的外部有效性奠定基础。第三个支柱是缺陷预测研究的指导方针。如果我们不能让研究人员避免导致结果有效性差的反模式,那么我们对抗缺陷预测研究的复制危机的努力将只会产生短期效果。为了使我们的结果可持续,我们将与缺陷预测社区一起定义指导方针,允许研究人员以这样一种方式进行他们的缺陷预测实验,我们希望再也不会遇到这种可复制性问题。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Professor Dr. Steffen Herbold其他文献

Professor Dr. Steffen Herbold的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Professor Dr. Steffen Herbold', 18)}}的其他基金

GAIUS - Maintenance activities for the sustainability of AUGUSTUS
GAIUS - AUGUSTUS 可持续发展的维护活动
  • 批准号:
    391397397
  • 财政年份:
    2018
  • 资助金额:
    --
  • 项目类别:
    Research data and software (Scientific Library Services and Information Systems)
SENLP - Software Engineering knowledge of NLP models
SENLP - NLP 模型的软件工程知识
  • 批准号:
    524228075
  • 财政年份:
  • 资助金额:
    --
  • 项目类别:
    Research Grants

相似海外基金

Cancer Survival in WTC First Responders vs. Comparable Occupational Cohorts
世贸中心急救人员与可比职业群体的癌症生存率
  • 批准号:
    10748127
  • 财政年份:
    2023
  • 资助金额:
    --
  • 项目类别:
Development of an internationally comparable longitudinal database of older adults and causal inference
开发国际可比的老年人纵向数据库和因果推理
  • 批准号:
    23H03164
  • 财政年份:
    2023
  • 资助金额:
    --
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
Establishment of various and comparable liver injury models with low molecular weight compounds
用低分子化合物建立各种可比较的肝损伤模型
  • 批准号:
    21K06663
  • 财政年份:
    2021
  • 资助金额:
    --
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
An internationally comparable individual longitudinal experimental study of intertemporal and interindividual variability in trust, reciprocity, and altruism
关于信任、互惠和利他主义的跨期和个体间变异性的国际可比个人纵向实验研究
  • 批准号:
    21K18129
  • 财政年份:
    2021
  • 资助金额:
    --
  • 项目类别:
    Grant-in-Aid for Challenging Research (Pioneering)
Development of a novel in vitro skin sensitization assay with predictive ability comparable to animal studies
开发一种新型体外皮肤致敏测定方法,其预测能力可与动物研究相媲美
  • 批准号:
    21K02070
  • 财政年份:
    2021
  • 资助金额:
    --
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Development of 3D-printable restorative material with hardness comparable to human enamel
开发硬度与人类牙釉质相当的3D打印修复材料
  • 批准号:
    20K21685
  • 财政年份:
    2020
  • 资助金额:
    --
  • 项目类别:
    Grant-in-Aid for Challenging Research (Exploratory)
Development of comparable intracellular stability evaluation method for nanomaterials using intracellular delivery nanocarriers
使用细胞内递送纳米载体开发纳米材料的可比细胞内稳定性评估方法
  • 批准号:
    20K05284
  • 财政年份:
    2020
  • 资助金额:
    --
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
A Comparable Research on Policies for Academic Improvement through School Interconnection
学校互联促进学业进步的政策比较研究
  • 批准号:
    19K02413
  • 财政年份:
    2019
  • 资助金额:
    --
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Could "Ktedonobacteria" be a new group of useful bacteria comparable to actinomycetes?
“Ktedonobacteria”能否成为与放线菌相媲美的一组新的有用细菌?
  • 批准号:
    18K05406
  • 财政年份:
    2018
  • 资助金额:
    --
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Laboratory test-comparable mobile assessments of hemoglobin for anemia detection
用于贫血检测的血红蛋白实验室测试可比移动评估
  • 批准号:
    9341800
  • 财政年份:
    2017
  • 资助金额:
    --
  • 项目类别:
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了