A fine-grained data set and analysis of tangling in bug fixing commits-Reference-Cited by-同舟云学术

A fine-grained data set and analysis of tangling in bug fixing commits

Published:2022-07-02 Issue:6 Volume:27 Page:
ISSN:1382-3256
Container-title:Empirical Software Engineering
language:en
Short-container-title:Empir Software Eng

Author:

Herbold Steffen^ORCID,Trautsch Alexander,Ledel Benjamin,Aghamohammadi Alireza,Ghaleb Taher A.,Chahal Kuljit Kaur,Bossenmaier Tim,Nagaria Bhaveet,Makedonski Philip,Ahmadabadi Matin Nili,Szabados Kristof,Spieker Helge,Madeja Matej,Hoy Nathaniel,Lenarduzzi Valentina,Wang Shangwen,Rodríguez-Pérez Gema,Colomo-Palacios Ricardo,Verdecchia Roberto,Singh Paramvir,Qin Yihao,Chakroborti Debasish,Davis Willard,Walunj Vijay,Wu Hongjun,Marcilio Diego,Alam Omar,Aldaeej Abdullah,Amit Idan,Turhan Burak,Eismann Simon,Wickert Anna-Katharina,Malavolta Ivano,Sulír Matúš,Fard Fatemeh,Henley Austin Z.,Kourtzanidis Stratos,Tuzun Eray,Treude Christoph,Shamasbi Simin Maleki,Pashchenko Ivan,Wyrich Marvin,Davis James,Serebrenik Alexander,Albrecht Ella,Aktas Ethem Utku,Strüber Daniel,Erbel Johannes

Abstract

Abstract Context Tangled commits are changes to software that address multiple concerns at once. For researchers interested in bugs, tangled commits mean that they actually study not only bugs, but also other concerns irrelevant for the study of bugs. Objective We want to improve our understanding of the prevalence of tangling and the types of changes that are tangled within bug fixing commits. Methods We use a crowd sourcing approach for manual labeling to validate which changes contribute to bug fixes for each line in bug fixing commits. Each line is labeled by four participants. If at least three participants agree on the same label, we have consensus. Results We estimate that between 17% and 32% of all changes in bug fixing commits modify the source code to fix the underlying problem. However, when we only consider changes to the production code files this ratio increases to 66% to 87%. We find that about 11% of lines are hard to label leading to active disagreements between participants. Due to confirmed tangling and the uncertainty in our data, we estimate that 3% to 47% of data is noisy without manual untangling, depending on the use case. Conclusion Tangled commits have a high prevalence in bug fixes and can lead to a large amount of noise in the data. Prior research indicates that this noise may alter results. As researchers, we should be skeptics and assume that unvalidated data is likely very noisy, until proven otherwise.

Funder

Technische Universität Clausthal

Publisher

Springer Science and Business Media LLC

Subject

Software

Link

https://link.springer.com/content/pdf/10.1007/s10664-021-10083-5.pdf

Reference65 articles.

1. Agresti A, Coull B A (1998) Approximate is better than ”exact” for interval estimation of binomial proportions. Amer Stat 52(2):119–126

2. Anderson A, Huttenlocher D, Kleinberg J, Leskovec J (2013) Steering user behavior with badges. In: Proceedings of the 22nd International Conference on World Wide Web, WWW ’13. Association for Computing Machinery, New York, pp 95–106

3. Arima R, Higo Y, Kusumoto S (2018) A study on inappropriately partitioned commits: How much and what kinds of ip commits in java projects?. In: Proceedings of the 15th International Conference on Mining Software Repositories, MSR ’18. Association for Computing Machinery, New York, pp 336–340

4. Baltes S, Ralph P (2020) Sampling in software engineering research: A critical review and guidelines. arXiv:2002.07764

5. Bissyandí T F, Thung F, Wang S, Lo D, Jiang L, Réveillère L (2013) Empirical evaluation of bug linking. In: 2013 17th European Conference on Software Maintenance and Reengineering, pp 89–98

Cited by 14 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A code change‐oriented approach to just‐in‐time defect prediction with multiple input semantic fusion;Expert Systems;2024-08-27

2. On Refining the SZZ Algorithm with Bug Discussion Data;Empirical Software Engineering;2024-07-24

3. JIT-Smart: A Multi-task Learning Framework for Just-in-Time Defect Prediction and Localization;Proceedings of the ACM on Software Engineering;2024-07-12

4. An Exploratory Study of Programmers' Analogical Reasoning and Software History Usage During Code Re-Purposing;Proceedings of the 2024 IEEE/ACM 17th International Conference on Cooperative and Human Aspects of Software Engineering;2024-04-14

5. Delving into Parameter-Efficient Fine-Tuning in Code Change Learning: An Empirical Study;2024 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER);2024-03-12