A Comparison of Summarization Methods for Duplicate Software Bug Reports
-
Published:2023-08-15
Issue:16
Volume:12
Page:3456
-
ISSN:2079-9292
-
Container-title:Electronics
-
language:en
-
Short-container-title:Electronics
Author:
Mukhtar Samal1, Primadani Claudia Cahya1, Lee Seonah12ORCID, Jung Pilsu1
Affiliation:
1. Department of AI Convergence Engineering, Gyeongsang National University, 501 Jinjudaero, Jinju-si 52828, Gyeongsangnam-do, Republic of Korea 2. Department of Aerospace and Software Engineering, Gyeongsang National University, 501 Jinjudaero, Jinju-si 52828, Gyeongsangnam-do, Republic of Korea
Abstract
Bug reports vary in length, while some bug reports are lengthy, others are too brief to describe bugs in detail. In such a case, duplicate bug reports can serve as valuable resources for enriching bug descriptions. However, existing bug summarization methods mainly focused on summarizing a single bug report. In this paper, we focus on summarizing duplicate bug reports. By doing so, we aim to obtain an informative summary of bug reports while reducing redundant sentences in the summary. We apply several text summarization methods to duplicate bug reports. We then compare summarization results generated by different summarization methods and identify the most effective method for summarizing duplicate bug reports. Our comparative experiment reveals that the extractive multi-document method based on TF-IDF is the most effective in the summarization. This method successfully captures the relevant information from duplicate bug reports, resulting in comprehensive summaries. These results contribute to the advancement of bug summarization techniques, especially in summarizing duplicate bug reports.
Funder
National Research Foundation of Korea Ministry of Education and National Research Foundation of Korea
Subject
Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering
Reference25 articles.
1. Automatic keyword and sentence-based text summarization for software bug reports;Jindal;IEEE Access,2020 2. Bettenburg, N., Premraj, R., Zimmermann, T., and Kim, S. (October, January 28). Duplicate bug reports considered harmful… really?. Proceedings of the 2008 IEEE International Conference on Software Maintenance, Beijing, China. 3. Hao, R., Feng, Y., Jones, J.A., Li, Y., and Chen, Z. (2019, January 25–31). CTRAS: Crowdsourced test report aggregation and summarization. Proceedings of the 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE), Montreal, QC, Canada. 4. Manh, H.C., Le Thanh, H., and Minh, T.L. (2019, January 4–6). Extractive Multi-Document Summarization Using K-Means, Centroid-Based Method, MMR, and Sentence Position. Proceedings of the 10th International Symposium on Information and Communication Technology, SoICT ’19, Hanoi, Vietnam. 5. Reichold, L. (2023, July 03). Multi-Document News Article Summarizer. Available online: https://github.com/lukereichold/News-Summarizer.
|
|