Pattern Mining-Based Warning Prioritization by Refining Abstract Syntax Tree-Reference-Cited by-同舟云学术

Pattern Mining-Based Warning Prioritization by Refining Abstract Syntax Tree

Published:2024-07-23 Issue: Volume: Page:1-27
ISSN:0218-1940
Container-title:International Journal of Software Engineering and Knowledge Engineering
language:en
Short-container-title:Int. J. Soft. Eng. Knowl. Eng.

Author:

Ge Xiuting¹²³^ORCID,Li Xuanye¹²³^ORCID,Sun Yuanyuan³^ORCID,Qing Mingshuang³^ORCID,Zheng Haitao¹^ORCID,Zhang Huibin¹^ORCID,Wu Xianyu¹^ORCID

Affiliation:

1. GuangDong Tops Soft-park co, LTD, P. R. China

2. Shanghai Key Laboratory of Computer, Software Evaluating and Testing P. R. China

3. The State Key Laboratory for Novel Software Technology, Nanjing University, P. R. China

Abstract

Static code analysis tools (SATs) are widely used to detect potential defects in software projects. However, the usability of SATs is seriously hindered by a large number of unactionable warnings. Currently, many warning prioritization approaches are proposed to improve the usability of SATs. These approaches mainly extract different warning features to capture the statistical or historical information of warnings, thereby ranking actionable warnings in front of unactionable warnings. Such features are extracted by extremely relying on domain knowledge. However, the precise domain knowledge is difficult to be acquired. Also, the domain knowledge obtained in a project cannot be directly applied to other projects due to different application scenarios among different projects. To address the above problem, we propose a pattern mining-based warning prioritization approach based on the warning-related Abstract Syntax Tree (AST). To automatically mine actionable warning patterns, our approach leverages an advanced technique to collect actionable warnings, designs an algorithm to extract the warning-related AST, and mines patterns from ASTs of all actionable warnings. To prioritize the newly reported warnings, our approach combines exact and fuzzing matching techniques to calculate the similarity score between patterns of the newly reported warnings and the mined actionable warning patterns. We compare our approach with four typical baselines on five open-source and large-scale Java projects. The results show that our approach outperforms four baselines and achieves the maximum MAP (0.76) and MRR (2.19). Besides, a case study on Defect4J dataset demonstrates that our approach can discover 83% of true defects in the top 10 warnings.

Publisher

World Scientific Pub Co Pte Ltd

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0218194024500293

Reference51 articles.

1. AVATAR: Fixing Semantic Bugs with Fix Patterns of Static Analysis Violations

2. From Quick Fixes to Slow Fixes: Reimagining Static Analysis Resolutions to Enable Design Space Exploration

3. Bug Prioritization to Facilitate Bug Report Triage

4. Context is king: The developer perspective on the usage of static analysis tools

5. An Empirical Study on Spectral Clustering-based Software Defect Detection