Extended Association Rule Mining and Its Application to Software Engineering Data Sets-Reference-Cited by-同舟云学术

Extended Association Rule Mining and Its Application to Software Engineering Data Sets

Published:2024-08-30 Issue: Volume: Page:1-22
ISSN:0218-1940
Container-title:International Journal of Software Engineering and Knowledge Engineering
language:en
Short-container-title:Int. J. Soft. Eng. Knowl. Eng.

Author:

Saito Hidekazu¹,Nishiura Kinari²^ORCID,Monden Akito¹^ORCID,Morisaki Shuji³^ORCID

Affiliation:

1. Graduate School of Natural Science and Technology, Okayama University, Okayama, Japan

2. Faculty of Information and Human Sciences, Kyoto Institute of Technology, Kyoto, Japan

3. Graduate School of Informatics, Nagoya University, Nagoya, Japan

Abstract

Association rule mining is a highly effective approach to data analysis for datasets of varying sizes, accommodating diverse feature values. Nevertheless, deriving practical rules from datasets with numerical variables presents a challenge, as these variables must be discretized beforehand. Quantitative association rule mining addresses this issue, allowing the extraction of valuable rules. This paper introduces an extension to quantitative association rules, incorporating a two-variable function in their consequent part. The use of correlation functions, statistical test functions, and error functions is also introduced. We illustrate the utility of this extension through three case studies employing software engineering datasets. In case study 1, we successfully pinpointed the conditions that result in either a high or low correlation between effort and software size, offering valuable insights for software project managers. In case study 2, we effectively identified the conditions that lead to a high or low correlation between the number of bugs and source lines of code, aiding in the formulation of software test planning strategies. In case study 3, we applied our approach to the two-step software effort estimation process, uncovering the conditions most likely to yield low effort estimation errors.

Funder

Japan Society for the Promotion of Science

Publisher

World Scientific Pub Co Pte Ltd

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0218194024500347

Reference23 articles.

1. Mining association rules between sets of items in large databases

2. A hybrid faulty module prediction using association rule mining and logistic regression analysis