Dynamic stacking ensemble for cross-language code smell detection-Reference-Cited by-同舟云学术

Dynamic stacking ensemble for cross-language code smell detection

Published:2024-08-15 Issue: Volume:10 Page:e2254
ISSN:2376-5992
Container-title:PeerJ Computer Science
language:en
Short-container-title:

Author:

Aljamaan Hamoud¹²^ORCID

Affiliation:

1. Information and Computer Science Department, King Fahd University of Petroleum and Minerals, Dhahran, Saudi Arabia

2. Interdisciplinary Research Center for Finance and Digital Economy, King Fahd University of Petroleum and Minerals, Dhahran, Saudi Arabia

Abstract

Code smells refer to poor design and implementation choices by software engineers that might affect the overall software quality. Code smells detection using machine learning models has become a popular area to build effective models that are capable of detecting different code smells in multiple programming languages. However, the process of building of such effective models has not reached a state of stability, and most of the existing research focuses on Java code smells detection. The main objective of this article is to propose dynamic ensembles using two strategies, namely greedy search and backward elimination, which are capable of accurately detecting code smells in two programming languages (i.e., Java and Python), and which are less complex than full stacking ensembles. The detection performance of dynamic ensembles were investigated within the context of four Java and two Python code smells. The greedy search and backward elimination strategies yielded different base models lists to build dynamic ensembles. In comparison to full stacking ensembles, dynamic ensembles yielded less complex models when they were used to detect most of the investigated Java and Python code smells, with the backward elimination strategy resulting in less complex models. Dynamic ensembles were able to perform comparably against full stacking ensembles with no significant detection loss. This article concludes that dynamic stacking ensembles were able to facilitate the effective and stable detection performance of Java and Python code smells over all base models and with less complexity than full stacking ensembles.

Funder

King Fahd University of Petroleum and Minerals

Publisher

PeerJ

Link

https://peerj.com/articles/cs-2254.pdf

Reference47 articles.

1. The treatment of missing values and its effect on classifier accuracy;Acuna,2004

2. Optuna: a next-generation hyperparameter optimization framework;Akiba,2019

3. Bad smell detection using machine learning techniques: a systematic literature review;Al-Shaaby;Arabian Journal for Science and Engineering,2020

4. Code smell detection using feature selection and stacking ensemble: an empirical investigation;Alazba;Information and Software Technology,2021

5. Deep learning approaches for bad smell detection: a systematic literature review;Alazba;Empirical Software Engineering,2023