Identifying algorithm in program code based on structural features using CNN classification model-Reference-Cited by-同舟云学术

Identifying algorithm in program code based on structural features using CNN classification model

Published:2022-09-23 Issue: Volume: Page:
ISSN:0924-669X
Container-title:Applied Intelligence
language:en
Short-container-title:Appl Intell

Author:

Watanobe Yutaka^ORCID,Rahman Md. Mostafizer^ORCID,Amin Md. Faizul Ibne,Kabir Raihan^ORCID

Abstract

AbstractIn software, an algorithm is a well-organized sequence of actions that provides the optimal way to complete a task. Algorithmic thinking is also essential to break-down a problem and conceptualize solutions in some steps. The proper selection of an algorithm is pivotal to improve computational performance and software productivity as well as to programming learning. That is, determining a suitable algorithm from a given code is widely relevant in software engineering and programming education. However, both humans and machines find it difficult to identify algorithms from code without any meta-information. This study aims to propose a program code classification model that uses a convolutional neural network (CNN) to classify codes based on the algorithm. First, program codes are transformed into a sequence of structural features (SFs). Second, SFs are transformed into a one-hot binary matrix using several procedures. Third, different structures and hyperparameters of the CNN model are fine-tuned to identify the best model for the code classification task. To do so, 61,614 real-world program codes of different types of algorithms collected from an online judge system are used to train, validate, and evaluate the model. Finally, the experimental results show that the proposed model can identify algorithms and classify program codes with a high percentage of accuracy. The average precision, recall, and F-measure scores of the best CNN model are 95.65%, 95.85%, and 95.70%, respectively, indicating that it outperforms other baseline models.

Funder

Japan Society for the Promotion of Science (JSPS) KAKENHI

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence

Link

https://link.springer.com/content/pdf/10.1007/s10489-022-04078-y.pdf

Reference75 articles.

1. Rahman MM, Watanobe Y, Kiran RU, Thang TC, Paik I (2021) Impact of practical skills on academic performance: a data-driven analysis. IEEE Access 9:139975–139993. https://doi.org/10.1109/ACCESS.2021.3119145https://doi.org/10.1109/ACCESS.2021.3119145

2. Medeiros RP, Ramalho GL, Falcão TP (2019) A systematic literature review on teaching and learning introductory programming in higher education. IEEE Trans Educ 62(2):77–90. https://doi.org/10.1109/TE.2018.2864133

3. Perera P, Tennakoon G, Ahangama S, Panditharathna R, Chathuranga B (2021) A systematic mapping of introductory programming languages for novice learners. IEEE Access 9:88121–88136. https://doi.org/10.1109/ACCESS.2021.3089560