Source Code Assessment and Classification Based on Estimated Error Probability Using Attentive LSTM Language Model and Its Application in Programming Education-Reference-Cited by-同舟云学术

Source Code Assessment and Classification Based on Estimated Error Probability Using Attentive LSTM Language Model and Its Application in Programming Education

Published:2020-04-24 Issue:8 Volume:10 Page:2973
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Rahman Md. Mostafizer^ORCID,Watanobe Yutaka,Nakamura Keita

Abstract

The rate of software development has increased dramatically. Conventional compilers cannot assess and detect all source code errors. Software may thus contain errors, negatively affecting end-users. It is also difficult to assess and detect source code logic errors using traditional compilers, resulting in software that contains errors. A method that utilizes artificial intelligence for assessing and detecting errors and classifying source code as correct (error-free) or incorrect is thus required. Here, we propose a sequential language model that uses an attention-mechanism-based long short-term memory (LSTM) neural network to assess and classify source code based on the estimated error probability. The attentive mechanism enhances the accuracy of the proposed language model for error assessment and classification. We trained the proposed model using correct source code and then evaluated its performance. The experimental results show that the proposed model has logic and syntax error detection accuracies of 92.2% and 94.8%, respectively, outperforming state-of-the-art models. We also applied the proposed model to the classification of source code with logic and syntax errors. The average precision, recall, and F-measure values for such classification are much better than those of benchmark models. To strengthen the proposed model, we combined the attention mechanism with LSTM to enhance the results of error assessment and detection as well as source code classification. Finally, our proposed model can be effective in programming education and software engineering by improving code writing, debugging, error-correction, and reasoning.

Funder

Japan Society for the Promotion of Science

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/10/8/2973/pdf

Reference43 articles.

1. Data mining for software engineering and humans in the loop

2. Automatic Software Repair

3. Progress on approaches to software defect prediction

4. Early software defect prediction: A systematic map and review

Cited by 44 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Empowering Coders: Revolutionizing Programming Education with NLP and Challenge-Based Learning;2024 Third International Conference on Smart Technologies and Systems for Next Generation Computing (ICSTSN);2024-07-18

2. CommentClass: A Robust Ensemble Machine Learning Model for Comment Classification;International Journal of Computational Intelligence Systems;2024-07-15

3. Grading Programming Assignments by Summarization;ACM Turing Award Celebration Conference 2024;2024-07-05

4. Development Trend of Code Defect Detection Technology Based on Natural Language Processing;2024 IEEE 13th International Conference on Communication Systems and Network Technologies (CSNT);2024-04-06

5. Enhancing catalysis studies with chat generative pre-trained transformer (ChatGPT): Conversation with ChatGPT;Dalton Transactions;2024