CVs Classification Using Neural Network Approaches Combined with BERT and Gensim: CVs of Moroccan Engineering Students-Reference-Cited by-同舟云学术

CVs Classification Using Neural Network Approaches Combined with BERT and Gensim: CVs of Moroccan Engineering Students

Published:2024-05-24 Issue:6 Volume:9 Page:74
ISSN:2306-5729
Container-title:Data
language:en
Short-container-title:Data

Author:

Qostal Aniss¹^ORCID,Moumen Aniss²^ORCID,Lakhrissi Younes¹^ORCID

Affiliation:

1. Intelligent Systems, Georesources and Renewable Energies Laboratory (SIGER IN FRENCH), Sidi Mohamed Ben Abdellah University, FST, Fez 30050, Morocco

2. Laboratory of Engineering Sciences, National School of Applied Sciences, Ibn Tofaïl University, Kenitra 14000, Morocco

Abstract

Deep learning (DL)-oriented document processing is widely used in different fields for extraction, recognition, and classification processes from raw corpus of data. The article examines the application of deep learning approaches, based on different neural network methods, including Gated Recurrent Unit (GRU), long short-term memory (LSTM), and convolutional neural networks (CNNs). The compared models were combined with two different word embedding techniques, namely: Bidirectional Encoder Representations from Transformers (BERT) and Gensim Word2Vec. The models are designed to evaluate the performance of architectures based on neural network techniques for the classification of CVs of Moroccan engineering students at ENSAK (National School of Applied Sciences of Kenitra, Ibn Tofail University). The used dataset included CVs collected from engineering students at ENSAK in 2023 for a project on the employability of Moroccan engineers in which new approaches were applied, especially machine learning, deep learning, and big data. Accordingly, 867 resumes were collected from five specialties of study (Electrical Engineering (ELE), Networks and Systems Telecommunications (NST), Computer Engineering (CE), Automotive Mechatronics Engineering (AutoMec), Industrial Engineering (Indus)). The results showed that the proposed models based on the BERT embedding approach had more accuracy compared to models based on the Gensim Word2Vec embedding approach. Accordingly, the CNN-GRU/BERT model achieved slightly better accuracy with 0.9351 compared to other hybrid models. On the other hand, single learning models also have good metrics, especially based on BERT embedding architectures, where CNN has the best accuracy with 0.9188.

Publisher

MDPI AG

Link

https://www.mdpi.com/2306-5729/9/6/74/pdf

Reference48 articles.

1. Machine learning: Applications of artificial intelligence to imaging and diagnosis;Nichols;Biophys. Rev.,2019

2. History of artificial intelligence in medicine;Kaul;Gastrointest. Endosc.,2020

3. Li, Q., Cai, W., Wang, X., Zhou, Y., Feng, D.D., and Chen, M. (2014, January 10–12). Medical image classification with convolutional neural network. Proceedings of the IEEE 2014 13th International Conference on Control Automation Robotics & Vision (ICARCV), Singapore.

4. Educational data mining: Prediction of students’ academic performance using machine learning algorithms;Smart Learn. Environ.,2022

5. Usage of Machine Learning for Strategic Decision Making at Higher Educational Institutions;Nieto;IEEE Access,2019