CVs Classification Using Neural Network Approaches Combined with BERT and Gensim: CVs of Moroccan Engineering Students
Aniss Qostal (),
Aniss Moumen and
Younes Lakhrissi
Additional contact information
Aniss Qostal: Intelligent Systems, Georesources and Renewable Energies Laboratory (SIGER IN FRENCH), Sidi Mohamed Ben Abdellah University, FST, Fez 30050, Morocco
Aniss Moumen: Laboratory of Engineering Sciences, National School of Applied Sciences, Ibn Tofaïl University, Kenitra 14000, Morocco
Younes Lakhrissi: Intelligent Systems, Georesources and Renewable Energies Laboratory (SIGER IN FRENCH), Sidi Mohamed Ben Abdellah University, FST, Fez 30050, Morocco
Data, 2024, vol. 9, issue 6, 1-16
Abstract:
Deep learning (DL)-oriented document processing is widely used in different fields for extraction, recognition, and classification processes from raw corpus of data. The article examines the application of deep learning approaches, based on different neural network methods, including Gated Recurrent Unit (GRU), long short-term memory (LSTM), and convolutional neural networks (CNNs). The compared models were combined with two different word embedding techniques, namely: Bidirectional Encoder Representations from Transformers (BERT) and Gensim Word2Vec. The models are designed to evaluate the performance of architectures based on neural network techniques for the classification of CVs of Moroccan engineering students at ENSAK (National School of Applied Sciences of Kenitra, Ibn Tofail University). The used dataset included CVs collected from engineering students at ENSAK in 2023 for a project on the employability of Moroccan engineers in which new approaches were applied, especially machine learning, deep learning, and big data. Accordingly, 867 resumes were collected from five specialties of study (Electrical Engineering (ELE), Networks and Systems Telecommunications (NST), Computer Engineering (CE), Automotive Mechatronics Engineering (AutoMec), Industrial Engineering (Indus)). The results showed that the proposed models based on the BERT embedding approach had more accuracy compared to models based on the Gensim Word2Vec embedding approach. Accordingly, the CNN-GRU/BERT model achieved slightly better accuracy with 0.9351 compared to other hybrid models. On the other hand, single learning models also have good metrics, especially based on BERT embedding architectures, where CNN has the best accuracy with 0.9188.
Keywords: Gated Recurrent Unit (GRU); long short-term memory (LSTM); convolutional neural network (CNN); BERT; Gensim; Moroccan engineering students; Ibn Tofail University; CVs; ENSAK (search for similar items in EconPapers)
JEL-codes: C8 C80 C81 C82 C83 (search for similar items in EconPapers)
Date: 2024
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2306-5729/9/6/74/pdf (application/pdf)
https://www.mdpi.com/2306-5729/9/6/74/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jdataj:v:9:y:2024:i:6:p:74-:d:1400795
Access Statistics for this article
Data is currently edited by Ms. Cecilia Yang
More articles in Data from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().