EconPapers    
Economics at your fingertips  
 

Powered SQL education: Automating SQL/PLSQL question classification with LLMs and machine learning

Naif Alzriqat () and Mohammad Al-Oudat ()

International Journal of Innovative Research and Scientific Studies, 2025, vol. 8, issue 2, 1395-1407

Abstract: Mastering Structured Query Language/Procedural Language (SQL/PLSQL) is considered challenging for academic students and industrial professionals, showing a significant gap between academic preparation and industrial demands that leads both to seek solutions on Stack Overflow (SO). This research presents a novel automated framework to classify SQL/PLSQL questions and shed light on learning challenges. A new dataset was collected from SO posts, totaling 10,266 questions, and categorized into five categories—Data Definition Language (DDL), Data Manipulation Language (DML), Data Query Language (DQL), Data Control Language (DCL), and Transaction Control Language (TCL)—using the LLM GPT-4o-mini API, followed by preprocessing and applying Machine Learning (ML) techniques like Random Forest and XGBoost. Results show that Data Query Language (DQL) and Data Manipulation Language (DML) are the most challenging areas, with Random Forest and XGBoost producing the highest classification accuracy at 85.57% and 85.13%, respectively, while DDL and DCL appear less often. This research bridges the gap between academic and industrial requirements, concluding that AI-driven analysis identifies the real challenges, suggesting that the academic curriculum enhance hands-on problem-solving to meet industry needs.

Keywords: Curriculum enhancement; Database education; Database skills; Industry-academic gap; LLM; AI; SQL categorization; Stack Overflow. (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
https://ijirss.com/index.php/ijirss/article/view/5467/941 (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:aac:ijirss:v:8:y:2025:i:2:p:1395-1407:id:5467

Access Statistics for this article

International Journal of Innovative Research and Scientific Studies is currently edited by Natalie Jean

More articles in International Journal of Innovative Research and Scientific Studies from Innovative Research Publishing
Bibliographic data for series maintained by Natalie Jean ().

 
Page updated 2025-03-22
Handle: RePEc:aac:ijirss:v:8:y:2025:i:2:p:1395-1407:id:5467