Detecting LLM-Enabled Plagiarism in Student Essays Using Ensemble Learning and NLP
Mouad Berqia () and
Hafssa Benaboud ()
Additional contact information
Mouad Berqia: Mohammed V University in Rabat
Hafssa Benaboud: Mohammed V University in Rabat
A chapter in Information Systems and Technological Advances for Sustainable Development, 2024, pp 376-382 from Springer
Abstract:
Abstract The increasing use of Large Language Models (LLMs) is causing concern about their ability to replace human jobs. Educators are especially worried about how these models might affect students, specifically their ability to do their own assignments like essays. This paper addresses the fear that LLMs might lead to more plagiarism in schools. We propose a machine learning model to distinguish between essays written by middle and high school students and those generated by LLMs. The proposed method uses Natural Language Processing (NLP) and Ensemble Learning techniques. The proposed model performed well, with a ROC AUC score of 0.9985.
Keywords: Large Language Models; Ensemble Learning; Natural Language Processing; Plagiarism; Academic Integrity (search for similar items in EconPapers)
Date: 2024
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:lnichp:978-3-031-75329-9_42
Ordering information: This item can be ordered from
http://www.springer.com/9783031753299
DOI: 10.1007/978-3-031-75329-9_42
Access Statistics for this chapter
More chapters in Lecture Notes in Information Systems and Organization from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().