Early Detection of Students at Risk - Predicting Student Dropouts Using Administrative Student Data and Machine Learning Methods
Johannes Berens (),
Oster Simon (),
Kerstin Schneider () and
Julian Burghoff ()
Additional contact information
Johannes Berens: WIB, University of Wuppertal
Oster Simon: WIB, University of Wuppertal
Julian Burghoff: University of Düsseldorf
No sdp18006, Schumpeter Discussion Papers from Universitätsbibliothek Wuppertal, University Library
Abstract:
High rates of student attrition in tertiary education are a major concern for universities and public policy, as dropout is not only costly for the students but also wastes public funds. To successfully reduce student attrition, it is imperative to understand which students are at risk of dropping out and what are the underlying determinants of dropout. We develop an early detection system (EDS) that uses machine learning and classic regression techniques to predict student success in tertiary education as a basis for a targeted intervention. The method developed in this paper is highly standardized and can be easily implemented in every German institution of higher education, as it uses student performance and demographic data collected, stored, and maintained by legal mandate at all German universities and therefore self-adjusts to the university where it is employed. The EDS uses regression analysis and machine learning methods, such as neural networks, decision trees and the AdaBoost algorithm to identify student characteristics which distinguish potential dropouts from graduates. The EDS we present is tested and applied on a medium-sized state university with 23,000 students and a medium-sized private university of applied sciences with 6,700 students. Both institutes of higher education differ considerably in their organization, tuition fees and student-teacher ratios. Our results indicate a prediction accuracy at the end of the first semester of 79% for the state university and 85% for the private university of applied sciences. Furthermore, accuracy of the EDS increases with each completed semester as new performance data becomes available. After the fourth semester, the accuracy improves to 90% for the state university and 95% for the private university of applied sciences. At the day of enrollment the accuracy, relying only on demographic data, is 68% for the state university and 67% for the private university.
Pages: 33
Date: 2018-07
New Economics Papers: this item is included in nep-big and nep-cmp
References: Add references at CitEc
Citations: View citations in EconPapers (3)
Downloads: (external link)
https://elekpub.bib.uni-wuppertal.de/ubwhsmig/down ... riginalFilename=true (application/pdf)
Related works:
Working Paper: Early Detection of Students at Risk – Predicting Student Dropouts Using Administrative Student Data and Machine Learning Methods (2018)
Working Paper: Early Detection of Students at Risk - Predicting Student Dropouts Using Administrative Student Data and Machine Learning Methods (2018)
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:bwu:schdps:sdp18006
Access Statistics for this paper
More papers in Schumpeter Discussion Papers from Universitätsbibliothek Wuppertal, University Library
Bibliographic data for series maintained by Frank Hoffmann ().