Early Prediction of University Dropouts – A Random Forest Approach
Behr Andreas (),
Giese Marco (),
Teguim K Herve D. () and
Katja Theune
Additional contact information
Behr Andreas: Chair of Statistics, University of Duisburg-Essen, Essen, Germany
Giese Marco: Chair of Statistics, University of Duisburg-Essen, Essen, Germany
Teguim K Herve D.: Chair of Statistics, University of Duisburg-Essen, Essen, Germany
Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), 2020, vol. 240, issue 6, 743-789
Abstract:
We predict university dropout using random forests based on conditional inference trees and on a broad German data set covering a wide range of aspects of student life and study courses. We model the dropout decision as a binary classification (graduate or dropout) and focus on very early prediction of student dropout by stepwise modeling students’ transition from school (pre-study) over the study-decision phase (decision phase) to the first semesters at university (early study phase). We evaluate how predictive performance changes over the three models, and observe a substantially increased performance when including variables from the first study experiences, resulting in an AUC (area under the curve) of 0.86. Important predictors are the final grade at secondary school, and also determinants associated with student satisfaction and their subjective academic self-concept and self-assessment. A direct outcome of this research is the provision of information to universities wishing to implement early warning systems and more personalized counseling services to support students at risk of dropping out during an early stage of study.
Keywords: student dropout; higher education; dropout prediction; educational data mining; random forest (search for similar items in EconPapers)
JEL-codes: I23 (search for similar items in EconPapers)
Date: 2020
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (4)
Downloads: (external link)
https://doi.org/10.1515/jbnst-2019-0006 (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:jns:jbstat:v:240:y:2020:i:6:p:743-789:n:1
DOI: 10.1515/jbnst-2019-0006
Access Statistics for this article
Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik) is currently edited by Peter Winker
More articles in Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik) from De Gruyter
Bibliographic data for series maintained by Peter Golla ().