Forecasting Undergraduate Majors Using Academic Transcript Data
David Nathan Lang,
Alex Wang,
Nathan Dalal,
Andreas Paepcke and
Mitchell Stevens
No u2cwq, EdArXiv from Center for Open Science
Abstract:
Committing to a major is a fateful step in an undergraduate education, yet the relationship between courses taken early in an academic career and ultimate major selection remains little studied at scale. Using transcript data capturing the academic careers of 26,892 undergraduates enrolled at a private university between 2000 and 2020, we describe enrollment histories using natural-language methods and vector embeddings to forecast terminal major on the basis of course sequences beginning at college entry. We find (I) a student's very first enrolled course predicts major thirty times better than random guessing and more than a third better than majority-class voting, (II) modeling strategies substantially influence forecasting accuracy, and (III) course portfolios varies substantially within majors, raising novel questions what majors mean or signify in relation to undergraduate course histories.
Date: 2021-11-22
New Economics Papers: this item is included in nep-for
References: Add references at CitEc
Citations:
Downloads: (external link)
https://osf.io/download/6196e0af0b0c1e03e67fdb14/
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:osf:edarxi:u2cwq
DOI: 10.31219/osf.io/u2cwq
Access Statistics for this paper
More papers in EdArXiv from Center for Open Science
Bibliographic data for series maintained by OSF ().