A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement

Jordan M. Wheeler, Allan S. Cohen and Shiyu Wang
Additional contact information
Jordan M. Wheeler: University of Nebraska-Lincoln
Shiyu Wang: University of Georgia

Journal of Educational and Behavioral Statistics, 2024, vol. 49, issue 5, 848-874

Abstract: Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming more common in educational measurement research as a method for analyzing students’ responses to constructed-response items. Two popular topic models are latent semantic analysis (LSA) and latent Dirichlet allocation (LDA). LSA uses linear algebra techniques, whereas LDA uses an assumed statistical model and generative process. In educational measurement, LSA is often used in algorithmic scoring of essays due to its high reliability and agreement with human raters. LDA is often used as a supplemental analysis to gain additional information about students, such as their thinking and reasoning. This article reviews and compares the LSA and LDA topic models. This article also introduces a methodology for comparing the semantic spaces obtained by the two models and uses a simulation study to investigate their similarities.
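The contrast the abstract draws between the two models can be made concrete with a small sketch. The Python code below is illustrative only and is not the authors' methodology or simulation design: it samples a toy corpus from the standard LDA generative process (Dirichlet-distributed topic proportions per document, then a topic and a word for each token) and then applies LSA as a truncated singular value decomposition of the resulting term-document count matrix. The function names, the corpus sizes, the Dirichlet hyperparameter values, and the use of raw counts without weighting are all assumptions made for this example.

```python
import numpy as np


def lda_generate(n_docs, doc_len, topic_word, alpha, rng):
    """Sample word counts from the LDA generative process (toy version).

    topic_word: (n_topics, vocab_size) array, each row a word distribution.
    alpha: symmetric Dirichlet hyperparameter for document-topic proportions.
    Returns (theta, counts) where counts has shape (n_docs, vocab_size).
    """
    n_topics, vocab_size = topic_word.shape
    # Each document gets its own topic proportions.
    theta = rng.dirichlet(np.full(n_topics, alpha), size=n_docs)
    counts = np.zeros((n_docs, vocab_size), dtype=int)
    for d in range(n_docs):
        # Draw a topic for every token, then a word from that topic.
        topics = rng.choice(n_topics, size=doc_len, p=theta[d])
        for t in topics:
            w = rng.choice(vocab_size, p=topic_word[t])
            counts[d, w] += 1
    return theta, counts


def lsa_document_coordinates(term_doc, k):
    """Project documents into a k-dimensional space via truncated SVD.

    term_doc: (vocab_size, n_docs) matrix of term frequencies.
    Returns an (n_docs, k) array of document coordinates.
    """
    _, s, vt = np.linalg.svd(term_doc, full_matrices=False)
    return (np.diag(s[:k]) @ vt[:k]).T


rng = np.random.default_rng(0)
# Hypothetical setup: 2 topics over a 6-word vocabulary.
topic_word = rng.dirichlet(np.full(6, 0.5), size=2)
theta, counts = lda_generate(n_docs=5, doc_len=20, topic_word=topic_word,
                             alpha=0.5, rng=rng)
doc_coords = lsa_document_coordinates(counts.T.astype(float), k=2)
```

Here `theta` holds the probabilistic document-topic proportions that LDA assumes, while `doc_coords` holds the unconstrained real-valued coordinates that LSA produces. The two representations are on different scales, which is one reason a dedicated methodology is needed to compare the semantic spaces.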

Keywords: topic models; latent semantic analysis; latent Dirichlet allocation; constructed-response items; semantic spaces
Date: 2024

Downloads: (external link)
https://journals.sagepub.com/doi/10.3102/10769986231209446 (text/html)

Persistent link: https://EconPapers.repec.org/RePEc:sae:jedbes:v:49:y:2024:i:5:p:848-874

DOI: 10.3102/10769986231209446

Page updated 2025-03-19
Handle: RePEc:sae:jedbes:v:49:y:2024:i:5:p:848-874