The effect of measurement error on clustering
Paulina Pankowska (),
Daniel Oberski (),
Mauricio Garnier-Villarreal () and
Dimitris Pavlopoulos ()
Additional contact information
Paulina Pankowska: Utrecht University
Daniel Oberski: Utrecht University
Mauricio Garnier-Villarreal: Vrije Universiteit Amsterdam
Dimitris Pavlopoulos: Vrije Universiteit Amsterdam
Quality & Quantity: International Journal of Methodology, 2025, vol. 59, issue 5, No 38, 4825-4860
Abstract:
Abstract Clustering is a set of statistical techniques widely applied in the social sciences. While an important and useful tool, traditional clustering techniques tend to assume that the data are free from measurement error, which is often an unrealistic assumption. In this paper, we perform a Monte Carlo study to investigate the sensitivity of different clustering techniques to measurement error. We focus on three commonly used approaches: latent profile analysis (LPA), hierarchical clustering using Ward’s method, and k-means. We examine how the error affects the interpretability of the clusters and the classification of observations into clusters. Our results indicate that LPA fares better in the presence of error. In fact, clustering results from LPA can still be trusted when there is random error affecting one variable. K-means and Ward’s method, on the other hand, appear to already’break down’ when random error affects one variable and lead to inaccurate classifications. When the error is systematic and/or it affects more variables, all clustering methods produce severely biased results.
Keywords: Clustering; Latent profile analysis (LPA); k-means; Ward’s method; Measurement error (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s11135-025-02177-9 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:qualqt:v:59:y:2025:i:5:d:10.1007_s11135-025-02177-9
Ordering information: This journal article can be ordered from
http://www.springer.com/economics/journal/11135
DOI: 10.1007/s11135-025-02177-9
Access Statistics for this article
Quality & Quantity: International Journal of Methodology is currently edited by Vittorio Capecchi
More articles in Quality & Quantity: International Journal of Methodology from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().