A methodological proposal to estimate the correlation coefficient under range restriction and clustering
Luz A. Pereira,
César Ojeda and
Jonnathan Cabrera
Journal of Applied Statistics, 2026, vol. 53, issue 1, 84-96
Abstract:
We propose a methodology for estimating the correlation coefficient between paired observations in the presence of range restrictions and clusters. We use a Bayesian nonparametric mixture model with Gaussian kernels to estimate the bivariate density. Within this model, we assume that the parameters of the Gaussian kernel follow a Dirichlet process, resulting in a flexible model that incorporates all available information. The overall correlation coefficient for the observations is estimated posteriorly from the components of the posterior variance-covariance matrix. Through a Monte Carlo study, we demonstrate that our methodology achieves the lowest Root Mean Squared Error (RMSE) compared to classical alternatives such as Pearson, Thorndike, FIML, and MICE. Additionally, we apply our methodology to real data obtained from one of the principal state universities in Colombia. In this application, our main objective is to estimate the relationship between admission test scores and grade-point averages.
Date: 2026
References: Add references at CitEc
Citations:
Downloads: (external link)
http://hdl.handle.net/10.1080/02664763.2025.2503859 (text/html)
Access to full text is restricted to subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:taf:japsta:v:53:y:2026:i:1:p:84-96
Ordering information: This journal article can be ordered from
http://www.tandfonline.com/pricing/journal/CJAS20
DOI: 10.1080/02664763.2025.2503859
Access Statistics for this article
Journal of Applied Statistics is currently edited by Robert Aykroyd
More articles in Journal of Applied Statistics from Taylor & Francis Journals
Bibliographic data for series maintained by Chris Longhurst ().