Near-collinearity in linear regression revisited: The numerical vs. the statistical perspective
Aris Spanos
Communications in Statistics - Theory and Methods, 2019, vol. 48, issue 22, 5492-5516
Abstract:
The article compares the numerical and statistical perspectives on the problem of near-collinearity with a view to investigate whether assigning statistical interpretation to numerical measures changes the original problem in ways that calls into question certain aspects of the current conventional wisdom. The numerical perspective views the problem as stemming from the ill-conditioning of the (X⊺X) matrix, irrespective of whether the numbers denote data or not. The statistical perspective frames the problem in terms of sample correlations among regressors (simple and partial). It is argued that this reframing changes the nature of the numerical problem into a problem relating to the probabilistic structure of the Linear Regression model. The disparity between the two perspectives arises because high correlations among regressors is neither necessary nor sufficient for (X⊺X) to be ill-conditioned. Moreover, the sample correlations are highly vulnerable to statistical misspecification. For instance, the presence of mean t-heterogeneity will render all statistical measures of near-collinearity untrustworthy. It is argued that many confusions in the near-collinearity literature arise from erroneously attributing symptoms of statistical misspecification to the presence of near-collinearity when the latter is misdiagnosed using unreliable statistical measures.
Date: 2019
References: Add references at CitEc
Citations: View citations in EconPapers (1)
Downloads: (external link)
http://hdl.handle.net/10.1080/03610926.2018.1513147 (text/html)
Access to full text is restricted to subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:taf:lstaxx:v:48:y:2019:i:22:p:5492-5516
Ordering information: This journal article can be ordered from
http://www.tandfonline.com/pricing/journal/lsta20
DOI: 10.1080/03610926.2018.1513147
Access Statistics for this article
Communications in Statistics - Theory and Methods is currently edited by Debbie Iscoe
More articles in Communications in Statistics - Theory and Methods from Taylor & Francis Journals
Bibliographic data for series maintained by Chris Longhurst ().