Constructing Common Factors from Continuous and Categorical Data
Serena Ng
Econometric Reviews, 2015, vol. 34, issue 6-10, 1141-1171
Abstract:
The method of principal components is widely used to estimate common factors in large panels of continuous data. This article first reviews alternative methods that obtain the common factors by solving a Procrustes problem. While these matrix decomposition methods do not specify the probabilistic structure of the data and hence do not permit statistical evaluations of the estimates, they can be extended to analyze categorical data. This involves the additional step of quantifying the ordinal and nominal variables. The article then reviews and explores the numerical properties of these methods. An interesting finding is that the factor space can be quite precisely estimated directly from categorical data without quantification. This may require using a larger number of estimated factors to compensate for the information loss in categorical variables. Separate treatment of categorical and continuous variables may not be necessary if structural interpretation of the factors is not required, such as in forecasting exercises.
Date: 2015
References: View references in EconPapers View complete reference list from CitEc 
Citations: View citations in EconPapers (7) 
Downloads: (external link)
http://hdl.handle.net/10.1080/07474938.2014.956625 (text/html)
Access to full text is restricted to subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX 
RIS (EndNote, ProCite, RefMan) 
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:taf:emetrv:v:34:y:2015:i:6-10:p:1141-1171
Ordering information: This journal article can be ordered from
http://www.tandfonline.com/pricing/journal/LECR20
DOI: 10.1080/07474938.2014.956625
Access Statistics for this article
Econometric Reviews is currently edited by Dr. Essie Maasoumi
More articles in Econometric Reviews  from  Taylor & Francis Journals
Bibliographic data for series maintained by  ().