Robust PCA for High-dimensional Data
M. Hubert (),
Peter Rousseeuw and
S. Verboven ()
Additional contact information
M. Hubert: Catholic University of Leuven, Department of Mathematics
S. Verboven: University of Antwerp (UIA), Department of Mathematics and Computer Science
A chapter in Developments in Robust Statistics, 2003, pp 169-179 from Springer
Abstract:
Summary Principal component analysis (PCA) is a well-known technique for dimension reduction. Classical PCA is based on the empirical mean and covariance matrix of the data, and hence is strongly affected by outlying observations. Therefore, there is a huge need for robust PCA. When the original number of variables is small enough, and in particular smaller than the number of observations, it is known that one can apply a robust estimator of multivariate location and scatter and compute the eigenvectors of the scatter matrix. The other situation, where there are many variables (often even more variables than observations), has received less attention in the robustness literature. We will compare two robust methods for this situation. The first one is based on projection pursuit (Li and Chen, 1985; Rousseeuw and Croux, 1993; Croux and Ruiz-Gazen, 1996, 2000; Hubert et al., 2002). The second method is a new proposal, which combines the notion of outlyingness (Stahel, 1981; Donoho, 1982) with the FAST-MCD algorithm (Rousseeuw and Van Driessen, 1999). The performance and the robustness of these two methods are compared through a simulation study. We also illustrate the new method on a chemometrical data set.
Keywords: Covariance Matrix; Mean Square Error; Projection Pursuit; Affine Subspace; Robust Principal Component Analysis (search for similar items in EconPapers)
Date: 2003
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:sprchp:978-3-642-57338-5_14
Ordering information: This item can be ordered from
http://www.springer.com/9783642573385
DOI: 10.1007/978-3-642-57338-5_14
Access Statistics for this chapter
More chapters in Springer Books from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().