Multivariate outlier detection in Stata

Vincenzo Verardi and Catherine Dehon

Stata Journal, 2010, vol. 10, issue 2, 259-266

Abstract: Before implementing any multivariate statistical analysis based on empirical covariance matrices, it is important to check whether outliers are present because their existence could induce significant biases. In this article, we present the minimum covariance determinant estimator, which is commonly used in robust statistics to estimate location parameters and multivariate scales. These estimators can be used to robustify Mahalanobis distances and to identify outliers. Verardi and Croux (2009, Stata Journal 9: 439–453; 2010, Stata Journal 10: 313) programmed this estimator in Stata and made it available with the mcd command. The implemented algorithm is relatively fast and, as we show in the simulation example section, outperforms the methods already available in Stata, such as the Hadi method. Copyright 2010 by StataCorp LP.
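
Illustration (not part of the article): the abstract's idea of "robustifying" Mahalanobis distances can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not the authors' Stata implementation and not the algorithm behind the mcd command. It finds the minimum covariance determinant subset by brute force, which is only feasible for tiny samples, handles bivariate data only, and omits the consistency correction usually applied to the raw scatter estimate. The data, the subset size h, and all function names are hypothetical.

```python
from itertools import combinations
from math import log


def mean_cov(points):
    """Sample mean and 2x2 sample covariance (n - 1 denominator)."""
    n = len(points)
    mx = sum(x for x, _ in points) / n
    my = sum(y for _, y in points) / n
    sxx = sum((x - mx) ** 2 for x, _ in points) / (n - 1)
    syy = sum((y - my) ** 2 for _, y in points) / (n - 1)
    sxy = sum((x - mx) * (y - my) for x, y in points) / (n - 1)
    return (mx, my), (sxx, sxy, syy)


def det2(cov):
    """Determinant of a symmetric 2x2 matrix stored as (sxx, sxy, syy)."""
    sxx, sxy, syy = cov
    return sxx * syy - sxy * sxy


def sq_distance(point, center, cov):
    """Squared Mahalanobis distance of point from center under cov."""
    sxx, sxy, syy = cov
    dx, dy = point[0] - center[0], point[1] - center[1]
    return (syy * dx * dx - 2.0 * sxy * dx * dy + sxx * dy * dy) / det2(cov)


def brute_force_mcd(points, h):
    """Exhaustive search for the h-subset with the smallest covariance determinant."""
    best = min(
        combinations(range(len(points)), h),
        key=lambda idx: det2(mean_cov([points[i] for i in idx])[1]),
    )
    center, cov = mean_cov([points[i] for i in best])
    return best, center, cov


# Hypothetical data: eight points near (1, 1) and two gross outliers at the end.
data = [
    (1.0, 1.1), (1.2, 0.9), (0.8, 1.0), (1.1, 1.3),
    (0.9, 0.8), (1.3, 1.2), (1.0, 0.7), (0.7, 1.2),
    (6.0, 6.5), (6.5, 5.8),
]
h = 8  # assumed subset size for this illustration

best, center, cov = brute_force_mcd(data, h)

# 97.5% quantile of a chi-square with 2 degrees of freedom,
# which has the closed form -2 * ln(1 - p).
cutoff = -2.0 * log(1.0 - 0.975)

flagged = [i for i, p in enumerate(data) if sq_distance(p, center, cov) > cutoff]
print(flagged)  # [8, 9]
```

Because the location and scatter are estimated from the clean subset only, the two outlying points cannot inflate the scatter estimate and therefore stand out clearly in the distances. Exhaustive search grows combinatorially with the sample size, so it serves here only to show the definition of the estimator.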

Keywords: mcd; detection; multivariate outliers; robustness; minimum covariance determinant
Date: 2010
Note: to access software from within Stata, net describe http://www.stata-journal.com/software/sj10-2/st0192/
Citations: 38 (in EconPapers)

Downloads: (external link)
http://www.stata-journal.com/article.html?article=st0192

Persistent link: https://EconPapers.repec.org/RePEc:tsj:stataj:v:10:y:2010:i:2:p:259-266

Ordering information: This journal article can be ordered from
http://www.stata-journal.com/subscription.html

Stata Journal is currently edited by Nicholas J. Cox and Stephen P. Jenkins

More articles in Stata Journal from StataCorp LLC
Bibliographic data for series maintained by Christopher F. Baum and Lisa Gilmore.

 
Page updated 2025-03-20
Handle: RePEc:tsj:stataj:v:10:y:2010:i:2:p:259-266