Ein einfaches Verfahren zur Identifikation von Ausreißern bei multivariaten Daten
Günter Buttler
No 09/1996, Discussion Papers from Friedrich-Alexander University Erlangen-Nuremberg, Chair of Statistics and Econometrics
Abstract:
Statistical analysis is often disturbed by objects which are extremely different from the rest of the data. Those outliers can be due to different causes. Therefore it is always recommended to examine them separately. Outliers in one or two-dimensional cases are easily recognized in a frequency distribution. In multidimensional data they can be identified by sub-ordering. It is proposed to do this by calculating pairwise distances. The necessary standardization of the variables can be done by using the sum of all pairwise distances. Proceeding this way possible outliers can easily be identified in tabular as well as in graphical form. It can also be demonstrated which dimension, that is which variables are contributing to the outlier status. As it is quite simple to remove any object or variable, one can see what is happening to the rest of the data without those extreme values.
Date: 1996
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (2)
Downloads: (external link)
https://www.econstor.eu/bitstream/10419/29594/1/615880223.pdf (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:zbw:faucse:091996
Access Statistics for this paper
More papers in Discussion Papers from Friedrich-Alexander University Erlangen-Nuremberg, Chair of Statistics and Econometrics Contact information at EDIRC.
Bibliographic data for series maintained by ZBW - Leibniz Information Centre for Economics ().