EconPapers    
Economics at your fingertips  
 

High-dimensional outlier detection using random projections

P. Navarro-Esteban () and J. A. Cuesta-Albertos ()
Additional contact information
P. Navarro-Esteban: Universidad de Cantabria
J. A. Cuesta-Albertos: Universidad de Cantabria

TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, 2021, vol. 30, issue 4, No 5, 908-934

Abstract: Abstract There exist multiple methods to detect outliers in multivariate data in the literature, but most of them require to estimate the covariance matrix. The higher the dimension, the more complex the estimation of the matrix becoming impossible in high dimensions. In order to avoid estimating this matrix, we propose a novel random projection-based procedure to detect outliers in Gaussian multivariate data. It consists in projecting the data in several one-dimensional subspaces where an appropriate univariate outlier detection method, similar to Tukey’s method but with a threshold depending on the initial dimension and the sample size, is applied. The required number of projections is determined using sequential analysis. Simulated and real datasets illustrate the performance of the proposed method.

Keywords: Outlier detection; Multivariate data; High-dimensional data; Random projections; Sequential analysis; 62H15; 62L10 (search for similar items in EconPapers)
Date: 2021
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
http://link.springer.com/10.1007/s11749-020-00750-y Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:testjl:v:30:y:2021:i:4:d:10.1007_s11749-020-00750-y

Ordering information: This journal article can be ordered from
http://www.springer. ... cs/journal/11749/PS2

DOI: 10.1007/s11749-020-00750-y

Access Statistics for this article

TEST: An Official Journal of the Spanish Society of Statistics and Operations Research is currently edited by Alfonso Gordaliza and Ana F. Militino

More articles in TEST: An Official Journal of the Spanish Society of Statistics and Operations Research from Springer, Sociedad de Estadística e Investigación Operativa
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-03-20
Handle: RePEc:spr:testjl:v:30:y:2021:i:4:d:10.1007_s11749-020-00750-y