ICS for complex data with application to outlier detection for density data
Christine Thomas-Agnan,
Camille Mondon,
Thi-Huong Trinh and
Anne Ruiz-Gazen
No 24-1585, TSE Working Papers from Toulouse School of Economics (TSE)
Abstract:
Invariant coordinate selection (ICS) is a dimension reduction method, used as a preliminary step for clustering and outlier detection. It has been primarily applied to multivariate data. This work introduces a coordinate-free definition of ICS in an abstract Euclidean space and extends the method to complex data. Functional and distributional data are preprocessed into a finite-dimensional subspace. For example, in the framework of Bayes Hilbert spaces, distributional data are smoothed into compositional spline functions through the Maximum Penalised Likelihood method. We describe an outlier detection procedure for complex data and study the impact of some preprocessing parameters on the results. We compare our approach with other outlier detection methods through simulations, producing promising results in scenarios with a low proportion of outliers. ICS allows detecting abnormal climate events in a sample of daily maximum temperature distributions recorded across the provinces of Northern Vietnam between 1987 and 2016.
Keywords: Bayes spaces; Distributional data; Extreme weather; Functional data; Invariant coordinate selection; Outlier detection; Temperature distribution (search for similar items in EconPapers)
Date: 2024-10-14, Revised 2025-05
New Economics Papers: this item is included in nep-sea and nep-tra
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.tse-fr.eu/sites/default/files/TSE/docu ... 2024/wp_tse_1585.pdf Working Paper Version (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:tse:wpaper:129830
Access Statistics for this paper
More papers in TSE Working Papers from Toulouse School of Economics (TSE) Contact information at EDIRC.
Bibliographic data for series maintained by ().