ICS for complex data with application to outlier detection for density data objects
Christine Thomas-Agnan,
Camille Mondon,
Thi-Huong Trinh and
Anne Ruiz-Gazen
No 24_1585, TSE Working Papers from Toulouse School of Economics (TSE)
Abstract:
ICS (Invariant coordinate selection) is a method aimed at dimension reduction as a preliminary step for clustering and outlier detection. It can be applied on multivariate or functional data. This work introduces a coordinate-free definition of ICS and extends the ICS method to distributional data. Indeed the inherent constraints of density functions imply a necessary adaptation of functional ICS. Our first achievement is a coordinate-free version of ICS within the framework of Hilbert spaces, assuming that the data lies almost surely in a finite dimensional subspace. Using the Bayes space framework tailored for density functions, we express the centred log-ratio of the density curves in a subspace of L2 0(a, b) of zero-integral spline functions and conduct ICS in this finite dimensional subspace. We describe the different steps of the procedure for outlier detection and study the impact of some parameters of this procedure on the results. The methodology is then illustrated on a sample of daily maximum temperatures densities recorded across northern Vietnamese provinces between 1987 and 2016.
Keywords: Bayes spaces; distributional data; functional data; invariant coordinate selection; outlier detection; Vietnam temperature densities (search for similar items in EconPapers)
Date: 2024-10-14
New Economics Papers: this item is included in nep-sea and nep-tra
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.tse-fr.eu/sites/default/files/TSE/docu ... 2024/wp_tse_1585.pdf Working Paper Version (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:tse:wpaper:129830
Access Statistics for this paper
More papers in TSE Working Papers from Toulouse School of Economics (TSE) Contact information at EDIRC.
Bibliographic data for series maintained by ().