Unobserved classes and extra variables in high-dimensional discriminant analysis

Michael Fop, Pierre-Alexandre Mattei, Charles Bouveyron and Thomas Brendan Murphy
Additional contact information
Michael Fop: University College Dublin
Pierre-Alexandre Mattei: Université Côte d’Azur, Inria, CNRS, Laboratoire J.A. Dieudonné, Maasai team
Charles Bouveyron: Université Côte d’Azur, Inria, CNRS, Laboratoire J.A. Dieudonné, Maasai team
Thomas Brendan Murphy: Université Côte d’Azur, Inria, CNRS, Laboratoire J.A. Dieudonné, Maasai team

Advances in Data Analysis and Classification, 2022, vol. 16, issue 1, No 4, 55-92

Abstract: In supervised classification problems, the test set may contain data points belonging to classes not observed in the learning phase. Moreover, the same units in the test data may be measured on a set of additional variables recorded at a subsequent stage with respect to when the learning sample was collected. In this situation, the classifier built in the learning phase needs to adapt to handle potential unknown classes and the extra dimensions. We introduce a model-based discriminant approach, Dimension-Adaptive Mixture Discriminant Analysis (D-AMDA), which can detect unobserved classes and adapt to the increasing dimensionality. Model estimation is carried out via a full inductive approach based on an EM algorithm. The method is then embedded in a more general framework for adaptive variable selection and classification suitable for data of large dimensions. A simulation study and an artificial experiment related to classification of adulterated honey samples are used to validate the ability of the proposed framework to deal with complex situations.

Keywords: Adaptive supervised classification; Conditional estimation; Model-based discriminant analysis; Unobserved classes; Variable selection; 62H30
Date: 2022
Citations: 1 (in EconPapers)

Downloads: (external link)
http://link.springer.com/10.1007/s11634-021-00474-3 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Persistent link: https://EconPapers.repec.org/RePEc:spr:advdac:v:16:y:2022:i:1:d:10.1007_s11634-021-00474-3

Ordering information: This journal article can be ordered from
http://www.springer. ... ds/journal/11634/PS2

DOI: 10.1007/s11634-021-00474-3

Advances in Data Analysis and Classification is currently edited by H.-H. Bock, W. Gaul, A. Okada, M. Vichi and C. Weihs

More articles in Advances in Data Analysis and Classification from Springer, German Classification Society - Gesellschaft für Klassifikation (GfKl), Japanese Classification Society (JCS), Classification and Data Analysis Group of the Italian Statistical Society (CLADAG), International Federation of Classification Societies (IFCS)
Bibliographic data for series maintained by Sonal Shukla and Springer Nature Abstracting and Indexing.

 
Page updated 2025-03-20
Handle: RePEc:spr:advdac:v:16:y:2022:i:1:d:10.1007_s11634-021-00474-3