Error rate control for classification rules in multiclass mixture models

Mary-Huard Tristan, Perduca Vittorio, Martin-Magniette Marie-Laure and Blanchard Gilles
Additional contact information
Mary-Huard Tristan: MIA-Paris, INRAE, AgroParisTech, Université Paris-Saclay, Paris, 75005, France
Perduca Vittorio: Laboratoire MAP5 (UMR CNRS 8145), Université Paris Descartes, Paris
Martin-Magniette Marie-Laure: MIA-Paris, INRAE, AgroParisTech, Université Paris-Saclay, Paris, 75005, France
Blanchard Gilles: Laboratoire de Mathématiques d’Orsay, Université Paris-Sud, Saint-Aubin, Île-de-France, France

The International Journal of Biostatistics, 2022, vol. 18, issue 2, 381-396

Abstract: In the context of finite mixture models, one considers the problem of classifying as many observations as possible into the classes of interest while controlling the classification error rate in these same classes. As in statistical test theory, different type I-like and type II-like classification error rates can be defined, along with their associated optimal rules, where optimality means minimizing the type II error rate while controlling the type I error rate at some nominal level. It is first shown that finding an optimal classification rule boils down to searching for an optimal region of the observation space in which to apply the classical Maximum A Posteriori (MAP) rule. Depending on the misclassification rate to be controlled, the shape of the optimal region is provided, along with a heuristic to compute the optimal classification rule in practice. In particular, a multiclass FDR-like optimal rule is defined and compared to the thresholded MAP rule that is used in most applications. It is shown on both simulated and real datasets that the FDR-like optimal rule may be significantly less conservative than the thresholded MAP rule.
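
The contrast drawn in the abstract can be illustrated with a minimal sketch. It is not the optimal rule of the article, whose region shape and heuristic are given only in the full text. It assumes that posterior class probabilities are already available from a fitted mixture model, that the thresholded MAP rule uses the cutoff 1 - alpha, and that the error rate among classified observations is estimated by the average of one minus the maximum posterior. The function names and the toy posteriors are hypothetical.

```python
from typing import List, Optional, Sequence


def map_labels(posteriors: Sequence[Sequence[float]]) -> List[int]:
    """Index of the most probable class for each observation (MAP rule)."""
    return [max(range(len(p)), key=lambda k: p[k]) for p in posteriors]


def thresholded_map(posteriors: Sequence[Sequence[float]],
                    threshold: float) -> List[Optional[int]]:
    """MAP label when the maximum posterior reaches the threshold, else None."""
    labels = map_labels(posteriors)
    return [lab if p[lab] >= threshold else None
            for lab, p in zip(labels, posteriors)]


def average_error_rule(posteriors: Sequence[Sequence[float]],
                       alpha: float) -> List[Optional[int]]:
    """Classify the largest set of most-confident observations whose
    average estimated misclassification probability stays <= alpha.

    Assumption: 1 - max posterior is used as the per-observation
    error estimate; this is a generic FDR-style device, not the
    article's optimal rule.
    """
    labels = map_labels(posteriors)
    errors = [1.0 - p[lab] for lab, p in zip(labels, posteriors)]
    order = sorted(range(len(errors)), key=lambda i: errors[i])
    running, n_keep = 0.0, 0
    for rank, i in enumerate(order, start=1):
        running += errors[i]
        # Errors are sorted, so the running mean never decreases.
        if running / rank <= alpha:
            n_keep = rank
    decisions: List[Optional[int]] = [None] * len(errors)
    for i in order[:n_keep]:
        decisions[i] = labels[i]
    return decisions


# Hypothetical posteriors for four observations and three classes.
posteriors = [
    [0.98, 0.01, 0.01],
    [0.02, 0.96, 0.02],
    [0.10, 0.05, 0.85],
    [0.40, 0.35, 0.25],
]
alpha = 0.1
by_threshold = thresholded_map(posteriors, 1.0 - alpha)
by_average = average_error_rule(posteriors, alpha)
```

On these toy values the thresholded rule leaves the third observation unclassified because its maximum posterior (0.85) is below 0.9, whereas the averaged rule classifies it: the mean estimated error over the three most confident observations is 0.07, still below alpha. This is the sense in which a rule controlling an average error rate can be less conservative than a per-observation threshold.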

Keywords: classification rule; error rate control; mixture models
Date: 2022

Downloads: (external link)
https://doi.org/10.1515/ijb-2020-0105 (text/html)
For access to full text, subscription to the journal or payment for the individual article is required.

Persistent link: https://EconPapers.repec.org/RePEc:bpj:ijbist:v:18:y:2022:i:2:p:381-396:n:3

Ordering information: This journal article can be ordered from
https://www.degruyter.com/journal/key/ijb/html

DOI: 10.1515/ijb-2020-0105

The International Journal of Biostatistics is currently edited by Antoine Chambaz, Alan E. Hubbard and Mark J. van der Laan

Bibliographic data for series maintained by Peter Golla.

 
Page updated 2025-03-19
Handle: RePEc:bpj:ijbist:v:18:y:2022:i:2:p:381-396:n:3