EconPapers    
Economics at your fingertips  
 

Conditional probability estimation based classification with class label missing at random

Ying Sheng and Qihua Wang

Journal of Multivariate Analysis, 2020, vol. 176, issue C

Abstract: For binary classification, it is common that class labels of some subjects are missing. Generally, the complete case analysis and the two stage procedure can be used to extend existing full data classification methods to deal with classification with missing class labels. Nevertheless, these two approaches cannot take full advantage of unlabeled subjects. In this paper, binary classification with the class label missing at random (MAR) is considered. Based on the inverse probability weighting (IPW) method and the augmented inverse probability weighting (AIPW) method, two new methods called IPW–CPC and AIPW–CPC are proposed to construct powerful classifiers by estimating the conditional probability in a reproducing kernel Hilbert space (RKHS). Compared with the complete case analysis and the two stage procedure, the proposed IPW–CPC and AIPW–CPC methods can make the best use of unlabeled subjects, which contributes a lot to improving classification accuracy. Theoretically, we show that conditional misclassification rates of the proposed classifiers converge to the Bayes misclassification rate in probability and rates of convergence are also obtained. Finally, simulations and the real data analysis well demonstrate good performances of the proposed IPW–CPC and AIPW–CPC methods in comparison with existing methods.

Keywords: Binary classification; Conditional probability estimation; Missing at random; Reproducing kernel Hilbert space (search for similar items in EconPapers)
Date: 2020
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0047259X19302015
Full text for ScienceDirect subscribers only

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:eee:jmvana:v:176:y:2020:i:c:s0047259x19302015

Ordering information: This journal article can be ordered from
http://www.elsevier.com/wps/find/supportfaq.cws_home/regional
https://shop.elsevie ... _01_ooc_1&version=01

DOI: 10.1016/j.jmva.2019.104566

Access Statistics for this article

Journal of Multivariate Analysis is currently edited by de Leeuw, J.

More articles in Journal of Multivariate Analysis from Elsevier
Bibliographic data for series maintained by Catherine Liu ().

 
Page updated 2025-03-19
Handle: RePEc:eee:jmvana:v:176:y:2020:i:c:s0047259x19302015