Zero-Inflated Binary Classification Model with Elastic Net Regularization

Xin, Hua; Lio, Yuhlong; Chen, Hsien-Ching; Tsai, Tzong-Ru

Zero-Inflated Binary Classification Model with Elastic Net Regularization

Hua Xin, Yuhlong Lio, Hsien-Ching Chen and Tzong-Ru Tsai ()
Additional contact information
Hua Xin: School of Mathematics and Statistics, Northeast Petroleum University, Daqing 163318, China
Yuhlong Lio: Department of Mathematical Sciences, University of South Dakota, Vermillion, SD 57069, USA
Hsien-Ching Chen: Department of Statistics, Tamkang University, Tamsui District, New Taipei City 251301, Taiwan
Tzong-Ru Tsai: Department of Statistics, Tamkang University, Tamsui District, New Taipei City 251301, Taiwan

Mathematics, 2024, vol. 12, issue 19, 1-17

Abstract: Zero inflation and overfitting can reduce the accuracy rate of using machine learning models for characterizing binary data sets. A zero-inflated Bernoulli (ZIBer) model can be the right model to characterize zero-inflated binary data sets. When the ZIBer model is used to characterize zero-inflated binary data sets, overcoming the overfitting problem is still an open question. To improve the overfitting problem for using the ZIBer model, the minus log-likelihood function of the ZIBer model with the elastic net regularization rule for an overfitting penalty is proposed as the loss function. An estimation procedure to minimize the loss function is developed in this study using the gradient descent method (GDM) with the momentum term as the learning rate. The proposed estimation method has two advantages. First, the proposed estimation method can be a general method that simultaneously uses L 1 - and L 2 -norm terms for penalty and includes the ridge and least absolute shrinkage and selection operator methods as special cases. Second, the momentum learning rate can accelerate the convergence of the GDM and enhance the computation efficiency of the proposed estimation procedure. The parameter selection strategy is studied, and the performance of the proposed method is evaluated using Monte Carlo simulations. A diabetes example is used as an illustration.

Keywords: expectation-maximization algorithm; gradient descent method; learning rate; maximum likelihood estimation; zero-inflated model (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
https://www.mdpi.com/2227-7390/12/19/2990/pdf (application/pdf)
https://www.mdpi.com/2227-7390/12/19/2990/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:12:y:2024:i:19:p:2990-:d:1485801

Access Statistics for this article

Mathematics is currently edited by Ms. Emma He

More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().