A graphic and tabular variable deduction method in logistic regression
Guoping Zeng
Communications in Statistics - Theory and Methods, 2022, vol. 51, issue 16, 5412-5427
Abstract:
Logistic regression assumes linearity between independent variables and log odds. Currently, there are two methods to check the linearity as part of variable deduction. The first method is to use the well-known Box–Tidwell test, which runs logistic regression with interaction terms for all independent variables. The second method estimates log odds by running logistic regression for all independent variables. However, these two methods have some common issues. Firstly, they don’t treat missing values, leading loss of information. Secondly, they don’t treat outliers. Thirdly, both run logistic regression for all independent variables, which is infeasible for big data. In this article, we propose a graphic and tabular variable deduction method by simultaneously checking the linearity assumption of each independent variable using a logit plot and treating missing values and outliers using a summary table. Our novel method is easy to implement for any real-life predictive modeling using logistic regression.
Date: 2022
References: Add references at CitEc
Citations:
Downloads: (external link)
http://hdl.handle.net/10.1080/03610926.2020.1839499 (text/html)
Access to full text is restricted to subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:taf:lstaxx:v:51:y:2022:i:16:p:5412-5427
Ordering information: This journal article can be ordered from
http://www.tandfonline.com/pricing/journal/lsta20
DOI: 10.1080/03610926.2020.1839499
Access Statistics for this article
Communications in Statistics - Theory and Methods is currently edited by Debbie Iscoe
More articles in Communications in Statistics - Theory and Methods from Taylor & Francis Journals
Bibliographic data for series maintained by Chris Longhurst ().