Prognostic Modeling with Logistic Regression Analysis
Ewout W. Steyerberg,
Marinus J. C. Eijkemans,
Frank E. Harrell and
J. Dik F. Habbema
Additional contact information
Ewout W. Steyerberg: Center for Clinical Decision Sciences, Department of Public Health, Erasmus University, Rotterdam, the Netherlands
Marinus J. C. Eijkemans: Center for Clinical Decision Sciences, Department of Public Health, Erasmus University, Rotterdam, the Netherlands
Frank E. Harrell: Division of Biostatistics and Epidemiology, Department of Health Evaluation Sciences, University of Virginia, Charlottesville, Virginia
J. Dik F. Habbema: Center for Clinical Decision Sciences, Department of Public Health, Erasmus University, Rotterdam, the Netherlands
Medical Decision Making, 2001, vol. 21, issue 1, 45-56
Abstract:
Clinical decision making often requires estimates of the likelihood of a dichotomous outcome in individual patients. When empirical data are available, these estimates may well be obtained from a logistic regression model. Several strategies may be followed in the development of such a model. In this study, the authors compare alternative strategies in 23 small subsamples from a large data set of patients with an acute myocardial infarction, where they developed predictive models for 30-day mortality. Evaluations were performed in an independent part of the data set. Specifically, the authors studied the effect of coding of covariables and stepwise selection on discriminative ability of the resulting model, and the effect of statistical “shrinkage†techniques on calibration. As expected, dichotomization of continuous covariables implied a loss of information. Remarkably, stepwise selection resulted in less discriminating models compared to full models including all available covariables, even when more than half of these were randomly associated with the outcome. Using qualitative information on the sign of the effect of predictors slightly improved the predictive ability. Calibration improved when shrinkage was applied on the standard maximum likelihood estimates of the regression coefficients. In conclusion, a sensible strategy in small data sets is to apply shrinkage methods in full models that include well-coded predictors that are selected based on external information.
Keywords: regression analysis; logistic models; bias; variable selection; prediction (search for similar items in EconPapers)
Date: 2001
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (10)
Downloads: (external link)
https://journals.sagepub.com/doi/10.1177/0272989X0102100106 (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:sae:medema:v:21:y:2001:i:1:p:45-56
DOI: 10.1177/0272989X0102100106
Access Statistics for this article
More articles in Medical Decision Making
Bibliographic data for series maintained by SAGE Publications ().