Modeling and variable selection in epidemiologic analysis
S. Greenland
American Journal of Public Health, 1989, vol. 79, issue 3, 340-349
Abstract:
This paper provides an overview of problems in multivariate modeling of epidemiologic data, and examines some proposed solutions. Special attention is given to the task of model selection, which involves selection of the model form, selection of the variables to enter the model form, selection of the variables to enter the model, and selection of the form of these variables in the model. Several conclusions are drawn, among them: a) model and variable forms should be selected based on regression diagnostic procedures, in addition to goodness-of fit tests; b) variable-selection algorithms in current packaged programs, such as conventional stepwise regression, can easily lead to invalid estimates and tests of effect; and c) variable selection is better approached by direct estimation of the degree of confounding produced by each variable than by significance-testing algorithms. As a general rule, before using a model to estimate effects, one should evaluate the assumptions implied by the model against both the data and prior information.
Date: 1989
References: Add references at CitEc
Citations: View citations in EconPapers (23)
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:aph:ajpbhl:1989:79:3:340-349_6
Access Statistics for this article
American Journal of Public Health is currently edited by Alfredo Morabia
More articles in American Journal of Public Health from American Public Health Association
Bibliographic data for series maintained by Christopher F Baum ().