Multivariable regression models with continuous covariates, with a practical emphasis on fractional polynomials and applications in clinical epidemiology
Patrick Royston ()
Additional contact information
Patrick Royston: MRC Clinical Trials Unit, London
German Stata Users' Group Meetings 2005 from Stata Users Group
Abstract:
Regression models play a central role in epidemiology and clinical studies. In epidemiology the emphasis is typically either on determining whether a given risk factor affects the outcome of interest (adjusted for confounders), or on estimating a dose/response curve for a given factor, again adjusting for confounders. An important class of clinical studies is the so-called prognostic factors studies, in which the outcome for patients with chronic diseases such as cancer is predicted from various clinical features. In both application areas, it is almost always necessary to build a multivariable model incorporating known or suspected influential variables while eliminating those found to be unimportant. It is commonplace for risk or prognostic factors to be measured on a continuous scale, an obvious example being a person's age. Conventionally, such factors are either modelled as linear functions or are converted into categories according to some chosen set of cut-points. However, categorisation and use of the resulting estimates is a procedure known to be fraught with difficulty. A linear function may fit the data badly and give misleading estimates of risk. Therefore, reliable approaches for representing the effects of continuous factors in multivariable models are urgently needed. Building multivariable regression models by selecting influential covariates and determining the functional form of the relationship between a continuous covariate and the outcome when analysing data from clinical and epidemiological studies is the main concern of this talk. Systematic procedures which combine selection of influential variables with determination of functional form for continuous factors are rare. Analysts may apply their individual subjective preferences for each part of the model-building process, estimate parameters for several models and then decide on the final strategy according to the results they find. By contrast, we will present here the multivariable fractional polynomial (MFP) approach as a systematic way to determine a multivariable regression model. The MFP approach was made generally available to Stata users in version 8 as the -mfp- command. Major concerns will be discussed, including robustness and possible model instability. Regarding determination of the functional form, we will also discuss some alternatives with more emphasis on local estimation of the function (e.g. splines). The MFP procedure may be used for various types of regression models (linear regression model, logistic model, Cox model, and many more). Examples with real data will be used as illustrations.
Date: 2005-04-03
References: Add references at CitEc
Citations: View citations in EconPapers (6)
Downloads: (external link)
http://repec.org/dsug2005/royston_berlin.ppt (application/x-mspowerpoint)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:boc:dsug05:01
Access Statistics for this paper
More papers in German Stata Users' Group Meetings 2005 from Stata Users Group Contact information at EDIRC.
Bibliographic data for series maintained by Christopher F Baum ().