EconPapers    
Economics at your fingertips  
 

A new framework for managing and analyzing multiply imputed data in Stata

John B. Carlin (), John C. Galati and Patrick Royston ()
Additional contact information
John B. Carlin: Clinical Epidemiology & Biostatistics Unit, Murdoch Children’s Research Institute & University of Melbourne
John C. Galati: Clinical Epidemiology & Biostatistics Unit, Murdoch Children’s Research Institute & University of Melbourne
Patrick Royston: Cancer Group, MRC Clinical Trials Unit

Stata Journal, 2008, vol. 8, issue 1, 49-67

Abstract: A new set of tools is described for performing analyses of an ensemble of datasets that includes multiple copies of the original data with imputations of missing values, as required for the method of multiple imputation. The tools replace those originally developed by the authors. They are based on a simple data management paradigm in which the imputed datasets are all stored along with the original data in a single dataset with a vertically stacked format, as proposed by Royston in his ice and micombine commands. Stacking into a single dataset simplifies the management of the imputed datasets compared with storing them individually. Analysis and manipulation of the stacked datasets is performed with a new prefix command, mim, which can accommodate data imputed by any method as long as a few simple rules are followed in creating the imputed data. mim can validly fit most of the regression models available in Stata to multiply imputed datasets, giving parameter estimates and confidence intervals computed according to Rubin’s results for multiple imputation inference. Particular attention is paid to limiting the available postestimation commands to those that are known to be valid within the multiple imputation context. However, the user has flexibility to override these defaults. Features of these new tools are illustrated using two previously published examples. Copyright 2008 by StataCorp LP.

Keywords: mim; mimstack; ice; micombine; miset; mifit; multiple imputa- tion; missing data; missing at random (search for similar items in EconPapers)
Date: 2008
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (19)

Downloads: (external link)
http://www.stata-journal.com/article.html?article=st0139
http://www.stata-journal.com/software/sj8-1/st0139/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:tsj:stataj:v:8:y:2008:i:1:p:49-67

Ordering information: This journal article can be ordered from
http://www.stata-journal.com/subscription.html

Access Statistics for this article

Stata Journal is currently edited by Nicholas J. Cox and Stephen P. Jenkins

More articles in Stata Journal from StataCorp LLC
Bibliographic data for series maintained by Christopher F. Baum () and Lisa Gilmore ().

 
Page updated 2025-03-20
Handle: RePEc:tsj:stataj:v:8:y:2008:i:1:p:49-67