Missing Data
Geert Molenberghs,
Caroline Beunckens,
Ivy Jansen,
Herbert Thijs,
Geert Verbeke and
Michael G. Kenward
Additional contact information
Geert Molenberghs: Limburg University Centrum, Biostatistics Centre for Statistics
Caroline Beunckens: Limburg University Centrum, Biostatistics Centre for Statistics
Ivy Jansen: Limburg University Centrum, Biostatistics Centre for Statistics
Herbert Thijs: Limburg University Centrum, Biostatistics Centre for Statistics
Geert Verbeke: Limburg University Centrum, Biostatistics Centre for Statistics
Michael G. Kenward: London School of Hygiene & Tropical Medicine, Medical Statistics Unit
Chapter II.6 in Handbook of Epidemiology, 2005, pp 767-827 from Springer
Abstract:
Abstract The problem of dealing with missing values is common throughout statistical work and is present whenever human subjects are enrolled. Respondents may refuse participation or may be unreachable. Patients in clinical and epidemiological studies may with draw their initial consent without further explanation. Early work on missing values was largely concerned with algorithmic and computational solutions to the induced lack of balance or deviations from the intended study design (Afifi and Elashoff 1966; Hartley and Hocking 1971). More recently general algorithms such as the Expectation-Maximization (EM) (Dempster et al. 1977), and data imputation and augmentation procedures (Rubin1987;Tanner andWong1987) combined with powerful computing resources have largely provided a solution to this aspect of the problem. There remains the very difficult and important question of assessing the impact of missing data on subsequent statistical inference. Conditions can be formulated, under which an analysis that proceeds as if the missing data are missing by design, that is, ignoring the missing value process, can provide valid answers to study questions. While such an approach is attractive from a pragmatic point of view, the difficulty is that such conditions can rarely be assumed to hold with full certainty. Indeed, assumptions will be required that cannot be assessed from the data under analysis. Hence in this setting there cannot be anything that could be termed a definitive analysis, and hence any analysis of preference is ideally to be supplemented with a so-called sensitivity analysis.
Keywords: Generalize Linear Mixed Model; Generalize Estimate Equation; American Statistical Association; Last Observation Carry Forward; Royal Statistical Society Series (search for similar items in EconPapers)
Date: 2005
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:sprchp:978-3-540-26577-1_20
Ordering information: This item can be ordered from
http://www.springer.com/9783540265771
DOI: 10.1007/978-3-540-26577-1_20
Access Statistics for this chapter
More chapters in Springer Books from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().