EconPapers    
Economics at your fingertips  
 

Iatrogenic Specification Error: A Cautionary Tale of Cleaning Data

Christopher Bollinger and Amitabh Chandra

No 1093, IZA Discussion Papers from Institute of Labor Economics (IZA)

Abstract: In empirical research it is common practice to use sensible rules of thumb for cleaning data. Measurement error is often the justification for removing (trimming) or recoding (winsorizing) observations whose values lie outside a specified range. We consider a general measurement error process that nests many plausible models. Analytic results demonstrate that winsorizing and trimming are only solutions for a narrow class of measurement error processes. Indeed, for the measurement error processes found in most social-science data, such procedures can induce or exacerbate bias, and even inflate the variance estimates. We term this source of bias "Iatrogenic" (or econometrician induced) error. Monte Carlo simulations and empirical results from the Census PUMS data and 2001 CPS data demonstrate the fragility of trimming and winsorizing as solutions to measurement error in the dependent variable. Even on asymptotic variance and RMSE criteria, we are unable to find generalizable justifications for commonly used cleaning procedures.

Keywords: winsorizing; measurement error models; trimming (search for similar items in EconPapers)
JEL-codes: C1 J1 (search for similar items in EconPapers)
Pages: 24 pages
Date: 2004-03
New Economics Papers: this item is included in nep-cmp and nep-ecm
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (2)

Published - published in: Journal of Labor Economics, 2005, 23 (2), 235-257

Downloads: (external link)
https://docs.iza.org/dp1093.pdf (application/pdf)

Related works:
Journal Article: Iatrogenic Specification Error: A Cautionary Tale of Cleaning Data (2005) Downloads
Working Paper: Iatrogenic Specification Error: A Cautionary Tale of Cleaning Data (2003) Downloads
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:iza:izadps:dp1093

Ordering information: This working paper can be ordered from
IZA, Margard Ody, P.O. Box 7240, D-53072 Bonn, Germany

Access Statistics for this paper

More papers in IZA Discussion Papers from Institute of Labor Economics (IZA) IZA, P.O. Box 7240, D-53072 Bonn, Germany. Contact information at EDIRC.
Bibliographic data for series maintained by Holger Hinte ().

 
Page updated 2025-03-30
Handle: RePEc:iza:izadps:dp1093