EconPapers    
Economics at your fingertips  
 

Missing Income Data in the German SOEP: Incidence, Imputation and its Impact on the Income Distribution

Joachim Frick and Markus Grabka

No 376, Discussion Papers of DIW Berlin from DIW Berlin, German Institute for Economic Research

Abstract: This paper deals with the question of selectivity of missing data on income questions in large panel surveys due to item-non-response and with imputation as one alternative strategy to cope with this issue. In contrast to cross-section surveys, the imputation of missing values in panel data can profit from longitudinal information which is available for the very same observation units from other points in time. The “row-and-column imputation procedure” developed by Little & Su (1989) considers longitudinal as well as cross-sectional information in the imputation process. This procedure is applied to the German Socio-Economic Panel study (SOEP) when deriving annual income variables, complemented by purely cross-sectional techniques. Based on the SOEP, our empirical work starts with a description of the overall incidence of imputation and its relevance given by imputed income as a percentage share of the total income mass: e.g. while 21 % of all observations have at least one missing income component of their pre-tax post-transfer income, 9 % of the overall income mass is imputed. However, this picture varies considerably for more recent sub-samples of the panel survey. Secondly, we analyze the respective impact of imputation on the personal distribution of income as well as on results of income mobility. When comparing income inequality measures based only on truly observed information to those derived from all (i.e., observed and imputed) observations, we find an increase in inequality due to imputation and this effect appears to be relevant in both tails of the distribution, although somewhat more prominent among higher incomes. Longitudinal analyses show firstly a positive correlation of item-non-response on income data over time, but also provide evidence of item-non-response as being a predictor of subsequent unit-non-response. Applying various income mobility indicators there is a robust picture about income mobility being understated using truly observed information only. Finally, multivariate models show that survey-related factors (number of interviews, interview mode) as well as indicators for variability in income receipt (due to increased complexity of household structure and income composition) are significantly correlated with item-non-response. In conclusion, our empirical results based on the German SOEP indicate the selectivity of item-non-response on income questions in social surveys and push the necessity for adequate imputation.

Keywords: Item-Non-Response; Imputation; Income Inequality (search for similar items in EconPapers)
JEL-codes: C81 D31 I32 (search for similar items in EconPapers)
Pages: 35 p.
Date: 2003
New Economics Papers: this item is included in nep-lab
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (5)

Downloads: (external link)
https://www.diw.de/documents/publikationen/73/diw_01.c.40900.de/dp376.pdf (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:diw:diwwpp:dp376

Access Statistics for this paper

More papers in Discussion Papers of DIW Berlin from DIW Berlin, German Institute for Economic Research Contact information at EDIRC.
Bibliographic data for series maintained by Bibliothek ().

 
Page updated 2025-04-01
Handle: RePEc:diw:diwwpp:dp376