Inaccurate age and sex data in the Census PUMS files: evidence and implications
Betsey Stevenson (),
Michael Davern and
J. Trent Alexander
No 2010-03, Working Paper Series from Federal Reserve Bank of San Francisco
We discover and document errors in public use microdata samples (\\"PUMS files\\") of the 2000 Census, the 2003-2006 American Community Survey, and the 2004-2009 Current Population Survey. For women and men ages 65 and older, age- and sex-specific population estimates generated from the PUMS files differ by as much as 15% from counts in published data tables. Moreover, an analysis of labor force participation and marriage rates suggests the PUMS samples are not representative of the population at individual ages for those ages 65 and over. PUMS files substantially underestimate labor force participation of those near retirement ages and overestimate labor force participation rates of those at older ages. These problems were an unintentional by-product of the misapplication of a newer generation of disclosure avoidance procedures carried out on the data. The resulting errors in the public use data could significantly impact studies of people ages 65 and older, particularly analyses of variables that are expected to change by age.
Keywords: Population; Census; Labor supply (search for similar items in EconPapers)
Date: 2010, Revised 2010
New Economics Papers: this item is included in nep-age and nep-lab
References: Add references at CitEc
Citations: View citations in EconPapers (12) Track citations by RSS feed
Downloads: (external link)
Working Paper: Inaccurate Age and Sex Data in the Census PUMS Files: Evidence and Implications (2010)
Working Paper: Inaccurate age and sex data in the Census PUMS files: Evidence and Implications (2010)
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
Persistent link: https://EconPapers.repec.org/RePEc:fip:fedfwp:2010-03
Ordering information: This working paper can be ordered from
Access Statistics for this paper
More papers in Working Paper Series from Federal Reserve Bank of San Francisco Contact information at EDIRC.
Bibliographic data for series maintained by Federal Reserve Bank of San Francisco Research Library ().