EconPapers    
Economics at your fingertips  
 

The 2010 Census Confidentiality Protections Failed, Here’s How and Why

John Abowd (), Tamara Adams, Robert Ashmead, David Darais, Sourya Dey, Simson L. Garfinkel, Nathan Goldschlag, Daniel Kifer, Philip Leclerc, Ethan Lew, Scott Moore, Rolando A. Rodríguez, Ramy N. Tadros and Lars Vilhuber

Working Papers from U.S. Census Bureau, Center for Economic Studies

Abstract: Using only 34 published tables, we reconstruct five variables (census block, sex, age, race, and ethnicity) in the confidential 2010 Census person records. Using the 38-bin age variable tabulated at the census block level, at most 20.1% of reconstructed records can differ from their confidential source on even a single value for these five variables. Using only published data, an attacker can verify that all records in 70% of all census blocks (97 million people) are perfectly reconstructed. The tabular publications in Summary File 1 thus have prohibited disclosure risk similar to the unreleased confidential microdata. Reidentification studies confirm that an attacker can, within blocks with perfect reconstruction accuracy, correctly infer the actual census response on race and ethnicity for 3.4 million vulnerable population uniques (persons with nonmodal characteristics) with 95% accuracy, the same precision as the confidential data achieve and far greater than statistical baselines. The flaw in the 2010 Census framework was the assumption that aggregation prevented accurate microdata reconstruction, justifying weaker disclosure limitation methods than were applied to 2010 Census public microdata. The framework used for 2020 Census publications defends against attacks that are based on reconstruction, as we also demonstrate here. Finally, we show that alternatives to the 2020 Census Disclosure Avoidance System with similar accuracy (enhanced swapping) also fail to protect confidentiality, and those that partially defend against reconstruction attacks (incomplete suppression implementations) destroy the primary statutory use case: data for redistricting all legislatures in the country in compliance with the 1965 Voting Rights Act.

Keywords: Keywords: statistical disclosure limitation; reconstruction attack; record linkage attack; differential privacy; swapping; suppression (search for similar items in EconPapers)
Pages: 77 pages
Date: 2023-12
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www2.census.gov/library/working-papers/2023/adrm/ces/CES-WP-23-63.pdf First version, 2023 (application/pdf)

Related works:
Working Paper: The 2010 Census Confidentiality Protections Failed, Here's How and Why (2023) Downloads
Working Paper: The 2010 Census Confidentiality Protections Failed, Here's How and Why (2023) Downloads
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:cen:wpaper:23-63

Access Statistics for this paper

More papers in Working Papers from U.S. Census Bureau, Center for Economic Studies Contact information at EDIRC.
Bibliographic data for series maintained by Dawn Anderson ().

 
Page updated 2025-03-24
Handle: RePEc:cen:wpaper:23-63