
Distribution-Preserving Statistical Disclosure Limitation

Simon Woodcock and Gary Benedetto
Additional contact information
Gary Benedetto: US Census Bureau

Discussion Papers from Department of Economics, Simon Fraser University

Abstract: One approach to limiting disclosure risk in public-use microdata is to release multiply-imputed, partially synthetic data sets. These are data on actual respondents, but with confidential data replaced by multiply-imputed synthetic values. When imputing confidential values, a mis-specified model can invalidate inferences, because the distribution of synthetic data is determined by the model used to generate them. We present a practical method to generate synthetic values when the imputer has only limited information about the true data generating process. We combine a simple imputation model (such as regression) with a series of density-based transformations to preserve the distribution of the confidential data, up to sampling error, on specified subdomains. We demonstrate through simulation and a large-scale application that our approach preserves important statistical properties of the confidential data, including higher moments, with low disclosure risk.
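
The idea in the abstract, a simple imputation model followed by a distribution-matching transformation on each subdomain, can be illustrated with a small sketch. This is not the authors' algorithm: it assumes a one-regressor linear model, Gaussian kernel density estimates with a normal-reference bandwidth, and a single transformation step per subdomain. All function names (`fit_line`, `kde_cdf`, `invert`, `synthesize`) and the simulated data are hypothetical and chosen only for illustration.

```python
import math
import random
from statistics import NormalDist, fmean, stdev

STD_NORMAL = NormalDist()

def fit_line(x, y):
    """Least squares with one regressor: (intercept, slope, residual sd)."""
    mx, my = fmean(x), fmean(y)
    sxx = sum((a - mx) ** 2 for a in x)
    slope = sum((a - mx) * (b - my) for a, b in zip(x, y)) / sxx
    intercept = my - slope * mx
    rss = sum((b - intercept - slope * a) ** 2 for a, b in zip(x, y))
    return intercept, slope, (rss / (len(x) - 2)) ** 0.5

def bandwidth(points):
    """Normal-reference bandwidth (an assumption; needs non-constant data)."""
    return 1.06 * stdev(points) * len(points) ** (-0.2)

def kde_cdf(points, h):
    """CDF of a Gaussian kernel density estimate built on `points`."""
    def cdf(v):
        return fmean(STD_NORMAL.cdf((v - p) / h) for p in points)
    return cdf

def invert(cdf, u, lo, hi, steps=50):
    """Invert a monotone CDF by bisection on the bracket [lo, hi]."""
    for _ in range(steps):
        mid = (lo + hi) / 2
        if cdf(mid) < u:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

def synthesize(x, y, domain, rng):
    """Regression draws, then a per-subdomain density-based transformation."""
    a, b, sigma = fit_line(x, y)
    # Step 1: draws from the simple (possibly mis-specified) model.
    draws = [a + b * xi + rng.gauss(0, sigma) for xi in x]
    synthetic = list(draws)
    for d in sorted(set(domain)):
        idx = [i for i, g in enumerate(domain) if g == d]
        conf = [y[i] for i in idx]
        raw = [draws[i] for i in idx]
        h_conf = bandwidth(conf)
        f_raw = kde_cdf(raw, bandwidth(raw))
        f_conf = kde_cdf(conf, h_conf)
        lo, hi = min(conf) - 10 * h_conf, max(conf) + 10 * h_conf
        # Step 2: send each draw through the estimated CDF of the draws,
        # then through the inverse estimated CDF of the confidential values.
        for i in idx:
            synthetic[i] = invert(f_conf, f_raw(draws[i]), lo, hi)
    return synthetic

# Simulated example: skewed outcome, linear-normal imputation model.
rng = random.Random(0)
n = 120
x = [rng.uniform(0, 1) for _ in range(n)]
domain = ["a" if xi < 0.5 else "b" for xi in x]
y = [math.exp(xi + rng.gauss(0, 0.5)) for xi in x]
synthetic = synthesize(x, y, domain, rng)
```

In this sketch the regression supplies only the ordering of the synthetic values within a subdomain, while the transformation pulls their marginal distribution toward a smoothed estimate of the confidential distribution, so skewness missed by the linear model is largely restored. A real application would also need multiple imputations and a disclosure-risk assessment, which the sketch omits.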

Keywords: statistical disclosure limitation; confidentiality; privacy; multiple imputation; partially synthetic data
JEL-codes: C1 C4 C5
Pages: 39
Date: 2007-09
New Economics Papers: this item is included in nep-ecm

Downloads: (external link)
http://www.econ.sfu.ca/research/RePEc/sfu/sfudps/dp07-15.pdf (application/pdf)

Related works:
Journal Article: Distribution-preserving statistical disclosure limitation (2009)
Working Paper: Distribution Preserving Statistical Disclosure Limitation (2006)
Working Paper: Distribution-Preserving Statistical Disclosure Limitation (2006)


Persistent link: https://EconPapers.repec.org/RePEc:sfu:sfudps:dp07-15


More papers in Discussion Papers from Department of Economics, Simon Fraser University Department of Economics, Simon Fraser University, 8888 University Drive, Burnaby, BC, V5A 1S6, Canada. Contact information at EDIRC.
Bibliographic data for series maintained by Working Paper Coordinator.

 
Page updated 2025-03-20
Handle: RePEc:sfu:sfudps:dp07-15