Synthetic Individual Income Tax Data: Promises and Challenges
Claire McKay Bowen,
Victoria L. Bryant,
Leonard Burman,
Surachai Khitatrakun,
Robert McClelland,
Livia Mucciolo,
Madeline Pickens and
Aaron R. Williams
National Tax Journal, 2022, vol. 75, issue 4, 767 - 790
Abstract:
Tax data are invaluable for research, but privacy concerns severely limit access. Although the US Internal Revenue Service produces a public-use file (PUF), improved technology and the proliferation of individual data have made it increasingly difficult to protect. Synthetic data are an alternative that reproduce the statistical properties of administrative data without revealing individual taxpayer information. This paper evaluates the quality and safety of the first fully synthetic PUF and demonstrates its performance in tax model microsimulations. The synthetic PUF could also be used to develop and debug statistical programs that could then be safely run on confidential data via a validation server.
Date: 2022
References: Add references at CitEc
Citations: View citations in EconPapers (1)
Downloads: (external link)
http://dx.doi.org/10.1086/722094 (application/pdf)
http://dx.doi.org/10.1086/722094 (text/html)
Access to the online full text or PDF requires a subscription.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:ucp:nattax:doi:10.1086/722094
Access Statistics for this article
More articles in National Tax Journal from University of Chicago Press
Bibliographic data for series maintained by Journals Division ().