A Framework for Sharing Confidential Research Data, Applied to Investigating Differential Pay by Race in the U.S. Government
Andrés F. Barrientos,
Alexander Bolton,
Tom Balmat,
Jerome P. Reiter,
John M. de Figueiredo,
Ashwin Machanavajjhala,
Yan Chen,
Charles Kneifel and
Mark DeLong
No 23534, NBER Working Papers from National Bureau of Economic Research, Inc
Abstract:
Data stewards seeking to provide access to large-scale social science data face a difficult challenge. They have to share data in ways that protect privacy and confidentiality, are informative for many analyses and purposes, and are relatively straightforward to use by data analysts. We present a framework for addressing this challenge. The framework uses an integrated system that includes fully synthetic data intended for wide access, coupled with means for approved users to access the confidential data via secure remote access solutions, glued together by verification servers that allow users to assess the quality of their analyses with the synthetic data. We apply this framework to data on the careers of employees of the U.S. federal government, studying differentials in pay by race. The integrated system performs as intended, allowing users to explore the synthetic data for potential pay differentials and learn through verifications which findings in the synthetic data hold up in the confidential data and which do not. We find differentials across races; for example, the gap between black and white female federal employees' pay increased over the time period. We present models for generating synthetic careers and differentially private algorithms for verification of regression results.
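To make the idea of a differentially private verification of a regression result more concrete, the following is a minimal sketch and not the algorithm from the paper. It assumes a generic subsample-and-aggregate approach: the confidential data are split into disjoint partitions, an ordinary least squares fit is run on each, and the number of partitions in which a chosen coefficient is positive is released with Laplace noise. The function name `noisy_positive_count`, its parameters, and the simulated data are all hypothetical and chosen only for illustration.

```python
import numpy as np


def noisy_positive_count(X, y, coef_index, epsilon, n_partitions, rng):
    """Laplace-noised count of partitions whose OLS coefficient is positive.

    Hypothetical helper for illustration; not taken from the paper.
    """
    n = X.shape[0]
    # The random partition does not depend on the data values, so replacing
    # one record changes the fit in exactly one partition.
    order = rng.permutation(n)
    positive = 0
    for idx in np.array_split(order, n_partitions):
        beta, *_ = np.linalg.lstsq(X[idx], y[idx], rcond=None)
        if beta[coef_index] > 0:
            positive += 1
    # The count therefore has sensitivity 1, and Laplace noise with
    # scale 1/epsilon gives an epsilon-differentially private release.
    return positive + rng.laplace(loc=0.0, scale=1.0 / epsilon)


# Simulated stand-in for confidential data (assumption: one predictor
# with a clearly positive effect on the outcome).
rng = np.random.default_rng(0)
n = 2000
x = rng.normal(size=n)
X = np.column_stack([np.ones(n), x])
y = 1.0 + 2.0 * x + rng.normal(size=n)

answer = noisy_positive_count(
    X, y, coef_index=1, epsilon=1.0, n_partitions=20, rng=rng
)
print(f"noisy count of partitions with a positive coefficient: {answer:.1f} of 20")
```

Under these assumptions, a user who found a positive coefficient in the synthetic data would read a noisy count close to the number of partitions as support for that finding in the confidential data, and a count near zero as evidence against it. Smaller values of `epsilon` give stronger privacy protection at the cost of a noisier answer.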
JEL-codes: C51 C53 C55 C81 J15 J45
Date: 2017-06
New Economics Papers: this item is included in nep-ict and nep-lma
Note: LE LS PE TWP
Published as Barrientos, Andres F., Alexander Bolton, Tom Balmat, Jerome P. Reiter, John M. de Figueiredo, Ashwin Machanavajjhala, Yan Chen, Charley Kneifel, and Mark DeLong (2018). “A Framework for Sharing Confidential Research Data, Applied to Investigating Differential Pay by Race in the U.S. Government,” Annals of Applied Statistics 12(2): 1124-1156.
Downloads:
http://www.nber.org/papers/w23534.pdf (application/pdf)
Persistent link: https://EconPapers.repec.org/RePEc:nbr:nberwo:23534
Ordering information: This working paper can be ordered from
http://www.nber.org/papers/w23534
Publisher: National Bureau of Economic Research, Inc, 1050 Massachusetts Avenue, Cambridge, MA 02138, U.S.A.