Differentially private histogram with valid statistics
Zilong Cao,
Shisong Wu,
Xuanang Li and
Hai Zhang
Statistics & Probability Letters, 2025, vol. 219, issue C
Abstract:
Differentially private histograms (DP-Histograms) are integral to data publication and privacy preservation efforts. However, conventional DP-Histograms often fail to preserve valid statistical information and the essential characteristics of the original data. This paper shows the invalidity of variance is the inherent shortcomings in general DP-Histograms, and introduces a novel algorithm called the Differentially Private Histogram with Valid Statistics (VSDPH) to overcome the problem. The VSDPH, grounded in linear programming and bounded Lipschitz distance, efficiently generates DP histograms while preserving the valid statistics of the original data. Our theoretical analysis demonstrates that histograms produced by VSDPH maintain asymptotically valid variance, and we establish an upper bound based on the 1-Wasserstein distance. Through experiments, we validate that VSDPH can accurately hold the statistical characteristics of the original data. This capability brings the resulting histograms closer to the originals.
Keywords: Differential privacy; Histogram; Wasserstein distance; Statistics; Linear programming (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0167715224003237
Full text for ScienceDirect subscribers only
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:eee:stapro:v:219:y:2025:i:c:s0167715224003237
Ordering information: This journal article can be ordered from
http://www.elsevier.com/wps/find/supportfaq.cws_home/regional
https://shop.elsevie ... _01_ooc_1&version=01
DOI: 10.1016/j.spl.2024.110354
Access Statistics for this article
Statistics & Probability Letters is currently edited by Somnath Datta and Hira L. Koul
More articles in Statistics & Probability Letters from Elsevier
Bibliographic data for series maintained by Catherine Liu ().