EconPapers    
Economics at your fingertips  
 

A Differential Privacy Budget Allocation Algorithm Based on Out-of-Bag Estimation in Random Forest

Xin Li, Baodong Qin (), Yiyuan Luo and Dong Zheng
Additional contact information
Xin Li: School of Cyberspace Security, Xi’an University of Posts and Telecommunications, Xi’an 710121, China
Baodong Qin: School of Cyberspace Security, Xi’an University of Posts and Telecommunications, Xi’an 710121, China
Yiyuan Luo: School of Computer Science and Engineering, Huizhou University, Huizhou 516007, China
Dong Zheng: School of Cyberspace Security, Xi’an University of Posts and Telecommunications, Xi’an 710121, China

Mathematics, 2022, vol. 10, issue 22, 1-15

Abstract: The issue of how to improve the usability of data publishing under differential privacy has become one of the top questions in the field of machine learning privacy protection, and the key to solving this problem is to allocate a reasonable privacy protection budget. To solve this problem, we design a privacy budget allocation algorithm based on out-of-bag estimation in random forest. The algorithm firstly calculates the decision tree weights and feature weights by the out-of-bag data under differential privacy protection. Secondly, statistical methods are introduced to classify features into best feature set, pruned feature set, and removable feature set. Then, pruning is performed using the pruned feature set to avoid decision trees over-fitting when constructing an ϵ -differential privacy random forest. Finally, the privacy budget is allocated proportionally based on the decision tree weights and feature weights in the random forest. We conducted experimental comparisons with real data sets from Adult and Mushroom to demonstrate that this algorithm not only protects data security and privacy, but also improves model classification accuracy and data availability.

Keywords: differential privacy; machine learning; privacy protection; random forest; out-of-bag estimation (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2022
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2227-7390/10/22/4338/pdf (application/pdf)
https://www.mdpi.com/2227-7390/10/22/4338/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:10:y:2022:i:22:p:4338-:d:977472

Access Statistics for this article

Mathematics is currently edited by Ms. Emma He

More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jmathe:v:10:y:2022:i:22:p:4338-:d:977472