EconPapers    
Economics at your fingertips  
 

Hiding Sensitive Information when Sharing Distributed Transactional Data

Abhijeet Ghoshal (), Jing Hao (), Syam Menon and Sumit Sarkar ()
Additional contact information
Abhijeet Ghoshal: University of Illinois at Urbana–Champaign, Champaign, Illinois 61820;
Jing Hao: JCPenney, Plano, Texas 75024;
Syam Menon: The University of Texas at Dallas, Richardson, Texas 75083
Sumit Sarkar: The University of Texas at Dallas, Richardson, Texas 75083

Information Systems Research, 2020, vol. 31, issue 2, 473-490

Abstract: Retailers have been sharing transactional data with supply chain partners for a long time to the benefit of all involved. However, many are still reluctant to share, and there is evidence that the extent of sharing would be greater if information sensitive to retailers is concealed before data are shared. Although there has been considerable research into methods to hide sensitive information from transactional data, extant research has focused only on sensitive information at the organizational level. This is rarely the case in reality; the retail industry has recognized and adapted their offerings to region-wide differences in customer tastes for decades, and when stores offer a mix of standardized and customized products, the differences in customer characteristics across regions lead to sensitive information that is region-specific in addition to sensitive information at the organizational level. To date, this version of the problem has been overlooked, and no effective methods exist to solve it; this paper fills that gap. Although some existing approaches can be adapted to this more realistic context, the existence of region-level requirements substantially increases the size of an already difficult (NP-hard) problem to be solved, making such adaptations impractical. Traditional decomposition-based approaches, such as Lagrangian relaxation, are not viable either as they require the repeated solution of NP-hard problems involving millions of variables multiple times. In this paper, we present an ensemble approach that draws intuition from Lagrangian relaxation to maximize the accuracy of a shared transactional dataset. Extensive computational experiments show that this approach not only identifies near-optimal solutions, it can do so even when other approaches fail. We also show that the precision of recommendations made using datasets that have been modified using the ensemble approach is not statistically different from that of recommendations made using the original datasets; this demonstrates that using the ensemble approach to hide sensitive information before sharing transactional data has negligible negative impact.

Keywords: data quality; Lagrangian relaxation; ensemble; itemset hiding; privacy (search for similar items in EconPapers)
Date: 2020
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (3)

Downloads: (external link)
https://doi.org/10.1287/isre.2019.0898 (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:inm:orisre:v:31:y:2020:i:2:p:473-490

Access Statistics for this article

More articles in Information Systems Research from INFORMS Contact information at EDIRC.
Bibliographic data for series maintained by Chris Asher ().

 
Page updated 2025-03-19
Handle: RePEc:inm:orisre:v:31:y:2020:i:2:p:473-490