EconPapers    
Economics at your fingertips  
 

Drive More Effective Data-Based Innovations: Enhancing the Utility of Secure Databases

Yi Qian () and Hui Xie ()
Additional contact information
Yi Qian: Department of Marketing and Behavioral Science, Sauder School of Business, University of British Columbia, Vancouver, British Columbia V6T 1Z2, Canada
Hui Xie: Division of Epidemiology and Biostatistics, University of Illinois, Chicago, Illinois 60612

Management Science, 2015, vol. 61, issue 3, 520-541

Abstract: Databases play a central role in evidence-based innovations in business, economics, social, and health sciences. In modern business and society, there are rapidly growing demands for constructing analytically valid databases that also are secure and protect sensitive information to meet customer and public expectations, to minimize financial losses, and to comply with privacy regulations and laws. We propose new data perturbation and shuffling (DPS) procedures, named MORE, for this purpose. As compared with existing DPS methods, MORE can substantially increase the utility of secure databases without increasing disclosure risk. MORE is capable of preserving important nonmonotonic relationships among attributes, such as the inverted-U relationship between competition and innovation. Maintaining such relationships is often the key to determining optimal levels of policy and managerial interventions. MORE does not require data to be of particular types or have particular distributional shapes. Instead, it provides unified, flexible, and robust algorithms to mask general types of confidential variables with arbitrary distributions, thereby making it suitable for general-purpose data masking. Since MORE nests the commonly used generalized linear models as special cases, a much wider range of statistical analyses can be conducted by using the secure databases with results similar to those achieved by using the original databases. Unlike existing DPS approaches that typically require a joint model for all variables, MORE requires no modeling of nonconfidential variables and thus further increases the robustness of secure databases. Evaluation of MORE through Monte Carlo simulation studies and empirical applications demonstrates that it performs better than existing data-masking methods.Data, as supplemental material, are available at http://dx.doi.org/10.1287/mnsc.2014.2026 . This paper was accepted by Sandra Slaughter, information systems .

Keywords: database; digital economy; innovation; nonparametric; perturbation; privacy; shuffling (search for similar items in EconPapers)
Date: 2015
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
http://dx.doi.org/10.1287/mnsc.2014.2026 (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:inm:ormnsc:v:61:y:2015:i:3:p:520-541

Access Statistics for this article

More articles in Management Science from INFORMS Contact information at EDIRC.
Bibliographic data for series maintained by Chris Asher ().

 
Page updated 2025-03-19
Handle: RePEc:inm:ormnsc:v:61:y:2015:i:3:p:520-541