A Data Disclosure Policy for Count Data Based on the COM-Poisson Distribution

Joseph B. Kadane, Ramayya Krishnan and Galit Shmueli
Additional contact information
Joseph B. Kadane: Department of Statistics, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213
Ramayya Krishnan: The Heinz School of Public Policy and Management, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213
Galit Shmueli: Department of Decision and Information Technologies, Smith School of Business, University of Maryland, College Park, Maryland 20742

Management Science, 2006, vol. 52, issue 10, 1610-1617

Abstract: Count data arise in various organizational settings. When the release of such data is sensitive, organizations need information-disclosure policies that protect data confidentiality while still providing data access. In contrast to extant disclosure policies, we describe a new policy for count tables that is based on disclosing only the sufficient statistics of a flexible discrete distribution. This distribution, the COM-Poisson, approximates well not only Poisson counts but also under- and over-dispersed counts. The sufficient statistics mask the exact cell counts and often also the table size. Under the scenario of a data-holding agency and a data snooper, we show that this policy has low disclosure risk with no loss of data utility: usually, many count tables correspond to the disclosed sufficient statistics. Furthermore, these count tables are equally likely to be the undisclosed table. Finding these solutions requires solving a system of linear equations, which is underdetermined for tables with more than three cells and can be computationally prohibitive for even small tables. We also consider cell-specific interval bounds, a commonly used disclosure limitation policy, and compare them to our policy. We describe several types of snooper knowledge, their integration with the disclosed statistics, and implications. Applying this policy to three real data sets, we illustrate the low associated disclosure risk.
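
Illustration (not part of the article): the Python sketch below shows, on a toy example, how several count tables can share the same COM-Poisson sufficient statistics. It assumes that the table records frequencies f_j of the counts j = 0, ..., J, that the disclosed statistics are n = sum of f_j, S1 = sum of j * f_j, and S2 = sum of f_j * log(j!), and that the snooper knows the number of cells. The function names and the toy table are hypothetical, and the brute-force search merely stands in for solving the linear system mentioned in the abstract; it is not the authors' procedure.

```python
from itertools import product
from math import factorial


def sufficient_stats(freqs):
    """Return (n, S1, P) for a frequency table where freqs[j] counts the value j.

    P is the exact integer product of (j!)**f_j. It carries the same
    information as S2 = sum f_j * log(j!) (S2 = log P) but avoids
    floating-point comparisons.
    """
    n = sum(freqs)
    s1 = sum(j * f for j, f in enumerate(freqs))
    prod_fact = 1
    for j, f in enumerate(freqs):
        prod_fact *= factorial(j) ** f
    return n, s1, prod_fact


def consistent_tables(n, s1, prod_fact, num_cells):
    """Brute-force every non-negative integer table with the given statistics.

    Assumption: the snooper knows num_cells. The search visits
    (n + 1) ** num_cells candidates, so it is only feasible for toy tables.
    """
    target = (n, s1, prod_fact)
    matches = []
    for cand in product(range(n + 1), repeat=num_cells):
        if sum(cand) != n:  # cheap filter before the full check
            continue
        if sufficient_stats(cand) == target:
            matches.append(list(cand))
    return matches


# Hypothetical undisclosed table: three 0s, two 1s, two 2s, one 3, no 4s.
true_table = [3, 2, 2, 1, 0]
stats = sufficient_stats(true_table)
print(stats)  # (8, 9, 24)
print(consistent_tables(*stats, num_cells=len(true_table)))
# [[2, 5, 0, 0, 1], [3, 2, 2, 1, 0]]
```

In this toy case two tables match the disclosed statistics, so a snooper holding only those statistics cannot tell which one is the undisclosed table. The exponential size of the search space also hints at why the abstract describes the problem as computationally prohibitive even for small tables.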

Keywords: sufficient statistics; Conway-Maxwell-Poisson; disclosure risk; data snooper
Date: 2006

Downloads: (external link)
http://dx.doi.org/10.1287/mnsc.1060.0562 (application/pdf)

Persistent link: https://EconPapers.repec.org/RePEc:inm:ormnsc:v:52:y:2006:i:10:p:1610-1617

Page updated 2025-03-19
Handle: RePEc:inm:ormnsc:v:52:y:2006:i:10:p:1610-1617