Matching Anonymized Individuals with Errors for Service Systems
Wai Kin (Victor) Chan ()
Additional contact information
Wai Kin (Victor) Chan: Tsinghua-Berkeley Shenzhen Institute, Tsinghua University
A chapter in Smart Service Systems, Operations Management, and Analytics, 2020, pp 161-168 from Springer
Abstract:
Abstract Data privacy is of great importance for the healthy development of service systemsService systems . Companies and governments that provide services to people often have big concerns in sharing their data. Because of that, data must be preprocessed (e.g., anonymized) before they can be shared. However, without identification, it is difficult to match data from different sources and thus the data cannot be used together. This paper investigates how the performance of two simple individual matching methods was affected by errors in the similarity scores between individuals. The first method is a greedy method (GM) that simply matches individuals based on the maximum similarity scores. The second method is an optimal assignment problem (AP), which maximizes the total similarity scores of the matched individuals. Consistent with the literature, we found that GM outperforms AP in most situations. However, we also discovered that AP could be better in fixing errors.
Keywords: Data matching; Data correlation; Service systems (search for similar items in EconPapers)
Date: 2020
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:prbchp:978-3-030-30967-1_15
Ordering information: This item can be ordered from
http://www.springer.com/9783030309671
DOI: 10.1007/978-3-030-30967-1_15
Access Statistics for this chapter
More chapters in Springer Proceedings in Business and Economics from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().