Linkage in medical records and bioinformatics data
Shen Lu and Richard S. Segall
International Journal of Information and Decision Sciences, 2013, vol. 5, issue 2, 169-187
Abstract:
Patients who make repeated visits generate multiple records, which leaves redundant information spread across multiple data sources. Matching and merging duplicate records increases the amount of information available for the population units required by stand-alone and distributed databases. In this paper, we present an algorithm called entity resolution of the Fellegi-Sunter (ERFS) model, which uses the Fellegi-Sunter model to improve the results of semantic analysis for identifying similar records. In our experiments, ERFS yields higher rates than the Stanford entity resolution framework (SERF) in about 11.07% of the experiments. For these 11.07%, 38.1% of the experiments conducted show increases ranging from 12.7% to 21.9%, and experiments with a mid-range number of records show an average increase of 16.96%. We therefore conclude that ERFS should be used to link similar records.
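For readers unfamiliar with the Fellegi-Sunter model named in the abstract, the following is a minimal sketch of its standard scoring rule, not the authors' ERFS algorithm. Each compared field contributes log2(m/u) when the two records agree and log2((1-m)/(1-u)) when they disagree, where m is the probability of agreement among true matches and u is the probability of agreement among non-matches. The field names, the m and u values, and the two thresholds below are hypothetical and chosen only for illustration; in practice m and u are often estimated, for example by expectation maximisation.

```python
import math

# Hypothetical (m, u) probabilities per field; illustrative values only,
# not taken from the paper.
M_U = {
    "last_name": (0.95, 0.01),
    "birth_date": (0.90, 0.05),
    "zip_code": (0.85, 0.10),
}


def field_weight(agrees, m, u):
    """Fellegi-Sunter weight for one field: agreement or disagreement."""
    if agrees:
        return math.log2(m / u)
    return math.log2((1 - m) / (1 - u))


def match_weight(rec_a, rec_b, m_u=M_U):
    """Sum the per-field weights, using exact equality as the comparison."""
    return sum(
        field_weight(rec_a.get(field) == rec_b.get(field), m, u)
        for field, (m, u) in m_u.items()
    )


def classify(weight, lower=0.0, upper=6.0):
    """Three-way decision with hypothetical lower and upper thresholds."""
    if weight >= upper:
        return "match"
    if weight <= lower:
        return "non-match"
    return "possible match"


visit_1 = {"last_name": "Rivera", "birth_date": "1970-03-14", "zip_code": "72401"}
visit_2 = {"last_name": "Rivera", "birth_date": "1970-03-14", "zip_code": "72401"}
print(classify(match_weight(visit_1, visit_2)))
```

Exact string equality is a simplification here; real patient records usually need approximate comparison of names and dates before the weights are applied.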
Keywords: Fellegi-Sunter model; expectation maximisation; SERF model; record linkage; medical records; bioinformatics data; patient records; redundant information; semantic analysis; entity resolution.
Date: 2013
Downloads: (external link)
http://www.inderscience.com/link.php?id=53803 (text/html)
Access to full text is restricted to subscribers.
Persistent link: https://EconPapers.repec.org/RePEc:ids:ijidsc:v:5:y:2013:i:2:p:169-187
More articles in International Journal of Information and Decision Sciences from Inderscience Enterprises Ltd