EconPapers    
Economics at your fingertips  
 

A Network Analysis Model for Disambiguation of Names in Lists

Bradley Malin (), Edoardo Airoldi () and Kathleen M. Carley ()
Additional contact information
Bradley Malin: Carnegie Mellon University
Edoardo Airoldi: Carnegie Mellon University
Kathleen M. Carley: School of Computer Science, Carnegie Mellon University

Computational and Mathematical Organization Theory, 2005, vol. 11, issue 2, No 3, 119-139

Abstract: Abstract In research and application, social networks are increasingly extracted from relationships inferred by name collocations in text-based documents. Despite the fact that names represent real entities, names are not unique identifiers and it is often unclear when two name observations correspond to the same underlying entity. One confounder stems from ambiguity, in which the same name correctly references multiple entities. Prior name disambiguation methods measured similarity between two names as a function of their respective documents. In this paper, we propose an alternative similarity metric based on the probability of walking from one ambiguous name to another in a random walk of the social network constructed from all documents. We experimentally validate our model on actor-actor relationships derived from the Internet Movie Database. Using a global similarity threshold, we demonstrate random walks achieve a significant increase in disambiguation capability in comparison to prior models.

Keywords: disambiguation; social networks; link analysis; random walks; clustering (search for similar items in EconPapers)
Date: 2005
References: View complete reference list from CitEc
Citations: View citations in EconPapers (2)

Downloads: (external link)
http://link.springer.com/10.1007/s10588-005-3940-3 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:comaot:v:11:y:2005:i:2:d:10.1007_s10588-005-3940-3

Ordering information: This journal article can be ordered from
http://www.springer.com/journal/10588

DOI: 10.1007/s10588-005-3940-3

Access Statistics for this article

Computational and Mathematical Organization Theory is currently edited by Terrill Frantz and Kathleen Carley

More articles in Computational and Mathematical Organization Theory from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-03-20
Handle: RePEc:spr:comaot:v:11:y:2005:i:2:d:10.1007_s10588-005-3940-3