A Declarative Approach to Entity Resolution
Tanton H. Gibbs
Additional contact information
Tanton H. Gibbs: Acxiom Corporation
Chapter 2 in Data Engineering, 2009, pp 17-38 from Springer
Abstract:
Abstract As companies gather and process more data from disparate sources, they are relying more heavily on entity resolution. Currently, creating an entity resolutionentity resolution system is a very procedural process. Blocking, transitive closure, and matching must all be pieced together whether by an Extract, Transform, and Load (ETL) tool or by a custom program (Galhardas et al. 2000). This is similar to the state of data querying before the advent of the Structured Query Language (SQL). In this chapter, a declarative approach to entity resolution is presented that gives the user the ability to specify what he or she would like resolved while allowing a code generator to determine the best way to resolve it. This chapter does not explore algorithms for blocking, transitive closure, clustering, or matching, but instead refers to papers on those subjects written by other authors (Baxter et al. 2003; Gu and Baxter 2004; Winkler 2000, 2003; Jaro 1989; Bhattacharya and Getoor 2006). Instead a background and defense of entity resolution and declarative languages is presented with a declarative solution and a possible representation.
Keywords: Transitive Closure; Record Linkage; Input Reference; Match Function; Closure Attribute (search for similar items in EconPapers)
Date: 2009
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:isochp:978-1-4419-0176-7_2
Ordering information: This item can be ordered from
http://www.springer.com/9781441901767
DOI: 10.1007/978-1-4419-0176-7_2
Access Statistics for this chapter
More chapters in International Series in Operations Research & Management Science from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().