RELAIS: An Open Source Toolkit for Record Linkage
Nicoletta Cibella,
Marco Fortini,
Monica Scannapieco,
Laura Tosco and
Tiziana Tuoto ()
Additional contact information
Tiziana Tuoto: Istat
Rivista di statistica ufficiale, 2007, vol. 9, issue 2-3, 55-68
Abstract:
The combined use of statistical and administrative sources allow to save time and money, reducing survey costs, response burden, etc.; sometimes data sources are hard to integrate since errors or lacking information in the record identifiers may complicate this process. The purpose of record linkage is to identify the same real world entity, which can be differently represented in data sources. To deal with record linkage complexity and application dependency, we propose a toolkit called RELAIS (REcord Linkage At IStat). The toolkit is based on the idea of choosing the most appropriate technique for each phase, and of dynamically combining such techniques in order to build a workflow, on the basis of application constraints and data features provided as input. RELAIS is configured as an open source project giving the possibility of gathering together the efforts already done in the scientific community towards the definition of a record linkage project. A real case study validates the RELAIS idea.
Keywords: record linkage; open source software (search for similar items in EconPapers)
JEL-codes: C88 (search for similar items in EconPapers)
Date: 2007
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
http://www.istat.it/it/files/2011/05/2_3_20071.pdf (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:isa:journl:v:9:y:2007:i:2:p:55-68
Access Statistics for this article
More articles in Rivista di statistica ufficiale from ISTAT - Italian National Institute of Statistics - (Rome, ITALY) Contact information at EDIRC.
Bibliographic data for series maintained by Stefania Rossetti ().