EconPapers    
Economics at your fingertips  
 

Optimizing the Accuracy of Entity-Based Data Integration of Multiple Data Sources Using Genetic Programming Methods

Yinle Zhou, Ali Kooshesh and John Talburt
Additional contact information
Yinle Zhou: University of Arkansas at Little Rock, USA
Ali Kooshesh: Sonoma State University, USA
John Talburt: University of Arkansas at Little Rock, USA

International Journal of Business Intelligence Research (IJBIR), 2012, vol. 3, issue 1, 72-82

Abstract: Entity-based data integration (EBDI) is a form of data integration in which information related to the same real-world entity is collected and merged from different sources. It often happens that not all of the sources will agree on one value for a common attribute. These cases are typically resolved by invoking a rule that will select one of the non-null values presented by the sources. One of the most commonly used selection rules is called the naïve selection operator that chooses the non-null value provided by the source with the highest overall accuracy for the attribute in question. However, the naïve selection operator will not always produce the most accurate result. This paper describes a method for automatically generating a selection operator using methods from genetic programming. It also presents the results from a series of experiments using synthetic data that indicate that this method will yield a more accurate selection operator than either the naïve or naïve-voting selection operators.

Date: 2012
References: Add references at CitEc
Citations:

Downloads: (external link)
http://services.igi-global.com/resolvedoi/resolve. ... 4018/jbir.2012010105 (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:igg:jbir00:v:3:y:2012:i:1:p:72-82

Access Statistics for this article

International Journal of Business Intelligence Research (IJBIR) is currently edited by Ana Azevedo

More articles in International Journal of Business Intelligence Research (IJBIR) from IGI Global
Bibliographic data for series maintained by Journal Editor ().

 
Page updated 2025-03-19
Handle: RePEc:igg:jbir00:v:3:y:2012:i:1:p:72-82