Enriching a Large-Scale Survey from a Representative Sample by Data Fusion: Models and Validation
Tomàs Aluja-Banet,
Josep Daunis-i-Estadella and
Yan Hong Chen
Additional contact information
Tomàs Aluja-Banet: Universitat Politècnica de Catalunya—Barcelona Tech
Josep Daunis-i-Estadella: Universitat de Girona, Campus de Montilivi
Yan Hong Chen: Institut d’Estadística de Catalunya
A chapter in Survey Data Collection and Integration, 2013, pp 121-137 from Springer
Abstract:
Data fusion is a series of operations that takes advantage of information already collected. Here we present a complete, real application of data fusion, focusing on all the operational steps required. These steps are the key points of the procedure: selecting the hinge variables, grafting donors and recipients, choosing the imputation model, and assessing the quality of the imputed data. We present a standard methodology for assessing the suitability of the chosen imputation model. To that end, we use a validation suite of seven statistics that measure different facets of the quality of the imputed data: comparing the marginal global statistics, assessing the truthfulness of the imputed values, and evaluating the goodness of fit of the imputed data. To measure the adequacy of the recipient individuals with respect to the donor set, we compute the significance of the validation statistics by bootstrapping under the assumption that the recipients are a random sample of the donor population. To illustrate the proposed approach, we perform a real data fusion operation on the victimization of citizens, in which collected opinions on perceived safety are imputed to enrich a large-scale survey on citizen victimization.
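The bootstrap step described in the abstract can be illustrated with a minimal Python sketch. It is not the chapter's implementation: the single hinge variable, the least-squares line used as the imputation model, the one statistic shown (the gap between the mean imputed value and the donor mean), and all function names are assumptions chosen for illustration. The chapter's seven validation statistics are not reproduced here.

```python
import numpy as np

def fit_linear(x, y):
    # Assumed stand-in imputation model: least-squares line y = a + b*x on donors.
    b, a = np.polyfit(x, y, 1)  # polyfit returns the highest power first
    return a, b

def mean_gap(x_rec, a, b, donor_mean):
    # Illustrative validation statistic: |mean imputed value - donor mean|.
    return abs(np.mean(a + b * x_rec) - donor_mean)

def bootstrap_p_value(x_don, y_don, x_rec, n_boot=2000, seed=0):
    # Significance of the statistic under the null hypothesis that the
    # recipients are a random sample of the donor population.
    rng = np.random.default_rng(seed)
    a, b = fit_linear(x_don, y_don)
    donor_mean = y_don.mean()
    observed = mean_gap(x_rec, a, b, donor_mean)
    null = np.empty(n_boot)
    for i in range(n_boot):
        # Pseudo-recipients: resample donor hinge values with replacement.
        pseudo = rng.choice(x_don, size=len(x_rec), replace=True)
        null[i] = mean_gap(pseudo, a, b, donor_mean)
    # Add-one correction keeps the estimated p-value strictly positive.
    p = (1 + np.sum(null >= observed)) / (n_boot + 1)
    return observed, p

# Synthetic demonstration data (hypothetical, not from the chapter).
rng = np.random.default_rng(1)
x_don = rng.normal(0.0, 1.0, 500)
y_don = 2.0 * x_don + rng.normal(0.0, 0.5, 500)
x_rec = rng.normal(3.0, 1.0, 100)  # recipients deliberately shifted from donors
observed, p = bootstrap_p_value(x_don, y_don, x_rec)
print(observed, p)
```

A small p-value in this sketch suggests that the recipients differ from the donor set on the hinge variable, so the imputed values deserve closer scrutiny.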
Keywords: Specific Variable; Data Fusion; Validation Statistic; Impute Data; Entrepreneurial Purpose
Date: 2013
Persistent link: https://EconPapers.repec.org/RePEc:spr:sprchp:978-3-642-21308-3_8
Ordering information: This item can be ordered from
http://www.springer.com/9783642213083
DOI: 10.1007/978-3-642-21308-3_8