File concatenation of survey data: a computer intensive approach to sampling weights estimation

Marco Ballin, Marco Di Zio, Marcello D'Orazio, Mauro Scanu and Nicola Torelli
Additional contact information
Nicola Torelli: Italian National Institute of Statistics

Rivista di statistica ufficiale, 2008, vol. 10, issue 2, 5-12

Abstract: File concatenation is an approach for integrating two (or more) data sources that refer to the same target population. It consists of treating the concatenation of the two files as a single data set. Although this approach seems natural in an integration procedure, it is not generally adopted, especially when the data are obtained through different complex sampling designs. The reason is that it requires computing sampling weights for the concatenated data set, which is often a very hard task. To make the computation tractable, simplifying assumptions are usually adopted: for instance, with large populations and simple survey designs it can be reasonable to assume that the chance of a unit being included in both samples is negligible. This assumption becomes questionable when different, complex survey designs are used. This is the case, for instance, in enterprise surveys, where designs with probability-proportional-to-size selection are common and the probability that the same unit is included in both samples is far from negligible. In this paper we propose a method for computing the sampling weights of a concatenated sample in a general sampling design context. The method is computer intensive, and its applicability to real cases is shown by computing the weights of a data set obtained by concatenating two agricultural Istat surveys: the Farm Structural Survey and the Farm Economic Accounts Survey.
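
As a rough illustration of why the overlap matters (not the procedure or the data of the paper): if two samples A and B are drawn independently, a unit with inclusion probabilities p_A and p_B appears in the concatenated file with probability p_A + p_B - p_A * p_B, and its weight is the inverse of that value. The sketch below compares this with the "negligible overlap" shortcut p_A + p_B and with a Monte Carlo estimate. It assumes independent Poisson sampling with probabilities proportional to size; the population sizes, the expected sample sizes and all function names are hypothetical.

```python
import random

def pps_probs(sizes, n):
    """Inclusion probabilities proportional to size, capped at 1.
    n is the nominal expected sample size (capping can lower it)."""
    total = sum(sizes)
    return [min(1.0, n * s / total) for s in sizes]

def poisson_sample(probs, rng):
    """Poisson sampling: each unit is selected independently with its own probability."""
    return {i for i, p in enumerate(probs) if rng.random() < p}

def union_inclusion_mc(probs_a, probs_b, reps, seed=0):
    """Monte Carlo estimate of the probability that each unit is in A or B (or both)."""
    rng = random.Random(seed)
    counts = [0] * len(probs_a)
    for _ in range(reps):
        union = poisson_sample(probs_a, rng) | poisson_sample(probs_b, rng)
        for i in union:
            counts[i] += 1
    return [c / reps for c in counts]

# Hypothetical skewed population (e.g. enterprise sizes).
sizes = [1, 2, 5, 10, 50, 200]
p_a = pps_probs(sizes, 2)   # design of sample A
p_b = pps_probs(sizes, 3)   # design of sample B

# Exact union probability under independence, and the shortcut that ignores overlap.
exact = [a + b - a * b for a, b in zip(p_a, p_b)]
approx = [a + b for a, b in zip(p_a, p_b)]
mc = union_inclusion_mc(p_a, p_b, reps=20000)

# Weights for the concatenated file are the inverse union inclusion probabilities.
weights = [1.0 / p for p in exact]

for s, e, a, m in zip(sizes, exact, approx, mc):
    print(f"size={s:4d} exact={e:.3f} no-overlap={a:.3f} monte-carlo={m:.3f}")
```

For the largest unit both designs select it with certainty, so the shortcut yields a "probability" of 2, while the exact and simulated values stay at 1. With real complex designs the joint inclusion probability generally has no such closed form, which is where a simulation-based estimate becomes useful.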

Keywords: data fusion; data integration; missing data
JEL-codes: C83
Date: 2008

Downloads: (external link)
http://www.istat.it/it/files/2011/05/2_3_20081.pdf (application/pdf)

Persistent link: https://EconPapers.repec.org/RePEc:isa:journl:v:10:y:2008:i:2:p:5-12

More articles in Rivista di statistica ufficiale from ISTAT - Italian National Institute of Statistics (Rome, Italy).
Bibliographic data for series maintained by Stefania Rossetti.

 
Page updated 2025-03-19
Handle: RePEc:isa:journl:v:10:y:2008:i:2:p:5-12