An Evolutionary Schema for Using “it-is-what-it-is” Data in Official Statistics

Lothian Jack, Holmberg Anders and Seyb Allyson
Additional contact information
Lothian Jack: 360 Hinton Ave S, Ottawa ON K1Y 1A5, Canada.
Holmberg Anders: Statistics Norway, Division for Methodology, Akersveien 26, Oslo, Norway.
Seyb Allyson: Stats NZ, Statistical Methods, Private Bag 4741, Christchurch 8011, New Zealand.

Journal of Official Statistics, 2019, vol. 35, issue 1, 137-165

Abstract: The linking of disparate data sets across time, space and sources is probably the foremost current issue facing Central Statistical Agencies (CSAs). If one reviews the current literature looking for the prevalent challenges facing CSAs, three issues stand out: 1) using administrative data effectively; 2) big data and what it means for CSAs; and 3) integrating disparate data sets (such as health, education and wealth) to provide measurable facts that can guide policy makers. CSAs are being challenged to explore the same kinds of problems faced by Google, Facebook, and Yahoo, which are using graphical/semantic web models for organizing, searching and analysing data. Additionally, time and space (geography) are becoming more important dimensions (domains) for CSAs as they start to explore new data sources and ways to integrate those to study relationships. Central agency methodologists are being pushed to include these new perspectives in their standard theories, practices and policies. Like most methodologists, the authors see surveys and the publication of their results as a process where estimation is the key tool to achieve the final goal of an accurate statistical output. Randomness and sampling exist to support this goal, and early on it was clear to us that the incoming “it-is-what-it-is” data sources were not randomly selected. These sources were obviously biased and thus would produce biased estimates. So, we set out to design a strategy to deal with this issue. This article presents a schema for integrating and linking traditional and non-traditional datasets. Like all survey methodologies, this schema addresses the fundamental issues of representativeness, estimation and total survey error measurement.

Keywords: Representativeness; timeline databases; statistical registers; Estimation; administrative data
Date: 2019
Downloads: (external link)
https://doi.org/10.2478/jos-2019-0007 (text/html)

Persistent link: https://EconPapers.repec.org/RePEc:vrs:offsta:v:35:y:2019:i:1:p:137-165:n:7

DOI: 10.2478/jos-2019-0007

Journal of Official Statistics is currently edited by Annica Isaksson and Ingegerd Jansson

More articles in Journal of Official Statistics from Sciendo
Bibliographic data for series maintained by Peter Golla.

 
Page updated 2025-03-20
Handle: RePEc:vrs:offsta:v:35:y:2019:i:1:p:137-165:n:7