Improving the Representativeness of a Simple Random Sample: An Optimization Model and Its Application to the Continuous Sample of Working Lives
Vicente Núñez-Antón,
Juan Manuel Pérez-Salamero González,
Marta Regúlez-Castillo and
Carlos Vidal-Melia
Additional contact information
Juan Manuel Pérez-Salamero González: Department of Financial Economics and Actuarial Science, Faculty of Economics, University of Valencia, 46022 Valencia, Spain
Marta Regúlez-Castillo: Department of Econometrics and Statistics (A.E. III), Faculty of Economics and Business, University of the Basque Country UPV/EHU, 48015 Bilbao, Spain
Mathematics, 2020, vol. 8, issue 8, 1-27
Abstract:
This paper proposes an optimization model for selecting a larger subsample that improves the representativeness of a simple random sample previously obtained from a population larger than the population of interest. The problem formulation involves convex mixed-integer nonlinear programming (convex MINLP) and is, therefore, NP-hard. However, the solution is found by maximizing the size of the subsample taken from a stratified random sample with proportional allocation and restricting it to a p-value large enough to achieve a good fit to the population of interest using Pearson’s chi-square goodness-of-fit test. The paper also applies the model to the Continuous Sample of Working Lives (CSWL), a set of anonymized microdata containing information on individuals from Spanish Social Security records. The results prove that it is possible to obtain a larger subsample from the CSWL that (far) better represents the pensioner population for each of the waves analyzed.
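The idea in the abstract, choosing the largest subsample whose Pearson chi-square p-value against the population of interest stays above a threshold, can be illustrated with a toy sketch. This is not the authors' convex MINLP formulation or solver: it is a brute-force enumeration that is only feasible for a handful of small strata. The function names, the stratum counts, the population shares and the threshold are all hypothetical and chosen for illustration only.

```python
import math
from itertools import product


def chi2_sf(x, df):
    """Upper tail P(X > x) of a chi-square distribution with df >= 1 degrees of freedom."""
    if x <= 0:
        return 1.0
    y = x / 2.0
    # Base cases of the regularized upper incomplete gamma function Q(a, y):
    # Q(1/2, y) = erfc(sqrt(y)) for odd df, Q(1, y) = exp(-y) for even df.
    if df % 2 == 1:
        a, q = 0.5, math.erfc(math.sqrt(y))
    else:
        a, q = 1.0, math.exp(-y)
    # Recurrence: Q(a + 1, y) = Q(a, y) + y**a * exp(-y) / Gamma(a + 1).
    while a < df / 2.0:
        q += y ** a * math.exp(-y) / math.gamma(a + 1.0)
        a += 1.0
    return q


def gof_p_value(counts, shares):
    """Pearson chi-square statistic and p-value of stratum counts vs. population shares."""
    n = sum(counts)
    stat = sum((o - n * p) ** 2 / (n * p) for o, p in zip(counts, shares))
    return stat, chi2_sf(stat, len(counts) - 1)


def largest_fitting_subsample(available, shares, min_p):
    """Exhaustively search for the largest subsample with p-value >= min_p.

    available[i] is the number of units the original sample holds in stratum i,
    so the subsample cannot take more than that from stratum i.
    """
    best = None
    for counts in product(*(range(c + 1) for c in available)):
        n = sum(counts)
        if n == 0:
            continue
        _, p = gof_p_value(counts, shares)
        if p >= min_p and (best is None or n > sum(best)):
            best = counts
    return best


# Hypothetical toy data: three strata, assumed population shares, threshold 0.9.
available = [6, 3, 5]
shares = [0.5, 0.3, 0.2]
best = largest_fitting_subsample(available, shares, min_p=0.9)
print(best, gof_p_value(best, shares))
```

The exhaustive search grows exponentially with the number of strata, which is consistent with the NP-hardness noted in the abstract and is why a dedicated optimization model is needed for real data such as the CSWL.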
Keywords: chi-square test; continuous sample of working lives; optimization; p-value; subsampling
JEL-codes: C
Date: 2020
Downloads:
https://www.mdpi.com/2227-7390/8/8/1225/pdf (application/pdf)
https://www.mdpi.com/2227-7390/8/8/1225/ (text/html)
Related works:
Working Paper: Improving the representativeness of a simple random sample: an optimization model and its application to the Continuous Sample of Working Lives (2019) 
Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:8:y:2020:i:8:p:1225-:d:389925
Mathematics is currently edited by Ms. Emma He
Bibliographic data for series maintained by MDPI Indexing Manager.