Improving the Representativeness of a Simple Random Sample: An Optimization Model and Its Application to the Continuous Sample of Working Lives

Vicente Núñez-Antón, Juan Manuel Pérez-Salamero González, Marta Regúlez-Castillo and Carlos Vidal-Melia
Additional contact information
Juan Manuel Pérez-Salamero González: Department of Financial Economics and Actuarial Science, Faculty of Economics, University of Valencia, 46022 Valencia, Spain
Marta Regúlez-Castillo: Department of Econometrics and Statistics (A.E. III), Faculty of Economics and Business, University of the Basque Country UPV/EHU, 48015 Bilbao, Spain

Mathematics, 2020, vol. 8, issue 8, 1-27

Abstract: This paper proposes an optimization model for selecting a larger subsample that improves the representativeness of a simple random sample previously obtained from a population larger than the population of interest. The problem formulation involves convex mixed-integer nonlinear programming (convex MINLP) and is, therefore, NP-hard. However, the solution is found by maximizing the size of the subsample taken from a stratified random sample with proportional allocation and restricting it to a p-value large enough to achieve a good fit to the population of interest using Pearson’s chi-square goodness-of-fit test. The paper also applies the model to the Continuous Sample of Working Lives (CSWL), a set of anonymized microdata containing information on individuals from Spanish Social Security records. The results prove that it is possible to obtain a larger subsample from the CSWL that (far) better represents the pensioner population for each of the waves analyzed.
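
The idea in the abstract (make the subsample as large as possible while keeping the p-value of Pearson's chi-square goodness-of-fit test above a chosen level) can be illustrated with a toy sketch. This is not the paper's convex MINLP formulation or solution method: it is an exhaustive search over a tiny, made-up example. The stratum counts, population proportions, threshold, and all function names are assumptions introduced for illustration. Three strata are used so that the test has 2 degrees of freedom, where the p-value has the exact closed form exp(-x/2).

```python
import math
from itertools import product


def pearson_chi2(counts, proportions):
    """Pearson statistic of subsample counts against population proportions."""
    n = sum(counts)
    return sum((c - n * p) ** 2 / (n * p) for c, p in zip(counts, proportions))


def p_value_df2(stat):
    """Chi-square survival function, valid only for 2 degrees of freedom."""
    return math.exp(-stat / 2.0)


def largest_subsample(available, proportions, min_p):
    """Brute-force search: the largest counts per stratum with p-value >= min_p."""
    best = None
    for counts in product(*(range(a + 1) for a in available)):
        if sum(counts) == 0:
            continue  # an empty subsample has no defined statistic
        if p_value_df2(pearson_chi2(counts, proportions)) >= min_p:
            if best is None or sum(counts) > sum(best):
                best = counts
    return best


# Hypothetical data: individuals available per stratum in the original sample,
# and the known stratum proportions in the population of interest.
available = [60, 25, 15]
proportions = [0.5, 0.3, 0.2]
min_p = 0.9  # assumed "large enough" p-value

best = largest_subsample(available, proportions, min_p)
best_p = p_value_df2(pearson_chi2(best, proportions))
print(best, sum(best), round(best_p, 3))
```

In this made-up example the full sample of 100 fits the population poorly (p-value of about 0.13), so the search returns a smaller subsample that passes the threshold. Exhaustive enumeration is only feasible for toy sizes; a problem with many strata and large counts would need a dedicated optimization solver.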

Keywords: chi-square test; continuous sample of working lives; optimization; p-value; subsampling
JEL-codes: C
Date: 2020
Citations: 1 (in EconPapers)

Downloads: (external link)
https://www.mdpi.com/2227-7390/8/8/1225/pdf (application/pdf)
https://www.mdpi.com/2227-7390/8/8/1225/ (text/html)

Related works:
Working Paper: Improving the representativeness of a simple random sample: an optimization model and its application to the Continuous Sample of Working Lives (2019)


Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:8:y:2020:i:8:p:1225-:d:389925


Mathematics is currently edited by Ms. Emma He

Bibliographic data for series maintained by MDPI Indexing Manager.

 
Page updated 2025-03-27
Handle: RePEc:gam:jmathe:v:8:y:2020:i:8:p:1225-:d:389925