Combining the strengths of Dutch survey and register data in a data challenge to predict fertility (PreFer)
Elizaveta Sivak (),
Paulina Pankowska,
Adriënne Mendrik,
Tom Emery,
Javier Garcia-Bernardo,
Seyit Höcük,
Kasia Karpinska,
Angelica Maineri,
Joris Mulder,
Malvina Nissim and
Gert Stulp
Additional contact information
Elizaveta Sivak: University of Groningen
Paulina Pankowska: Utrecht University
Adriënne Mendrik: Eyra
Tom Emery: Erasmus University Rotterdam
Javier Garcia-Bernardo: Utrecht University
Seyit Höcük: Centerdata, Tilburg University
Kasia Karpinska: Erasmus University Rotterdam
Angelica Maineri: Erasmus University Rotterdam
Joris Mulder: Centerdata, Tilburg University
Malvina Nissim: University of Groningen
Gert Stulp: University of Groningen
Journal of Computational Social Science, 2024, vol. 7, issue 2, No 10, 1403-1431
Abstract:
Abstract The social sciences have produced an impressive body of research on determinants of fertility outcomes, or whether and when people have children. However, the strength of these determinants and underlying theories are rarely evaluated on their predictive ability on new data. This prevents us from systematically comparing studies, hindering the evaluation and accumulation of knowledge. In this paper, we present two datasets which can be used to study the predictability of fertility outcomes in the Netherlands. One dataset is based on the LISS panel, a longitudinal survey which includes thousands of variables on a wide range of topics, including individual preferences and values. The other is based on the Dutch register data which lacks attitudinal data but includes detailed information about the life courses of millions of Dutch residents. We provide information about the datasets and the samples, and describe the fertility outcome of interest. We also introduce the fertility prediction data challenge PreFer which is based on these datasets and will start in Spring 2024. We outline the ways in which measuring the predictability of fertility outcomes using these datasets and combining their strengths in the data challenge can advance our understanding of fertility behaviour and computational social science. We further provide details for participants on how to take part in the data challenge.
Keywords: Fertility; Data challenge; Benchmark; Out-of-sample prediction; Survey data; Register data (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s42001-024-00275-6 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:jcsosc:v:7:y:2024:i:2:d:10.1007_s42001-024-00275-6
Ordering information: This journal article can be ordered from
http://www.springer. ... iences/journal/42001
DOI: 10.1007/s42001-024-00275-6
Access Statistics for this article
Journal of Computational Social Science is currently edited by Takashi Kamihigashi
More articles in Journal of Computational Social Science from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().