Ten propositions on machine learning in official statistics
Arnout Delden (),
Joep Burger () and
Marco Puts ()
Additional contact information
Arnout Delden: Statistics Netherlands
Joep Burger: Statistics Netherlands
Marco Puts: Statistics Netherlands
AStA Wirtschafts- und Sozialstatistisches Archiv, 2023, vol. 17, issue 3, No 2, 195-221
Abstract:
Abstract Machine learning (ML) is increasingly being used in official statistics with a range of different applications. The main focus of ML models is to accurately predict attributes of new, unlabeled cases whereas the focus of classical statistical models is to describe the relations between independent and dependent variables. There is already a lot of experience in the sound use of classical statistical models in official statistics, but for ML models this is still under development. Recent discussions concerning the quality aspects of using ML in official statistics have concentrated on its implications for existing quality frameworks. We are in favor of the use of ML in official statistics, but the main question remains as to what factors need to be considered when using ML models in official statistics. As a means of raising awareness regarding these factors, we pose ten propositions regarding the (sensible) use of ML in official statistics.
Keywords: Performance measures; Causation; Explainability; Mass imputation; Train test split (search for similar items in EconPapers)
Date: 2023
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s11943-023-00330-0 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:astaws:v:17:y:2023:i:3:d:10.1007_s11943-023-00330-0
Ordering information: This journal article can be ordered from
http://www.springer. ... ce/journal/11943/PS2
DOI: 10.1007/s11943-023-00330-0
Access Statistics for this article
AStA Wirtschafts- und Sozialstatistisches Archiv is currently edited by Ralf Münnich
More articles in AStA Wirtschafts- und Sozialstatistisches Archiv from Springer, Deutsche Statistische Gesellschaft - German Statistical Society
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().