Is your dataset big enough? Sample size requirements when using artificial neural networks for discrete choice analysis
Ahmad Alwosheel,
Sander van Cranenburgh and
Caspar Chorus
Journal of choice modelling, 2018, vol. 28, issue C, 167-182
Abstract:
Artificial Neural Networks (ANNs) are increasingly used for discrete choice analysis. But, at present, it is unknown what sample size requirements are appropriate when using ANNs in this particular context. This paper fills this knowledge gap: we empirically establish a rule-of-thumb for ANN-based discrete choice analysis based on analyses of synthetic and real data. To investigate the effect of complexity of the data generating process on the minimum required sample size, we conduct extensive Monte Carlo analyses using a series of different model specifications with different levels of model complexity, including RUM and RRM models, with and without random taste parameters. Based on our analyses we advise to use a minimum sample size of fifty times the number of weights in the ANN; it should be noted, that the number of weights is generally much larger than the number of parameters in a discrete choice model. This rule-of-thumb is considerably more conservative than the rule-of-thumb that is most often used in the ANN community, which advises to use at least ten times the number of weights.
Date: 2018
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (27)
Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S1755534518300058
Full text for ScienceDirect subscribers only
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:eee:eejocm:v:28:y:2018:i:c:p:167-182
DOI: 10.1016/j.jocm.2018.07.002
Access Statistics for this article
Journal of choice modelling is currently edited by S. Hess and J.M. Rose
More articles in Journal of choice modelling from Elsevier
Bibliographic data for series maintained by Catherine Liu ().