Probability Matching and Reinforcement Learning*
Javier Rivas
No 11/20, Discussion Papers in Economics from Division of Economics, School of Business, University of Leicester
Abstract:
Probability matching occurs when an action is chosen with a frequency equivalent to the probability of that action being the best choice. This sub-optimal behavior has been reported repeatedly by psychologist and experimental economist. We provide an evolutionary foundation for this phenomenon by showing that learning by reinforcement can lead to probability matching and, if learning occurs suffciently slowly, probability matching does not only occur in choice frequencies but also in choice probabilities. Our results are completed by proving that there exists no quasi-linear reinforcement learning specification such that behavior is optimal for all environments where counterfactuals are observed.
Keywords: Probability Matching; Reinforcement Learning (search for similar items in EconPapers)
JEL-codes: C73 (search for similar items in EconPapers)
Date: 2011-03
New Economics Papers: this item is included in nep-cbe, nep-evo, nep-gth and nep-neu
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.le.ac.uk/economics/research/RePEc/lec/leecon/dp11-20.pdf (application/pdf)
Related works:
Journal Article: Probability matching and reinforcement learning (2013) 
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:lec:leecon:11/20
Ordering information: This working paper can be ordered from
https://www2.le.ac.u ... -1/discussion-papers
Access Statistics for this paper
More papers in Discussion Papers in Economics from Division of Economics, School of Business, University of Leicester School of Business, University of Leicester, University Road. Leicester. LE1 7RH. UK Provider-Homepage: https://le.ac.uk/school-of-business. Contact information at EDIRC.
Bibliographic data for series maintained by Abbie Sleath ().