Counter Intuitive Learning: An Exploratory Study
Nobuyuki Hanaki,
Alan Kirman and
Paul Pezanis-Christou
No 6029, CESifo Working Paper Series from CESifo
Abstract:
The literature on learning in unknown environments emphasises reinforcing on actions which produce positive results. But, in some cases, success requires shifting from a currently successful actions to others. We examine, experimentally and theoretically in a very simple framework, how individuals initially learn by exploiting information from the pay-offs of actions taken but also from exploring new actions. We analyse if and how they learn that pay-offs are inter-temporally dependent. We then ran the same experiments but where individuals could observe the actions taken or the pay-offs obtained by others or both. Such observations improved pay-offs if one of the pair had learned to obtain the maximum pay-off.
Keywords: multi-armed bandit; reinforcement learning; eureka moment; pay-off patterns; observational learning (search for similar items in EconPapers)
JEL-codes: D81 D83 (search for similar items in EconPapers)
Date: 2016
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.cesifo.org/DocDL/cesifo1_wp6029.pdf (application/pdf)
Related works:
Working Paper: Counter Intuitive Learning: An Exploratory Study (2016) 
Working Paper: Counter intuitive learning: An exploratory study (2016) 
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:ces:ceswps:_6029
Access Statistics for this paper
More papers in CESifo Working Paper Series from CESifo Contact information at EDIRC.
Bibliographic data for series maintained by Klaus Wohlrabe ().