EconPapers    
Economics at your fingertips  
 

Counter Intuitive Learning: An Exploratory Study

Nobuyuki Hanaki, Alan Kirman and Paul Pezanis-Christou

No 6029, CESifo Working Paper Series from CESifo

Abstract: The literature on learning in unknown environments emphasises reinforcing on actions which produce positive results. But, in some cases, success requires shifting from a currently successful actions to others. We examine, experimentally and theoretically in a very simple framework, how individuals initially learn by exploiting information from the pay-offs of actions taken but also from exploring new actions. We analyse if and how they learn that pay-offs are inter-temporally dependent. We then ran the same experiments but where individuals could observe the actions taken or the pay-offs obtained by others or both. Such observations improved pay-offs if one of the pair had learned to obtain the maximum pay-off.

Keywords: multi-armed bandit; reinforcement learning; eureka moment; pay-off patterns; observational learning (search for similar items in EconPapers)
JEL-codes: D81 D83 (search for similar items in EconPapers)
Date: 2016
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.cesifo.org/DocDL/cesifo1_wp6029.pdf (application/pdf)

Related works:
Working Paper: Counter Intuitive Learning: An Exploratory Study (2016) Downloads
Working Paper: Counter intuitive learning: An exploratory study (2016) Downloads
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:ces:ceswps:_6029

Access Statistics for this paper

More papers in CESifo Working Paper Series from CESifo Contact information at EDIRC.
Bibliographic data for series maintained by Klaus Wohlrabe ().

 
Page updated 2025-03-30
Handle: RePEc:ces:ceswps:_6029