Counter intuitive learning: An exploratory study
Nobuyuki Hanaki,
Alan Kirman and
Paul Pezanis-Christou
Working Papers from HAL
Abstract:
The literature on learning in unknown environments emphasises reinforcing on actions which produce positive results. But, in some cases, success requires shifting from a currently successful actions to others. We examine, experimentally and theoretically in a very simple framework, how individuals initially learn by exploiting information from the pay-offs of actions taken but also from exploring new actions. We analyse if and how they learn that pay-offs are inter-temporally dependent. We then ran the same experiments but where individuals could observe the actions taken or the pay-offs obtained by others or both. Such observations improved pay-offs if one of the pair had learned to obtain the maximum pay-off.
Keywords: multi-armed bandit; reinforcement learning; eureka moment; pay-off patterns; observational learning (search for similar items in EconPapers)
Date: 2016-09-01
Note: View the original document on HAL open archive server: https://hal.science/hal-01358716
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://hal.science/hal-01358716/document (application/pdf)
Related works:
Working Paper: Counter Intuitive Learning: An Exploratory Study (2016) 
Working Paper: Counter Intuitive Learning: An Exploratory Study (2016) 
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:hal:wpaper:hal-01358716
Access Statistics for this paper
More papers in Working Papers from HAL
Bibliographic data for series maintained by CCSD ().