Counter intuitive learning: An exploratory study

Hanaki, Nobuyuki; Kirman, Alan; Pezanis-Christou, Paul

Counter intuitive learning: An exploratory study

Nobuyuki Hanaki, Alan Kirman and Paul Pezanis-Christou

Abstract: The literature on learning in unknown environments emphasises reinforcing on actions which produce positive results. But, in some cases, success requires shifting from a currently successful actions to others. We examine, experimentally and theoretically in a very simple framework, how individuals initially learn by exploiting information from the pay-offs of actions taken but also from exploring new actions. We analyse if and how they learn that pay-offs are inter-temporally dependent. We then ran the same experiments but where individuals could observe the actions taken or the pay-offs obtained by others or both. Such observations improved pay-offs if one of the pair had learned to obtain the maximum pay-off.

Keywords: multi-armed bandit; reinforcement learning; eureka moment; pay-off patterns; observational learning (search for similar items in EconPapers)
Date: 2016-09-01
Note: View the original document on HAL open archive server: https://hal.science/hal-01358716v1
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://hal.science/hal-01358716v1/document (application/pdf)

Related works:
Working Paper: Counter Intuitive Learning: An Exploratory Study (2016)
Working Paper: Counter Intuitive Learning: An Exploratory Study (2016)
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:hal:wpaper:hal-01358716

Access Statistics for this paper

More papers in Working Papers from HAL
Bibliographic data for series maintained by CCSD ().