Observational and reinforcement pattern-learning: An exploratory study

Hanaki, Nobuyuki; Kirman, Alan; Pezanis-Christou, Paul

Observational and reinforcement pattern-learning: An exploratory study

Nobuyuki Hanaki, Alan Kirman and Paul Pezanis-Christou

Abstract: Understanding how individuals learn in an unknown environment is an important problem in economics. We model and examine experimentally behavior in a very simple multi-armed bandit framework in which participants do not know the inter-temporal payoff structure. We propose a baseline reinforcement learning model that allows for pattern-recognition and change in the strategy space. We also analyse three augmented versions that accommodate observational learning from the actions and/or payoffs of another player. The models successfully reproduce the distributional properties of observed discovery times and total payoffs. Our study further shows that when one of the pair discovers the hidden pattern, observing another's actions and/or payoffs improves discovery time compared to the baseline case.

Keywords: Multi-armed bandit; Reinforcement learning; Payoff patterns; Observational learning (search for similar items in EconPapers)
Date: 2018-05
New Economics Papers: this item is included in nep-cbe and nep-exp
Note: View the original document on HAL open archive server: https://shs.hal.science/halshs-01723513v1
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (5)

Published in European Economic Review, 2018, 104, pp.1 - 21. ⟨10.1016/j.euroecorev.2018.01.009⟩

Downloads: (external link)
https://shs.hal.science/halshs-01723513v1/document (application/pdf)

Related works:
Journal Article: Observational and reinforcement pattern-learning: An exploratory study (2018)
Working Paper: Observational and Reinforcement Pattern-learning: An Exploratory Study (2017)
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:hal:journl:halshs-01723513

DOI: 10.1016/j.euroecorev.2018.01.009

Access Statistics for this paper

More papers in Post-Print from HAL
Bibliographic data for series maintained by CCSD ().