An Axiomatization of Learning Rules when Counterfactuals are not Observed
Friederike Mengel and
Javier Rivas
The B.E. Journal of Theoretical Economics, 2012, vol. 12, issue 1, 19
Abstract:
In this paper we study learning procedures when counterfactuals (payoffs of not chosen actions) are not observed. The decision maker reasons in two steps: First, she updates her propensities for choosing each action after every payoff experience, where propensities can be interpreted as preferences. Then, she transforms these propensities into choice probabilities. We introduce a set of axioms on how propensities are updated and on how these propensities are translated into choices and study the decision marker's behavior when such axioms are in place. Our characterization includes the linear reinforcement learning rule from Roth and Erev (1995).
Keywords: learning without counterfactuals; partial information; reinforcement learning (search for similar items in EconPapers)
Date: 2012
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (2)
Downloads: (external link)
https://doi.org/10.1515/1935-1704.1828 (text/html)
For access to full text, subscription to the journal or payment for the individual article is required.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:bpj:bejtec:v:12:y:2012:i:1:n:25
Ordering information: This journal article can be ordered from
https://www.degruyter.com/journal/key/bejte/html
DOI: 10.1515/1935-1704.1828
Access Statistics for this article
The B.E. Journal of Theoretical Economics is currently edited by Burkhard C. Schipper
More articles in The B.E. Journal of Theoretical Economics from De Gruyter
Bibliographic data for series maintained by Peter Golla ().