Learn Without Counterfactuals
Friederike Mengel and
Javier Rivas
No 10/15, Discussion Papers in Economics from Division of Economics, School of Business, University of Leicester
Abstract:
In this paper we study learning procedures when counterfactuals (payo s of not-chosen actions) are not observed. The decision maker reasons in two steps: First, she updates her propensities for each action after every payo experience, where propensity is de ned as how much she prefers each action. Then, she transforms these propensities into choice probabilities. We introduce natural axioms in the way propensities are updated and the way propensities are translated into choice, and study the decision marker's behavior when such axioms are in place.
Keywords: Adaptive Learning; Counterfactuals; Partial Information; Reinforcement Learning (search for similar items in EconPapers)
JEL-codes: D83 (search for similar items in EconPapers)
Date: 2010-02
New Economics Papers: this item is included in nep-cbe and nep-dcm
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.le.ac.uk/economics/research/RePEc/lec/leecon/dp10-15.pdf (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:lec:leecon:10/15
Ordering information: This working paper can be ordered from
https://www2.le.ac.u ... -1/discussion-papers
Access Statistics for this paper
More papers in Discussion Papers in Economics from Division of Economics, School of Business, University of Leicester School of Business, University of Leicester, University Road. Leicester. LE1 7RH. UK Provider-Homepage: https://le.ac.uk/school-of-business. Contact information at EDIRC.
Bibliographic data for series maintained by Abbie Sleath ().