Endogenous Learning with Bounded Memory

Kocer, Yilmaz

Endogenous Learning with Bounded Memory

No 1290, Working Papers from Princeton University, Department of Economics, Econometric Research Program.

Abstract: I analyze the effects of memory limitations on the endogenous learning behavior of an agent in a standard two-armed bandit problem. An infinitely lived agent chooses each period between two alternatives with unknown types, to maximize discounted payoffs. The agent can experiment with each alternative and receive payoffs that are partially informative about its type. The agent does not recall past actions or payoffs. Instead, the agent has a finite number of memory states as in Wilson (2004): he can condition his actions only on the memory state he is currently in, and he can update his memory state depending on the payoff received. I find that the inclination to choose the currently better alternative does not con- strain learning in the limit as discounting vanishes. Even though uncertainties are independent, the agent optimally holds correlated beliefs across memory states. Optimally, memory states reflect the magnitude of the relative ranking of alternatives. After a high payoff from one of the alternatives, the agent optimally moves to a memory state with more pessimistic beliefs on the other, even though no information about the latter alternative is received. For the case where one alternative is substantially more informative than the other, he chooses the latter only for myopic exploitation purposes, and ignores any information about it, suggesting specialization in learning. For the special case with one known (safe) alternative, a sufficiently patient agent never ceases experimentation and tries the unknown alternative at least occasionally after any history. Furthermore, he chooses the safe alternative with more optimistic beliefs than the optimal full memory cutoff belief, suggesting under-experimentation. Both are counter to what theory predicts with full memory, but in agreement with experimental findings.

Keywords: endogenous learning behavior; memory limitations; two-armed bandit problem (search for similar items in EconPapers)
JEL-codes: C01 D01 D11 H31 (search for similar items in EconPapers)
Date: 2010-11
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (6)

Downloads: (external link)
https://economics.ucdavis.edu/events/papers/YilmazOct16.pdf
Our link check indicates that this URL is bad, the error code is: 404 Not Found

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:pri:metric:wp001_2011.pdf

Access Statistics for this paper

More papers in Working Papers from Princeton University, Department of Economics, Econometric Research Program. Contact information at EDIRC.
Bibliographic data for series maintained by Bobray Bordelon ().