LEARNING IN RANDOM UTILITY MODELS VIA ONLINE DECISION PROBLEMS

Melo, Emerson

LEARNING IN RANDOM UTILITY MODELS VIA ONLINE DECISION PROBLEMS

Emerson Melo ()
Additional contact information
Emerson Melo: Indiana University, Bloomington

CAEPR Working Papers from Center for Applied Economics and Policy Research, Department of Economics, Indiana University Bloomington

Abstract: This paper studies the Random Utility Model (RUM) in environments where the decision maker is imperfectly informed about the payoffs associated to each of the alternatives he faces. By embedding the RUM into an online decision problem, we make four contributions. First, we propose a gradient-based learning algorithm and show that a large class of RUMs are Hannan consistent (Hannan [1957]); that is, the average difference between the expected payoffs generated by a RUM and that of the best ?xed policy in hindsight goes to zero as the number of periods increase. Second, we show that the class of Generalized Extreme Value (GEV) models can be implemented with our learning algorithm. Examples in the GEV class include the Nested Logit, Ordered, and Product Differentiation models among many others. Third, we show that our gradient-based algorithm is the dual, in a convex analysis sense, of the Follow the Regularized Leader (FTRL) algorithm, which is widely used in the Machine Learning literature. Finally, we discuss how our approach can incorporate recency bias and be used to implement prediction markets in general environments.javascript:void(0);

Keywords: Random utility models; Multinomial Logit Model; Generalized Nested Logit models; GEV class; Online optimization; Online learning; Hannan consistency; no-regret learning (search for similar items in EconPapers)
Pages: 51 pages
Date: 2021-08
New Economics Papers: this item is included in nep-big, nep-dcm and nep-upt
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
https://caepr.indiana.edu/RePEc/inu/caeprp/caepr2022-003.pdf (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:inu:caeprp:2022003

Access Statistics for this paper

More papers in CAEPR Working Papers from Center for Applied Economics and Policy Research, Department of Economics, Indiana University Bloomington Contact information at EDIRC.
Bibliographic data for series maintained by Center for Applied Economics and Policy Research ().