LEARNING IN RANDOM UTILITY MODELS VIA ONLINE DECISION PROBLEMS
Emerson Melo ()
Additional contact information
Emerson Melo: Indiana University, Bloomington
CAEPR Working Papers from Center for Applied Economics and Policy Research, Department of Economics, Indiana University Bloomington
Abstract:
This paper studies the Random Utility Model (RUM) in environments where the decision maker is imperfectly informed about the payoffs associated to each of the alternatives he faces. By embedding the RUM into an online decision problem, we make four contributions. First, we propose a gradient-based learning algorithm and show that a large class of RUMs are Hannan consistent (Hannan [1957]); that is, the average difference between the expected payoffs generated by a RUM and that of the best ?xed policy in hindsight goes to zero as the number of periods increase. Second, we show that the class of Generalized Extreme Value (GEV) models can be implemented with our learning algorithm. Examples in the GEV class include the Nested Logit, Ordered, and Product Differentiation models among many others. Third, we show that our gradient-based algorithm is the dual, in a convex analysis sense, of the Follow the Regularized Leader (FTRL) algorithm, which is widely used in the Machine Learning literature. Finally, we discuss how our approach can incorporate recency bias and be used to implement prediction markets in general environments.javascript:void(0);
Keywords: Random utility models; Multinomial Logit Model; Generalized Nested Logit models; GEV class; Online optimization; Online learning; Hannan consistency; no-regret learning (search for similar items in EconPapers)
Pages: 51 pages
Date: 2021-08
New Economics Papers: this item is included in nep-big, nep-dcm and nep-upt
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)
Downloads: (external link)
https://caepr.indiana.edu/RePEc/inu/caeprp/caepr2022-003.pdf (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:inu:caeprp:2022003
Access Statistics for this paper
More papers in CAEPR Working Papers from Center for Applied Economics and Policy Research, Department of Economics, Indiana University Bloomington Contact information at EDIRC.
Bibliographic data for series maintained by Center for Applied Economics and Policy Research ().