Testing indexability and computing Whittle and Gittins index in subcubic time
Nicolas Gast (),
Bruno Gaujal () and
Kimang Khun ()
Additional contact information
Nicolas Gast: Univ. Grenoble Alpes
Bruno Gaujal: Univ. Grenoble Alpes
Kimang Khun: Univ. Grenoble Alpes
Mathematical Methods of Operations Research, 2023, vol. 97, issue 3, No 6, 436 pages
Abstract:
Abstract Whittle index is a generalization of Gittins index that provides very efficient allocation rules for restless multi-armed bandits. In this work, we develop an algorithm to test the indexability and compute the Whittle indices of any finite-state restless bandit arm. This algorithm works in the discounted and non-discounted cases, and can compute Gittins index. Our algorithm builds on three tools: (1) a careful characterization of Whittle index that allows one to compute recursively the kth smallest index from the $$(k-1)$$ ( k - 1 ) th smallest and to test indexability, (2) the use of the Sherman–Morrison formula to make this recursive computation efficient, and (3) a sporadic use of the fastest matrix inversion and multiplication methods to obtain a subcubic complexity. We show that an efficient use of the Sherman–Morrison formula leads to an algorithm that computes Whittle index in $$(2/3)n^3 + o(n^3)$$ ( 2 / 3 ) n 3 + o ( n 3 ) arithmetic operations, where n is the number of states of the arm. The careful use of fast matrix multiplication leads to the first subcubic algorithm to compute Whittle or Gittins index: By using the current fastest matrix multiplication, the theoretical complexity of our algorithm is $$O(n^{2.5286})$$ O ( n 2.5286 ) . We also develop an efficient implementation of our algorithm that can compute indices of Markov chains with several thousands of states in less than a few seconds.
Keywords: Whittle index; Gittins index; Restless bandit; Multi-armed bandit; Sherman–Morrison; Markov decision process; Fast matrix multiplication (search for similar items in EconPapers)
Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (2)
Downloads: (external link)
http://link.springer.com/10.1007/s00186-023-00821-4 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:mathme:v:97:y:2023:i:3:d:10.1007_s00186-023-00821-4
Ordering information: This journal article can be ordered from
http://www.springer.com/economics/journal/00186
DOI: 10.1007/s00186-023-00821-4
Access Statistics for this article
Mathematical Methods of Operations Research is currently edited by Oliver Stein
More articles in Mathematical Methods of Operations Research from Springer, Gesellschaft für Operations Research (GOR), Nederlands Genootschap voor Besliskunde (NGB)
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().