EconPapers    
Economics at your fingertips  
 

On the sensitivity of restless bandits solutions to uncertainty in the models of the arms

Amit Sinha () and Aditya Mahajan ()
Additional contact information
Amit Sinha: McGill University, Department of Electrical and Computer Engineering
Aditya Mahajan: McGill University, Department of Electrical and Computer Engineering

Annals of Operations Research, 2025, vol. 355, issue 3, No 7, 2939-2969

Abstract: Abstract Restless multi-armed bandits (RMAB) are a popular framework for modeling resource allocation and scheduling problems arising in various applications. Such applications can be modeled as Markov decision processes (MDP), but optimal or sub-optimal solution through dynamic programming suffer from high complexity. RMAB provides a heuristic solution, where the solution complexity scales linearly with the number of alternatives. However, these heuristic solutions are derived under the assumption that the model of all arms are known perfectly. In this paper, we consider RMAB with uncertainty in the rewards and dynamics of the arms. In such a setting, using a robust MDP solution is not possible due to high computational complexity. So, we consider a certainty equivalence approach and bound the additional loss in performance due to model inaccuracy. Our bounds are directly in terms of the model uncertainty of each arm and we illustrate their use via examples.

Keywords: Restless multi-armed bandits; Model mismatch; Markov decision process; Certainty equivalence; Whittle index; Gittins index (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
http://link.springer.com/10.1007/s10479-025-06821-3 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:annopr:v:355:y:2025:i:3:d:10.1007_s10479-025-06821-3

Ordering information: This journal article can be ordered from
http://www.springer.com/journal/10479

DOI: 10.1007/s10479-025-06821-3

Access Statistics for this article

Annals of Operations Research is currently edited by Endre Boros

More articles in Annals of Operations Research from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-12-20
Handle: RePEc:spr:annopr:v:355:y:2025:i:3:d:10.1007_s10479-025-06821-3