Technical Note—On the Strength of Relaxations of Weakly Coupled Stochastic Dynamic Programs
David B. Brown () and
Jingwei Zhang ()
Additional contact information
David B. Brown: Fuqua School of Business, Duke University, Durham, North Carolina 27708
Jingwei Zhang: Fuqua School of Business, Duke University, Durham, North Carolina 27708
Operations Research, 2023, vol. 71, issue 6, 2374-2389
Abstract:
Many stochastic dynamic programs (DPs) have a weakly coupled structure in that a set of linking constraints in each period couples an otherwise independent collection of subproblems. Two widely studied approximations of such problems are approximate linear programs (ALPs), which involve optimizing value function approximations that additively separate across subproblems, and Lagrangian relaxations, which involve relaxing the linking constraints. It is well known that both of these approximations provide upper bounds on the optimal value function in all states and that the ALP provides a tighter upper bound in the initial state. The purpose of this short paper is to provide theoretical justification for the fact that these upper bounds are often close if not identical. We show that (i) for any weakly coupled DP, the difference between these two upper bounds—the relaxation gap —is bounded from above in terms of the integrality gap of the separation problems associated with the ALP. (ii) If subproblem rewards are uniformly bounded and some broadly applicable conditions on the linking constraints hold, the relaxation gap is bounded from above by a constant that is independent of the number of subproblems. (iii) When the linking constraints are independent of subproblem states and have a unimodular structure, the relaxation gap equals zero. The conditions for (iii) hold in several widely studied problems: generalizations of restless bandit problems, online stochastic matching problems, network revenue management problems, and price-directed control of relocating resources. These findings generalize and unify existing results.
Keywords: Decision Analysis; weakly coupled stochastic dynamic programs; Lagrangian relaxations; approximate dynamic programming; restless bandit problems; network revenue management (search for similar items in EconPapers)
Date: 2023
References: Add references at CitEc
Citations:
Downloads: (external link)
http://dx.doi.org/10.1287/opre.2022.2287 (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:inm:oropre:v:71:y:2023:i:6:p:2374-2389
Access Statistics for this article
More articles in Operations Research from INFORMS Contact information at EDIRC.
Bibliographic data for series maintained by Chris Asher ().