Reinforcement Learning with Guarantees

Zanon, Mario; Gros, Sébastien

Reinforcement Learning with Guarantees

Mario Zanon () and Sébastien Gros ()
Additional contact information
Mario Zanon: IMT School for Advanced Studies Lucca
Sébastien Gros: Norwegian University of Science and Technology (NTNU)

Chapter Chapter 8 in Model Predictive Control, 2025, pp 191-224 from Springer

Abstract: Abstract Markov Decision Processes formalize many problems of interest and have been tackled using a variety of techniques, including Reinforcement Learning (RL) and Model Predictive Control (MPC). While each approach has both advantages and disadvantages, RL and MPC have been very successful in the respective domains. RL makes it possible to obtain optimality for the real system, without the need for a model. MPC requires a model, but makes it possible to provide strict stability and safety guarantees, as well as to promote explainability. In this regard, the two techniques are complementary, and this chapter focuses on how they can be combined in order to leverage the advantages of both.

Keywords: Reinforcement learning; Model predictive control; Markov decision process; Optimality (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:

There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:dymchp:978-3-031-85256-5_8

Ordering information: This item can be ordered from
http://www.springer.com/9783031852565

DOI: 10.1007/978-3-031-85256-5_8

Access Statistics for this chapter

More chapters in Dynamic Modeling and Econometrics in Economics and Finance from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().