Safe reinforcement learning for cooperative tracking consensus problem of discrete-time multiagent systems with control barrier functions
Shihan Liu,
Zhen Yu and
Dongxu Gao
International Journal of Systems Science, 2025, vol. 56, issue 12, 2889-2909
Abstract:
The cooperative tracking consensus problem for discrete-time multiagent systems (MASs) with state and input constraints is addressed by a novel safe reinforcement learning (RL) algorithm. In MASs, all agents communicate through a directed communication graph. The algorithm is developed within the policy iteration (PI) framework, relying solely on measured data from the system trajectories in the environment. Additionally, the reward functions are enhanced by the integration of control barrier functions (CBFs) and input constraint functions. The iterative value function’s monotonically non-increasing feature guarantees that the safe set stays forward invariant inside the PI framework. Furthermore, theoretical proofs are provided to establish both safety and convergence. The algorithm’s design and implementation are elaborated through neural networks. Finally, experiment results validate the efficacy and safety of the proposed algorithm.
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
http://hdl.handle.net/10.1080/00207721.2025.2461016 (text/html)
Access to full text is restricted to subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:taf:tsysxx:v:56:y:2025:i:12:p:2889-2909
Ordering information: This journal article can be ordered from
http://www.tandfonline.com/pricing/journal/TSYS20
DOI: 10.1080/00207721.2025.2461016
Access Statistics for this article
International Journal of Systems Science is currently edited by Visakan Kadirkamanathan
More articles in International Journal of Systems Science from Taylor & Francis Journals
Bibliographic data for series maintained by Chris Longhurst ().