A data-driven α-policy iteration algorithm for optimal leader-following consensus of discrete-time multi-agent systems
Aoxue Xiang,
Xinyuan Zhao and
Ruicheng Ma
International Journal of Systems Science, 2025, vol. 56, issue 16, 4055-4072
Abstract:
In this paper, the data-driven α-policy iteration (PI) algorithm is proposed to address the optimal leader-following consensus problem of discrete-time multi-agent systems (MASs). Unlike existing results for state consensus problem that utilise the PI algorithm, the novel algorithm leverages only the system's trajectory from historical data over a finite number of steps and and does not require an admissible initial policy. Firstly, the linear quadratic regulator (LQR) design method is applied to derive the Bellman equation and the control policy based on the available measured data. Then, the data-driven α-PI algorithm is introduced, demonstrating a convergence rate that outperforms the value iteration (VI) algorithm and enabling all follower agents to track the trajectory of the leader agent. Finally, two examples are presented to demonstrate the performance of the proposed method.
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
http://hdl.handle.net/10.1080/00207721.2025.2482006 (text/html)
Access to full text is restricted to subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:taf:tsysxx:v:56:y:2025:i:16:p:4055-4072
Ordering information: This journal article can be ordered from
http://www.tandfonline.com/pricing/journal/TSYS20
DOI: 10.1080/00207721.2025.2482006
Access Statistics for this article
International Journal of Systems Science is currently edited by Visakan Kadirkamanathan
More articles in International Journal of Systems Science from Taylor & Francis Journals
Bibliographic data for series maintained by Chris Longhurst ().