EconPapers    
Economics at your fingertips  
 

Quadratic Tracking Control of Linear Stochastic Systems with Unknown Dynamics Using Average Off-Policy Q-Learning Method

Longyan Hao, Chaoli Wang () and Yibo Shi
Additional contact information
Longyan Hao: Department of Control Science and Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China
Chaoli Wang: Department of Control Science and Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China
Yibo Shi: Department of Control Science and Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China

Mathematics, 2024, vol. 12, issue 10, 1-21

Abstract: This article investigates the optimal tracking control problem for data-based stochastic discrete-time linear systems. An average off-policy Q-learning algorithm is proposed to solve the optimal control problem with random disturbances. Compared with the existing off-policy reinforcement learning (RL) algorithm, the proposed average off-policy Q-learning algorithm avoids the assumption of an initial stability control. First, a pole placement strategy is used to design an initial stable control for systems with unknown dynamics. Second, the initial stable control is used to design a data-based average off-policy Q-learning algorithm. Then, this algorithm is used to solve the stochastic linear quadratic tracking (LQT) problem, and a convergence proof of the algorithm is provided. Finally, numerical examples show that this algorithm outperforms other algorithms in a simulation.

Keywords: average off-policy Q-learning; linear quadratic tracking; data based control; reinforcement learning; stochastic linear system (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2024
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2227-7390/12/10/1533/pdf (application/pdf)
https://www.mdpi.com/2227-7390/12/10/1533/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:12:y:2024:i:10:p:1533-:d:1394587

Access Statistics for this article

Mathematics is currently edited by Ms. Emma He

More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jmathe:v:12:y:2024:i:10:p:1533-:d:1394587