Potential-Based Least-Squares Policy Iteration for a Parameterized Feedback Control System
Kang Cheng (),
Kanjian Zhang (),
Shumin Fei () and
Haikun Wei ()
Additional contact information
Kang Cheng: Southeast University
Kanjian Zhang: Southeast University
Shumin Fei: Southeast University
Haikun Wei: Southeast University
Journal of Optimization Theory and Applications, 2016, vol. 169, issue 2, No 17, 692-704
Abstract:
Abstract In the paper, a potential-based policy iteration method is proposed for optimal control of a stochastic dynamic system with an average cost criterion and a parameterized control law. In this method, the potential function and the optimal control parameters are obtained via a least-squares-based approach. The potential estimation algorithm is derived from a temporal difference learning method, which can be viewed as a continuous version of the least-squares policy evaluation algorithm. The policy iteration algorithm is validated by solving a linear quadratic gaussian problem in the simulation.
Keywords: Stochastic system; Markov decision processes; Performance potential; Least-squares policy evaluation; Policy iteration; 49K45; 93E20; 93C55 (search for similar items in EconPapers)
Date: 2016
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s10957-015-0809-6 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:joptap:v:169:y:2016:i:2:d:10.1007_s10957-015-0809-6
Ordering information: This journal article can be ordered from
http://www.springer. ... cs/journal/10957/PS2
DOI: 10.1007/s10957-015-0809-6
Access Statistics for this article
Journal of Optimization Theory and Applications is currently edited by Franco Giannessi and David G. Hull
More articles in Journal of Optimization Theory and Applications from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().