EconPapers    
Economics at your fingertips  
 

Min–Max Dynamic Programming Control for Systems with Uncertain Mathematical Models via Differential Neural Network Bellman’s Function Approximation

Alexander Poznyak, Sebastian Noriega-Marquez, Alejandra Hernandez-Sanchez, Mariana Ballesteros-Escamilla and Isaac Chairez ()
Additional contact information
Alexander Poznyak: Automatic Control Department, Centro de Investigacion y Estudios Avanzados del Instituto Politecnico Nacional, Ciudad de Mexico 07360, Mexico
Sebastian Noriega-Marquez: Automatic Control Department, Centro de Investigacion y Estudios Avanzados del Instituto Politecnico Nacional, Ciudad de Mexico 07360, Mexico
Alejandra Hernandez-Sanchez: Institute of Advanced Materials for the Sustainable Manufacturing, Tecnologico de Monterrey, Ciudad de Mexico 14380, Mexico
Mariana Ballesteros-Escamilla: Centro De Innovacion Y Desarrollo Tecnologico En Computo, Instituto Politecnico Nacional, Ciudad de Mexico 07700, Mexico
Isaac Chairez: Institute of Advanced Materials for the Sustainable Manufacturing, Tecnologico de Monterrey, Jalisco 45201, Mexico

Mathematics, 2023, vol. 11, issue 5, 1-15

Abstract: This research focuses on designing a min–max robust control based on a neural dynamic programming approach using a class of continuous differential neural networks (DNNs). The proposed controller solves the robust optimization of a proposed cost function that depends on the trajectories of a system with an uncertain mathematical model satisfying a class of non-linear perturbed systems. The dynamic programming min–max formulation enables robust control concerning bounded modelling uncertainties and disturbances. The Hamilton–Jacobi–Bellman (HJB) equation’s value function, approximated by a DNN, permits to estimate the closed-loop formulation of the controller. The controller design is based on an estimated state trajectory with the worst possible uncertainties/perturbations that provide the degree of robustness using the proposed controller. The class of learning laws for the time-varying weights in the DNN is produced by studying the HJB partial differential equation. The controller uses the solution of the obtained learning laws and a time-varying Riccati equation. A recurrent algorithm based on the Kiefer–Wolfowitz method leads to adjusting the initial conditions for the weights to satisfy the final condition of the given cost function. The robust control suggested in this work is evaluated using a numerical example confirming the optimizing solution based on the DNN approximate for Bellman’s value function.

Keywords: robust optimal control; artificial neural networks; approximate models; Kiefer–Wolfowitz method (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2023
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2227-7390/11/5/1211/pdf (application/pdf)
https://www.mdpi.com/2227-7390/11/5/1211/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:11:y:2023:i:5:p:1211-:d:1085067

Access Statistics for this article

Mathematics is currently edited by Ms. Emma He

More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jmathe:v:11:y:2023:i:5:p:1211-:d:1085067