Research on Deep Q-Network Hybridization with Extended Kalman Filter in Maneuvering Decision of Unmanned Combat Aerial Vehicles

Ruan, Juntao; Qin, Yi; Wang, Fei; Huang, Jianjun; Wang, Fujie; Guo, Fang; Hu, Yaohua

Research on Deep Q-Network Hybridization with Extended Kalman Filter in Maneuvering Decision of Unmanned Combat Aerial Vehicles

Juntao Ruan, Yi Qin (), Fei Wang, Jianjun Huang, Fujie Wang, Fang Guo and Yaohua Hu
Additional contact information
Juntao Ruan: School of Electrical Engineering and Intelligentization, Dongguan University of Technology, Dongguan 523000, China
Yi Qin: School of Electrical Engineering and Intelligentization, Dongguan University of Technology, Dongguan 523000, China
Fei Wang: School of Electronic and Information Engineering, Harbin Institute of Technology, Shenzhen 518060, China
Jianjun Huang: College of Electronics and Information Engineering, Shenzhen University, Shenzhen 518060, China
Fujie Wang: School of Electrical Engineering and Intelligentization, Dongguan University of Technology, Dongguan 523000, China
Fang Guo: School of Electrical Engineering and Intelligentization, Dongguan University of Technology, Dongguan 523000, China
Yaohua Hu: School of Electrical Engineering and Intelligentization, Dongguan University of Technology, Dongguan 523000, China

Mathematics, 2024, vol. 12, issue 2, 1-14

Abstract: To adapt to the development trend of intelligent air combat, it is necessary to research the autonomous generation of maneuvering decisions for unmanned combat aerial vehicles (UCAV). This paper presents a maneuver decision-making method for UCAV based on a hybridization of deep Q-network (DQN) and extended Kalman filtering (EKF). Firstly, a three-dimensional air combat simulation environment is constructed, and a flight motion model of UCAV is designed to meet the requirements of the simulation environment. Secondly, we evaluate the current situation of UCAV based on their state variables in air combat, for further network learning and training to obtain the optimal maneuver strategy. Finally, based on the DQN, the system state equation is constructed using the uncertain parameter values of the current network, and the observation equation of the system is constructed using the parameters of the target network. The optimal parameter estimation value of the DQN is obtained by iteratively updating the solution through EKF. Simulation experiments have shown that this autonomous maneuver decision-making method hybridizing DQN with EKF is effective and reliable, as it can eliminate the opponent and preserve its side.

Keywords: maneuvering decision; DQN; EKF; intelligent air combat; UCAV (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2024
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2227-7390/12/2/261/pdf (application/pdf)
https://www.mdpi.com/2227-7390/12/2/261/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:12:y:2024:i:2:p:261-:d:1318404

Access Statistics for this article

Mathematics is currently edited by Ms. Emma He

More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().