loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Authors: Juntao Ruan 1 ; 2 ; Yi Qin 2 ; Fei Wang 3 ; Jianjun Huang 1 ; Fujie Wang 2 ; Fang Guo 2 and Yaohua Hu 2

Affiliations: 1 College of Electronics and Information Engineering, Shenzhen University, Shenzhen, China ; 2 School of Electrical Engineering and Intelligentization, Dongguan University of Technology, Dongguan, China ; 3 Harbin Institute of Technology, Shenzhen, China

Keyword(s): UCAV, Segmented Reward Function, EKF, DQN, Maneuvering Decision-making.

Abstract: Intelligent air combat is the main trend in the future, and the maneuvering decision-making ability of Unmanned combat aerial vehicles (UCAVs) affects the win-lose ending of the air battlefield. In order to study the problem of maneuver decision-making in UCAV 1V1 air combat, this paper proposes a maneuver decision generation algorithm based on the fusion of segmented reward function and improved deep Q-network. Firstly, this paper establishes a real mathematical model problem in the complex environment of air combat, and provides an equation expression that describes the spatial coordinate information, attitude information velocity information of the current state of UCAVs. This expression can provide basic maneuvering action instructions after passing the overload coefficient. Then, a segmented reward function was designed to guide unmanned aerial vehicles to develop towards their own advantages in the turn of aerial combat. Aiming at the problem of parameter uncertainty of deep Q- network (DQN) in maneuver decision-making process, an improved deep Q-network algorithm is proposed in the next. By introducing the extended Kalman filter (EKF), the uncertain parameter values of the strategy network are used to construct the system state equations, the parameters of the target network are used to construct the observation equations of the system, and the optimal parameter estimates of the DQN are obtained through the iterative updating solution of EKF. Simulation experiments show the effectiveness of the designed segmented reward function and the improved deep Q-network algorithm in autonomous maneuvering decision-making for UCAVs. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.119.162.226

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Ruan, J.; Qin, Y.; Wang, F.; Huang, J.; Wang, F.; Guo, F. and Hu, Y. (2024). Research on Maneuver Decision of Unmanned Combat Aerial Vehicles Based on Segmented Reward Function and Improved Deep Q-Network. In Proceedings of the 1st International Conference on Data Mining, E-Learning, and Information Systems - DMEIS; ISBN 978-989-758-715-3, SciTePress, pages 94-101. DOI: 10.5220/0012911900004536

@conference{dmeis24,
author={Juntao Ruan. and Yi Qin. and Fei Wang. and Jianjun Huang. and Fujie Wang. and Fang Guo. and Yaohua Hu.},
title={Research on Maneuver Decision of Unmanned Combat Aerial Vehicles Based on Segmented Reward Function and Improved Deep Q-Network},
booktitle={Proceedings of the 1st International Conference on Data Mining, E-Learning, and Information Systems - DMEIS},
year={2024},
pages={94-101},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012911900004536},
isbn={978-989-758-715-3},
}

TY - CONF

JO - Proceedings of the 1st International Conference on Data Mining, E-Learning, and Information Systems - DMEIS
TI - Research on Maneuver Decision of Unmanned Combat Aerial Vehicles Based on Segmented Reward Function and Improved Deep Q-Network
SN - 978-989-758-715-3
AU - Ruan, J.
AU - Qin, Y.
AU - Wang, F.
AU - Huang, J.
AU - Wang, F.
AU - Guo, F.
AU - Hu, Y.
PY - 2024
SP - 94
EP - 101
DO - 10.5220/0012911900004536
PB - SciTePress