Data-driven adaptive dynamic programming air combat decision method
Technical Field
The invention belongs to the technical field of unmanned aerial vehicles, and particularly relates to a data-driven adaptive dynamic programming air combat decision method.
Background
Air combat decision making for an unmanned combat aerial vehicle aims to let the vehicle gain the advantage, or escape from a disadvantage, during an engagement; the key research problem is to design an efficient autonomous decision mechanism. Autonomous decision making for the unmanned combat aerial vehicle is a mechanism that formulates a tactical plan or selects flight actions in real time according to the actual combat environment, and the sophistication of this decision mechanism reflects the intelligence level of the unmanned combat aerial vehicle in modern air combat. The inputs of the autonomous decision mechanism are the various parameters related to air combat, such as the flight parameters of the aircraft, weapon parameters, three-dimensional scene parameters, and the relative relationship between friendly and enemy aircraft; the decision process is the information processing and computation inside the system; and the output is a tactical plan or specific flight actions.
Adaptive dynamic programming integrates the ideas of dynamic programming and reinforcement learning: it inherits the advantages of dynamic programming while overcoming the curse of dimensionality that dynamic programming suffers from. Its principle is to approximate the performance function and the control policy of traditional dynamic programming with function approximation structures, and to obtain the optimal value function and control policy by means of reinforcement learning, so as to satisfy the Bellman optimality principle. The idea of adaptive dynamic programming is illustrated in fig. 1.
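For reference, the optimality condition that adaptive dynamic programming approximates can be written as the standard continuous-time Bellman (Hamilton-Jacobi-Bellman) equation below; this is textbook background, not an equation quoted from the invention:

```latex
% Standard continuous-time HJB optimality condition approximated by ADP:
% a critic network approximates V^*, an actor network approximates u^*.
0 = \min_{u}\left[ r(x,u) + \nabla V^{*}(x)^{\top} f(x,u) \right],
\qquad
u^{*}(x) = \arg\min_{u}\left[ r(x,u) + \nabla V^{*}(x)^{\top} f(x,u) \right].
```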
Air combat decision making is a complex task involving a large amount of information and many variables, which makes conventional manually crafted decision rules difficult to adapt to a changing battlefield environment. Existing air combat decision methods therefore often exhibit the following problems:
1. Static planning methods cannot cope with dynamic environments: conventional decision methods are generally based on preset rules or models and struggle to adapt to a battlefield environment and enemy situation that change in real time.
2. Manual decision making requires substantial time and effort: the decision process must handle a large amount of information and many variables, consumes considerable time and effort, and is prone to omission and misjudgment.
3. Lack of comprehensive consideration and flexible responsiveness: conventional decision methods typically decide on the basis of a single factor or a small number of factors; it is difficult to weigh multiple factors comprehensively and respond flexibly, which may lead to biased or inaccurate decisions.
4. Inability to meet the needs of informationized warfare: the modern air combat environment involves a large volume of rapidly changing information, and the traditional approach of manually crafting decision rules cannot meet the requirements of informationized warfare.
Disclosure of Invention
The invention aims to provide a data-driven adaptive dynamic programming air combat decision method, which mainly solves the problem that traditional manually crafted decision rules are difficult to adapt to a continuously changing battlefield environment.
In order to achieve the above purpose, the technical scheme adopted by the invention is as follows:
A data-driven adaptive dynamic programming air combat decision method comprises the following steps:
S1, assuming that the two combat unmanned aerial vehicles are a red unmanned aerial vehicle and a blue unmanned aerial vehicle, and establishing pursuit-evasion problem system models for the red-pursuit/blue-escape and red-escape/blue-pursuit problems, respectively;
S2, solving the unmanned aerial vehicle pursuit-evasion problem by model-free adaptive dynamic programming, and improving the policy by a bounded exploration signal;
S3, acquiring the real-time control laws of the red and blue unmanned aerial vehicles by an offline neural-network model training algorithm, and collecting the state information of the red and blue unmanned aerial vehicles in real time;
And S4, updating the neural network online through an online model training algorithm, realizing the adaptive dynamic programming air combat decision of the red and blue unmanned aerial vehicles in the pursuit-evasion problem.
Further, in the invention, the red-pursuit/blue-escape problem model is established as follows:
The real-time position of the red unmanned aerial vehicle is Xr(t) and the position of the blue unmanned aerial vehicle is Xb(t); the position difference between the two sides is then:
e=Xb(t)-Xr(t) (1)
the tracking-error system is:
ė(t)=Ẋb(t)-Ẋr(t) (2)
wherein ė denotes the time derivative of the position difference e, and Ẋb(t), Ẋr(t) denote the time derivatives of the real-time positions Xb(t), Xr(t) of the blue and red unmanned aerial vehicles, respectively;
assuming that the red pursuer can only measure the three-dimensional movement velocity of the blue unmanned aerial vehicle, equation (2) can be written specifically as:
The system model of the red side pursuing the blue side is expressed as:
wherein Vr is the speed of the red unmanned aerial vehicle, in Mach; χr is its heading angle, in radians; γr is its track inclination angle, in radians; the corresponding time derivatives appear on the left-hand sides of the model; the distance errors ex, ey, ez are in km; g is the gravitational acceleration; Vc is the speed of sound; and nx, ny, nz are the overload control quantities of the red unmanned aerial vehicle.
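Since equations (3)-(4) are not reproduced above, the following sketch illustrates what such a pursuit model can look like in code, assuming standard three-degree-of-freedom point-mass kinematics for the state [Vr, χr, γr, ex, ey, ez]; the exact derivative expressions, constants, and function names here are illustrative assumptions, not the patent's equations.

```python
import numpy as np

G = 9.81    # gravitational acceleration, m/s^2
VC = 340.0  # speed of sound, m/s (the speed state Vr is in Mach)

def pursuit_dynamics(x, u, vb):
    """Sketch of the red-pursuit tracking-error dynamics.
    x  = [Vr, chi_r, gamma_r, ex, ey, ez] (Mach, rad, rad, km, km, km)
    u  = [nx, ny, nz] overload controls
    vb = measured blue velocity vector, km/s (the only blue quantity
         assumed observable, per the measurement assumption above)."""
    Vr, chi, gam, ex, ey, ez = x
    nx, ny, nz = u
    v = Vr * VC                              # red true airspeed, m/s
    dVr = G * (nx - np.sin(gam)) / VC        # Mach/s (assumed form)
    dchi = G * ny / (v * np.cos(gam))        # rad/s (assumed form)
    dgam = G * (nz - np.cos(gam)) / v        # rad/s (assumed form)
    # error e = Xb - Xr, hence de/dt = Vb - (red velocity vector)
    vr_vec = v * np.array([np.cos(gam) * np.cos(chi),
                           np.cos(gam) * np.sin(chi),
                           np.sin(gam)]) / 1000.0   # km/s
    de = vb - vr_vec
    return np.concatenate(([dVr, dchi, dgam], de))
```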
Further, in the invention, the red-escape/blue-pursuit problem model is established as follows:
A virtual-displacement method is adopted to minimize the distance between the own aircraft's reversed displacement and the enemy aircraft, which achieves the effect of maximizing the distance between the own position and the enemy position; the virtual displacement is the displacement produced by the virtual velocity V', namely:
The system model of red-escape/blue-pursuit is expressed as:
wherein the symbols and their meanings are the same as in the pursuit problem.
Further, in the invention, the red-pursuit/blue-escape problem system model is processed as follows:
S11, the nonlinear continuous state-space equation of the unmanned aerial vehicle is abbreviated as:
wherein x=[Vr,χr,γr,ex,ey,ez]T represents the red aircraft state vector, with its time derivative on the left-hand side; u=[nx,ny,nz]T represents the red aircraft control vector; and F(x), G(x) are respectively:
S12, defining a performance index function as:
wherein Q(x,t) is an index function related to the state, and R(u,t) is an index function related to the control quantity;
s13, establishing an angle dominance function of the unmanned aerial vehicle, and setting a speed vector of the unmanned aerial vehicle of the red party as follows:
V=[cosγrcosχr,cosγrsinχr,sinγr]T,
the blue-side unmanned aerial vehicle speed vector is:
Vb=[cosγbcosχb,cosγbsinχb,sinγb]T,
The distance vector from the red unmanned aerial vehicle to the blue unmanned aerial vehicle is erb=[ex,ey,ez]T, and the geometric relationship is as follows:
Obtaining an angle dominance function:
Qα=cαr+(1-c)αb (9)
wherein c=(αr+αb)/(2π);
s14, defining a distance dominance function as follows:
Qd=eTQ1e (10)
wherein e=[ex,ey,ez]T and Q1 is a positive-definite weight matrix;
The state index function of the red side can be expressed as:
Q(x,t)=Qd+Q2Qα (11)
Wherein Q2 is a weight coefficient;
S15, defining a controller index function as:
R(u,t)=(u-u0)TR(u-u0) (12)
wherein R is the control-quantity weight matrix, and u0=[sinγr,0,cosγr]T is the control quantity of the unmanned aerial vehicle in steady flight.
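As a concrete illustration of the index functions in S13-S15, the sketch below assembles the angle, distance, and control cost terms of equations (9)-(12); constructing αr and αb as arccos angles between the velocity directions and the line of sight is an assumption inferred from fig. 3, and the weights Q1, Q2, R are illustrative.

```python
import numpy as np

def state_cost(vr_dir, vb_dir, e, Q1, Q2):
    """Q(x,t) = Qd + Q2*Qalpha, per equations (9)-(11).
    vr_dir, vb_dir: unit velocity-direction vectors of red and blue;
    e = [ex, ey, ez], the red-to-blue distance vector.
    The arccos construction of the angles is an assumed reading of fig. 3."""
    los = e / np.linalg.norm(e)
    alpha_r = np.arccos(np.clip(vr_dir @ los, -1.0, 1.0))
    alpha_b = np.arccos(np.clip(vb_dir @ los, -1.0, 1.0))
    c = (alpha_r + alpha_b) / (2 * np.pi)      # dynamic weight of eq. (9)
    Q_alpha = c * alpha_r + (1 - c) * alpha_b  # angle dominance, eq. (9)
    Q_d = e @ Q1 @ e                           # distance dominance, eq. (10)
    return Q_d + Q2 * Q_alpha                  # state index, eq. (11)

def control_cost(u, gamma_r, R):
    """R(u,t) = (u - u0)^T R (u - u0), eq. (12), penalizing deviation
    from the steady-flight control u0 = [sin(gamma_r), 0, cos(gamma_r)]."""
    du = np.asarray(u) - np.array([np.sin(gamma_r), 0.0, np.cos(gamma_r)])
    return du @ R @ du
```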
Further, the specific implementation method of the step S2 in the present invention is as follows:
A bounded exploration signal ue is defined, and the system model of the red unmanned aerial vehicle, equation (5), is rewritten as:
The performance index function is:
The derivative of the performance index function (7) with respect to time is expressed as:
When the performance index function (7) attains its minimum value, the following Bellman equation is satisfied:
wherein R(j)=Q(x,t)+R(u,t); combining equations (17) and (18), it is possible to obtain:
The optimal control quantity of the real system is as follows:
Solving for G(x) from equation (20) and substituting into equation (19) yields:
Integrating both ends of equation (21) from t0 to t yields:
A neural network is employed to approximate the cost function and the control input, namely:
wherein Wc and Wa are the ideal neural-network weights of the evaluation network and the execution network, respectively; L1 and L2 are the numbers of hidden-layer neurons of the evaluation network and the execution network, respectively; φc and φa denote the activation functions of the two networks; and εc and εa denote their reconstruction errors;
Let the estimated outputs of the evaluation network and the execution network be:
wherein Ŵc and Ŵa are the estimated values of the ideal neural-network weights Wc, Wa, respectively; substituting equation (24) into equation (22) yields the following residual errors:
wherein the control quantity obtained by policy improvement has the expression:
wherein Ω is the exploration set of control quantities, obtained by adding bounded random exploration signals; the evaluation-network weight estimate Ŵc is optimized by a least-squares algorithm, namely:
The execution-network weight estimate Ŵa is likewise optimized by the least-squares algorithm:
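The policy-improvement and least-squares steps of equations (26)-(28) can be sketched as follows; the way the exploration set Ω is sampled, the regressor matrix, and the use of numpy.linalg.lstsq are illustrative assumptions, since the exact regressors of equations (27)-(28) are not reproduced above.

```python
import numpy as np

def improve_policy(u_nominal, cost_to_go, bound, n_candidates=64, rng=None):
    """Eq. (26) sketch: form the exploration set Omega by adding bounded
    random exploration signals to the current actor output, then keep the
    candidate control with the smallest estimated cost-to-go."""
    rng = rng or np.random.default_rng()
    noise = rng.uniform(-bound, bound, size=(n_candidates, len(u_nominal)))
    omega = u_nominal + noise                      # exploration set Omega
    costs = [cost_to_go(u) for u in omega]
    return omega[int(np.argmin(costs))]

def least_squares_weights(Phi, targets):
    """Eqs. (27)-(28) sketch: fit network weights W minimizing
    ||Phi @ W - targets||^2 over the collected data batch."""
    W, *_ = np.linalg.lstsq(Phi, targets, rcond=None)
    return W
```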
Further, in step S3 of the present invention, the offline neural-network model training algorithm comprises the following steps:
S31: by giving different initial states, a data set {xk(t0)} is obtained, and the network weights are initialized with the iteration index j = 0;
S32: obtain the control quantities corresponding to the states according to equation (26), forming the state-control data set;
S33: using the data set, update the evaluation-network weights according to equation (27) and the execution-network weights according to equation (28);
S34: terminate the algorithm if the weight changes of both networks fall below the convergence accuracies ∈a and ∈c; otherwise set j = j + 1 and return to step S32.
Further, in step S4, the neural network is updated online by the online model training algorithm as follows:
S41: the current neural-network weights are Wc, Wa and the online learning rate is α; a real-time data set {x(t), u(t)} is obtained by sampling at a fixed time interval δt, and step S42 is executed after several groups of data have been collected;
S42: obtain the control quantities corresponding to the states according to equation (26), forming the state-control data set;
S43: using the data set, compute the evaluation-network update according to equation (27) and the execution-network update according to equation (28);
S44: update the neural-network weights online with learning rate α, then return to step S41.
Drawings
Fig. 1 is a diagram of a prior art adaptive dynamic programming architecture.
FIG. 2 is a flow chart of the present invention.
Fig. 3 is a schematic view of the angular advantage of the unmanned aerial vehicle according to the embodiment of the present invention.
Fig. 4 is a schematic diagram of virtual displacement principle in an embodiment of the present invention.
Detailed Description
The invention is further illustrated below through the description of an embodiment; the invention includes, but is not limited to, this embodiment.
As shown in fig. 2, in the data-driven adaptive dynamic programming air combat decision method disclosed by the invention, a pursuit-evasion scenario contains one pursuer and one evader; in this embodiment, the two sides are denoted the red side and the blue side. The problem is described here in terms of red pursuit and blue escape: the red unmanned aerial vehicle reduces its distance to the blue unmanned aerial vehicle through maneuvering while avoiding capture by the blue unmanned aerial vehicle, i.e., avoiding being pointed at by the nose of the blue unmanned aerial vehicle and thereby falling into an inferior situation.
In this embodiment, pursuit-evasion problem system models are established for the red-pursuit/blue-escape problem and the red-escape/blue-pursuit problem, respectively.
First, denote the real-time position of the red unmanned aerial vehicle as Xr(t) and the position of the blue unmanned aerial vehicle as Xb(t); the position difference between the two sides is then:
e=Xb(t)-Xr(t) (1)
the tracking-error system is:
ė(t)=Ẋb(t)-Ẋr(t) (2)
wherein ė denotes the time derivative of the position difference e, and Ẋb(t), Ẋr(t) denote the time derivatives of the real-time positions Xb(t), Xr(t) of the blue and red unmanned aerial vehicles, respectively;
assuming that the red pursuer can only measure the three-dimensional movement velocity of the blue unmanned aerial vehicle, equation (2) can be written specifically as:
The system model of the red side pursuing the blue side is expressed as:
wherein Vr is the speed of the red unmanned aerial vehicle, in Mach; χr is its heading angle, in radians; γr is its track inclination angle, in radians; the corresponding time derivatives appear on the left-hand sides of the model; the distance errors ex, ey, ez are in km; g is the gravitational acceleration; Vc is the speed of sound; and nx, ny, nz are the overload control quantities of the red unmanned aerial vehicle, which are subject to saturation.
For convenience of description, the nonlinear continuous state space equation of the unmanned aerial vehicle is abbreviated as
where x=[Vr,χr,γr,ex,ey,ez]T represents the red aircraft state vector, with its time derivative on the left-hand side; u=[nx,ny,nz]T represents the red aircraft control vector; and F(x), G(x) are respectively:
Because the unmanned aerial vehicle pursuit-evasion problem is a nonlinear optimal control problem with actuator saturation, the performance index function is defined as follows:
wherein Q(x,t) is an index function related to the state, and R(u,t) is an index function related to the control quantity;
Next, the angle dominance function of the unmanned aerial vehicle is established. Let the velocity vector of the red unmanned aerial vehicle be:
Vr=[cosγrcosχr,cosγrsinχr,sinγr]T,
the blue-side unmanned aerial vehicle speed vector is:
Vb=[cosγbcosχb,cosγbsinχb,sinγb]T,
The distance vector from the red unmanned aerial vehicle to the blue unmanned aerial vehicle is erb=[ex,ey,ez]T, as shown in fig. 3, and the geometric relationship is as follows:
During air combat, it is desirable that αr and αb be as small as possible so that the red side gains the angular situational advantage. Taking the red side as an example, when αr-(π-αb) < 0, i.e. αr+αb < π, the red side's attack angle is dominant; conversely, if αr+αb > π, the red side's attack angle is at a disadvantage; when αr+αb = π, the attack angles are in equilibrium. The angle dominance function is set as:
Qα=cαr+(1-c)αb (9)
wherein c=(αr+αb)/(2π); the optimization priority between the angles αr and αb can be dynamically adjusted through the weight c: when c < 0.5, the red side's attack angle is dominant, and αb is optimized with emphasis to prevent the blue side from gaining an advantageous angular situation; when c > 0.5, the red side's attack angle is at a disadvantage, and αr should be optimized with emphasis so that the red side gains an advantageous angular situation.
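A small worked example of this weighting, with illustrative angle values:

```python
import numpy as np

# Illustrative values: red is better aligned with the line of sight.
alpha_r, alpha_b = np.pi / 6, np.pi / 3    # 30 and 60 degrees
c = (alpha_r + alpha_b) / (2 * np.pi)      # = 0.25 < 0.5: red dominant
Q_alpha = c * alpha_r + (1 - c) * alpha_b  # eq. (9); emphasis falls on alpha_b
print(c, Q_alpha)                          # 0.25, ~0.916 rad
```

Because c < 0.5 here, the larger weight 1 - c = 0.75 multiplies αb, matching the rule above that the blue angle is optimized with emphasis when the red side holds the angular advantage.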
In the pursuit problem, the goal of the red side is to shorten the distance to the blue side; the distance dominance function is therefore defined as:
Qd=eTQ1e (10)
wherein e=[ex,ey,ez]T and Q1 is a positive-definite weight matrix;
The state index function of the red side can then be expressed as:
Q(x,t)=Qd+Q2Qα (11)
wherein Q2 is a weight coefficient;
In order to meet the control-limit requirement and keep the unmanned aerial vehicle stable in steady flight, the controller index function is defined as follows:
R(u,t)=(u-u0)TR(u-u0) (12)
wherein R is the control-quantity weight matrix, and u0=[sinγr,0,cosγr]T is the control quantity of the unmanned aerial vehicle in steady flight.
For the establishment of the red-escape/blue-pursuit problem model, the escape problem differs from the pursuit problem in that the objective function is the opposite: to maximize the distance between the two aircraft. Meanwhile, in order to evade a missile, when the distance between the unmanned aerial vehicle and the missile becomes small, the unmanned aerial vehicle needs to change its heading and climb angle with large maneuvers. To handle the distance-maximization objective, a virtual-displacement method is adopted: minimizing the distance between the own aircraft's reversed displacement and the enemy aircraft achieves the effect of maximizing the distance between the own position and the enemy position.
As shown in fig. 4, the own aircraft is pursued by the enemy aircraft and aims to maximize the distance between them. A virtual velocity V' is taken opposite in direction to the own velocity vector V, and the distance between the virtual displacement and the enemy aircraft is minimized. The virtual displacement is the displacement produced by V', namely:
The system model of red-escape/blue-pursuit is expressed as:
wherein the symbols and their meanings are the same as in the pursuit problem.
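To make the virtual-displacement idea concrete, the sketch below converts the escape objective into an equivalent pursuit problem by tracking a point displaced along the reversed own-velocity vector; the construction details (time step, error definition) are illustrative assumptions consistent with fig. 4.

```python
import numpy as np

def escape_as_pursuit_error(x_own, v_own, x_enemy, dt):
    """Fig. 4 sketch: replace 'maximize distance to the enemy' with
    'minimize distance between a virtual displacement and the enemy'.
    The virtual velocity V' = -V generates the virtual displacement, so
    the escape problem reuses the pursuit machinery unchanged."""
    v_virtual = -np.asarray(v_own)            # reversed velocity V'
    x_virtual = np.asarray(x_own) + v_virtual * dt
    return np.asarray(x_enemy) - x_virtual    # error driven to zero
```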
Generally, an accurate unmanned aerial vehicle system model cannot be obtained in actual operation; moreover, existing data-based model-free adaptive dynamic programming depends heavily on the data, and policy improvement cannot be performed on the basis of the existing data. Therefore, this embodiment solves the unmanned aerial vehicle pursuit-evasion problem with model-free adaptive dynamic programming and improves the policy with bounded exploration signals.
A bounded exploration signal ue is defined, and the system model of the red unmanned aerial vehicle, equation (5), is rewritten as:
The performance index function is:
The derivative of the performance index function (7) with respect to time is expressed as:
When the performance index function (16) attains its minimum value, the following Bellman equation is satisfied:
wherein R(j)=Q(x,t)+R(u,t); combining equations (17) and (18), it is possible to obtain:
The optimal control quantity of the real system is as follows:
Solving for G(x) from equation (20) and substituting into equation (19) yields:
Integrating both ends of equation (21) from t0 to t yields:
A neural network is employed to approximate the cost function and the control input, namely:
wherein Wc and Wa are the ideal neural-network weights of the evaluation network and the execution network, respectively; L1 and L2 are the numbers of hidden-layer neurons of the evaluation network and the execution network, respectively; φc and φa denote the activation functions of the two networks; and εc and εa denote their reconstruction errors.
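The evaluation (critic) and execution (actor) approximators of equations (23)-(24) can be sketched as single-hidden-layer networks, as below; the tanh basis, the fixed random hidden layers, and the layer sizes are illustrative assumptions.

```python
import numpy as np

class CriticActor:
    """Sketch of eqs. (23)-(24): V(x) ≈ Wc^T phi_c(x), u(x) ≈ Wa^T phi_a(x).
    The tanh hidden layers and sizes L1, L2 are assumptions."""
    def __init__(self, n_state=6, n_ctrl=3, L1=20, L2=20, seed=0):
        rng = np.random.default_rng(seed)
        self.Pc = rng.standard_normal((L1, n_state))  # fixed critic basis
        self.Pa = rng.standard_normal((L2, n_state))  # fixed actor basis
        self.Wc = np.zeros(L1)                        # evaluation weights
        self.Wa = np.zeros((L2, n_ctrl))              # execution weights

    def value(self, x):
        """Evaluation (critic) network output, eq. (23)."""
        return self.Wc @ np.tanh(self.Pc @ x)

    def control(self, x):
        """Execution (actor) network output, eq. (24)."""
        return self.Wa.T @ np.tanh(self.Pa @ x)
```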
Let the estimated outputs of the evaluation network and the execution network be:
wherein Ŵc and Ŵa are the estimated values of the ideal neural-network weights Wc, Wa, respectively; substituting equation (24) into equation (22) yields the following residual errors:
wherein the control quantity obtained by policy improvement has the expression:
wherein Ω is the exploration set of control quantities, obtained by adding bounded random exploration signals; the evaluation-network weight estimate Ŵc is optimized by a least-squares algorithm, namely:
The execution-network weight estimate Ŵa is likewise optimized by the least-squares algorithm:
In this embodiment, an offline neural-network model training algorithm is adopted to obtain the real-time control laws of the red and blue unmanned aerial vehicles, and the control-law information and the state information of the red and blue unmanned aerial vehicles are collected in real time. The algorithm specifically comprises the following steps:
S31: by giving different initial states, a data set {xk(t0)} is obtained, and the network weights are initialized with the iteration index j = 0;
S32: obtain the control quantities corresponding to the states according to equation (26), forming the state-control data set;
S33: using the data set, update the evaluation-network weights according to equation (27) and the execution-network weights according to equation (28);
S34: terminate the algorithm if the weight changes of both networks fall below the convergence accuracies ∈a and ∈c; otherwise set j = j + 1 and return to step S32.
In this embodiment, the neural network is updated online at fixed intervals through an online model training algorithm, realizing the adaptive dynamic programming air combat decision of the red and blue unmanned aerial vehicles in the pursuit-evasion problem. The algorithm specifically comprises the following steps:
S41: the current neural-network weights are Wc, Wa and the online learning rate is α; a real-time data set {x(t), u(t)} is obtained by sampling at a fixed time interval δt, and step S42 is executed after several groups of data have been collected;
S42: obtain the control quantities corresponding to the states according to equation (26), forming the state-control data set;
S43: using the data set, compute the evaluation-network update according to equation (27) and the execution-network update according to equation (28);
S44: update the neural-network weights online with learning rate α, then return to step S41.
Through the above method, the capability of online adaptive policy adjustment is improved, and the adaptability of unmanned aerial vehicle air combat decision making in different scenarios is enhanced. The method does not depend on an aircraft system model, has strong generalization capability, and can be extended to the control of other equipment, such as unmanned ground vehicles and robotic manipulators. The invention thus provides a significant and substantial improvement over the prior art.
The above embodiment is only one of the preferred embodiments of the present invention and should not be used to limit the scope of protection of the present invention; any insubstantial modification or change made within the main design concept and spirit of the present invention that solves the same technical problem remains within the scope of protection of the present invention.