中国科学:信息科学(英文版)2024,Vol.67Issue(8) :45-62.DOI:10.1007/s11432-023-4088-2

UAV swarm air combat maneuver decision-making method based on multi-agent reinforcement learning and transferring

Zhiqiang ZHENG Chen WEI Haibin DUAN
中国科学:信息科学(英文版)2024,Vol.67Issue(8) :45-62.DOI:10.1007/s11432-023-4088-2

UAV swarm air combat maneuver decision-making method based on multi-agent reinforcement learning and transferring

Zhiqiang ZHENG 1Chen WEI 1Haibin DUAN1
扫码查看

作者信息

  • 1. State Key Laboratory of Virtual Reality Technology and Systems,School of Automation Science and Electrical Engineering,Beihang University,Beijing 100083,China
  • 折叠

Abstract

During short-range air combat involving unmanned aircraft vehicle(UAV)swarms,UAVs must make accurate maneuver decisions based on information from both enemy and friendly UAVs.This dual requirement of competition and cooperation presents a significant challenge in the field of unmanned air combat.In this paper,a method based on multi-agent reinforcement learning(MARL)is proposed to address this issue.An actor network containing three subnetworks that can handle different types of situational information is designed.Hence,the results from simpler one-on-one scenarios are leveraged to enhance the complex swarm air combat training process.Separate state spaces for local and global information are designed for the actor and critic networks.A detailed reward function is proposed to encourage participation.To prevent lazy participants in air combat,a reward assignment operation is applied to distribute these dense rewards.Simulation testing and ablation experiments demonstrate that both the transfer operation and reward assignment operation can effectively deal with the swarm air combat scenario,and reflect the effectiveness of the proposed method.

Key words

UAV swarm/short-range air combat/multi-agent reinforcement learning/reward assignment/transfer

引用本文复制引用

基金项目

National Key R&D Program of China(2023YFC3011001)

National Natural Science Foundation of China(U20B2071)

National Natural Science Foundation of China(62350048)

National Natural Science Foundation of China(T2121003)

出版年

2024
中国科学:信息科学(英文版)
中国科学院

中国科学:信息科学(英文版)

CSTPCDEI
影响因子:0.715
ISSN:1674-733X
参考文献量2
段落导航相关论文