基于改进MAAC算法的多无人机自主路径规划

Multi-UAV Autonomous Path Planning Based on Improved MAAC Algorithm

周从航 ¹李建兴 ¹石宇静 ²林致睿¹

扫码查看

作者信息

1. 福建理工大学电子电气与物理学院,福建福州 350118;福建省工业集成自动化行业技术开发基地,福建福州 350118
2. 福建理工大学电子电气与物理学院,福建福州 350118
折叠

摘要

利用深度强化学习方法对威胁区域环境下多无人机(UAV)自主路径规划问题进行研究.为了解决强化学习算法中普遍存在难以收敛的问题,提出了一种改进的 Actor-Attention-Critic for Multi-Agent Reinforcement Learning(MAAC)算法用于多UAV的自主路径规划.通过建立多UAV势场环境模型定义强化学习的马尔科夫决策过程(Markov Modulated Process,MDP),在动态环境中规划出合理的无碰撞路径.仿真实验验证了所设计的多UAV自主路径规划控制算法的有效性,并通过对比仿真验证了该算法在收敛速度和避免碰撞方面具有更优越的性能.

Abstract

Deep reinforcement learning methods are used in multi-UAV autonomous path planning in threat area environments.In order to solve the common problem of difficult convergence in reinforcement learning algorithms,an improved Actor-Attention-Critic for Multi-Agent Reinforcement Learning(MAAC)algorithm is proposed for multi-UAV autonomous path planning.The Markov decision process of reinforcement learning is defined by modeling the multi-UAV potential field environment to provide a reasonable collision-free path planning in dynamic environment.Simulation experiments validate the effectiveness of the proposed algorithm,and verify its superior performance in terms of convergence speed and collision avoidance through comparative simulations.

关键词

无人机/多智能体深度强化学习/自主路径规划/MAAC算法

Key words

UAV/multi-agent deep reinforcement learning/autonomous path planning/MAAC algorithm

引用本文复制引用

基金项目

福建省自然科学基金(2020J01876)

福建工程学院科研启动基金(GY-Z21215)

福建工程学院科研启动基金(GY-Z21216)

出版年

2024

无线电工程

中国电子科技集团公司第五十四研究所

无线电工程

影响因子：0.667

ISSN：1003-3106

段落导航