A Hierarchical Multi-Agent Collaborative Decision-Making Method Based on the Actor-Critic Framework

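Fu Yanfang (傅妍芳)¹, Lei Kailin (雷凯麟)¹, Wei Jianing (魏佳宁)², Cao Zijian (曹子建)¹, Yang Bo (杨博)¹, Wang Wei (王炜)³, Sun Zelong (孙泽龙)⁴, Li Qinjie (李秦洁)¹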


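Author information

1. School of Computer Science and Engineering, Xi'an Technological University (西安工业大学), Xi'an 710021, Shaanxi, China
2. Beijing Institute of Mechanical and Electrical Engineering (北京机电工程研究所), Beijing 100083, China
3. Unit 95810 (95810部队), Beijing 100076, China
4. School of Armament Science and Technology, Xi'an Technological University (西安工业大学), Xi'an 710021, Shaanxi, China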

Abstract

To address improper task allocation and weak decision consistency in multi-agent collaborative decision-making in complex operational environments, a hierarchical multi-agent collaborative decision-making method based on the actor-critic (AC) framework is proposed. The method divides the decision-making process into levels and uses the AC framework for information exchange and decision coordination among the agents, with the aim of improving decision efficiency and combat effectiveness. At the higher level, the top-level agents formulate task decisions, decomposing the overall task and assigning the subtasks to the lower-level agents. At the lower level, the agents make action decisions based on their subtasks and feed the results back to the higher level. Experimental results show that the proposed method performs well in a variety of operational simulation scenarios, indicating its potential to improve collaborative decision-making in military operations.
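The two-level loop described in the abstract (a top-level agent assigns subtasks, lower-level agents act on them and feed results back) can be illustrated with a minimal sketch. This is not the authors' architecture: the toy problem, the stateless bandit-style `ActorCritic` class, and the names `ASSIGNMENTS`, `TARGET_ACTION`, `SKILL`, and `train` are all assumptions introduced here, and the information-sharing mechanism of the paper is not modeled.

```python
import math
import random

def softmax(prefs):
    m = max(prefs)
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]

class ActorCritic:
    """Stateless actor-critic: a softmax actor plus a scalar critic baseline."""
    def __init__(self, n_choices, actor_lr=0.1, critic_lr=0.1):
        self.prefs = [0.0] * n_choices  # actor parameters (choice preferences)
        self.value = 0.0                # critic estimate of the expected reward
        self.actor_lr = actor_lr
        self.critic_lr = critic_lr

    def act(self, rng):
        probs = softmax(self.prefs)
        return rng.choices(range(len(probs)), weights=probs)[0]

    def update(self, choice, reward):
        advantage = reward - self.value           # critic's error signal
        self.value += self.critic_lr * advantage  # move the baseline toward the reward
        probs = softmax(self.prefs)
        for i in range(len(self.prefs)):
            grad = (1.0 if i == choice else 0.0) - probs[i]  # d log pi(choice) / d pref_i
            self.prefs[i] += self.actor_lr * advantage * grad

# Hypothetical toy problem: 2 lower-level agents, 2 subtasks, 3 primitive actions.
ASSIGNMENTS = [(0, 1), (1, 0)]    # ASSIGNMENTS[k][agent] = subtask given to that agent
TARGET_ACTION = [2, 0]            # the action that completes each subtask
SKILL = [[1.0, 0.2], [0.2, 1.0]]  # SKILL[agent][subtask] = payoff when the subtask is completed

def train(episodes=3000, seed=0):
    rng = random.Random(seed)
    top = ActorCritic(len(ASSIGNMENTS))  # high level: chooses a task assignment
    # Low level: one actor-critic per (agent, subtask) pair.
    low = [[ActorCritic(3) for _ in TARGET_ACTION] for _ in SKILL]
    for _ in range(episodes):
        k = top.act(rng)                 # top-level task decision
        team_reward = 0.0
        for agent, subtask in enumerate(ASSIGNMENTS[k]):
            policy = low[agent][subtask]
            action = policy.act(rng)     # lower-level action decision
            reward = SKILL[agent][subtask] if action == TARGET_ACTION[subtask] else 0.0
            policy.update(action, reward)
            team_reward += reward
        top.update(k, team_reward)       # feedback from the lower level to the top level
    return top, low

top, low = train()
assignment_probs = softmax(top.prefs)
print("P(assignment):", [round(p, 3) for p in assignment_probs])
```

In this toy setting the top-level policy is expected to shift toward the assignment that matches each agent to the subtask it handles best, because the summed lower-level rewards are the only signal it receives.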

Key words

deep reinforcement learning/hierarchical multi-agent/information sharing/intelligent war-gaming simulation


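Publication year: 2024

Journal: Acta Armamentarii (兵工学报)

China Ordnance Society (中国兵工学会)

Indexing: CSTPCD, Peking University Core Journals (北大核心)

Impact factor: 0.735

ISSN: 1000-1093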
