兵工学报2024,Vol.45Issue(10) :3385-3396.DOI:10.12382/bgxb.2023.0862

基于演员-评论家框架的层次化多智能体协同决策方法

A Hierarchical Multi-Agent Collaborative Decision-making Method Based on the Actor-critic Framework

傅妍芳 雷凯麟 魏佳宁 曹子建 杨博 王炜 孙泽龙 李秦洁
兵工学报2024,Vol.45Issue(10) :3385-3396.DOI:10.12382/bgxb.2023.0862

基于演员-评论家框架的层次化多智能体协同决策方法

A Hierarchical Multi-Agent Collaborative Decision-making Method Based on the Actor-critic Framework

傅妍芳 1雷凯麟 1魏佳宁 2曹子建 1杨博 1王炜 3孙泽龙 4李秦洁1
扫码查看

作者信息

  • 1. 西安工业大学计算机科学与工程学院,陕西西安 710021
  • 2. 北京机电工程研究所,北京 100083
  • 3. 95810部队,北京 100076
  • 4. 西安工业大学兵器科学与技术学院,陕西西安 710021
  • 折叠

摘要

针对复杂作战环境下多智能体协同决策中出现的任务分配不合理、决策一致性较差等问题,提出一种基于演员-评论家(Actor-Critic,AC)框架的层次化多智能体协同决策方法.通过将决策过程分为不同层次,并使用AC框架来实现智能体之间的信息交流和决策协同,以提高决策效率和战斗力.在高层次,顶层智能体制定任务决策,将总任务分解并分配给底层智能体.在低层次,底层智能体根据子任务进行动作决策,并将结果反馈给高层次.实验结果表明,所提方法在多种作战仿真场景下均取得了较好的性能,展现了其在提升军事作战协同决策能力方面的潜力.

Abstract

A hierarchical multi-agent collaborative decision-making method based on the actor-critic(AC)frameworkis proposed to address the issues of improper task allocation and weak decision consistency in the collaborative decision-making of multiple agents in complex operational environments.The proposed method divides the decision-making process into different levels and utilizes the AC framework to facilitate information exchange and decision coordination among the agents,thereby enhancing thedecision efficiency and combat effectiveness.At the higher level,the top-level agents formulate thetask decisions by decomposing and assigning overall tasks to the lower-level agents.At the lower level,the lower-level agents make action decisions based on subtasks and provide feedback to the higher level.Experimental results demonstrate that the proposed method performs well in various operational simulation scenarios,showcasing its potential to enhance themilitary operational collaborative decision-making capability.

关键词

深度强化学习/层次化多智能体/信息共享/智能兵棋推演

Key words

deep reinforcement learning/hierarchical multi-agent/information sharing/intelligent war-gaming simulation

引用本文复制引用

出版年

2024
兵工学报
中国兵工学会

兵工学报

CSTPCD北大核心
影响因子:0.735
ISSN:1000-1093
段落导航相关论文