Targeted multi-agent communication algorithm based on state control

扫码查看

原文链接

万方数据
维普

外文摘要：As an important mechanism in multi-agent interaction,communication can make agents form complex team relationships rather than constitute a simple set of multiple independent agents.However,the existing communication schemes can bring much timing redundancy and irrelevant messages,which seriously affects their practical application.To solve this problem,this paper proposes a targeted multi-agent communication algorithm based on state control(SCTC).The SCTC uses a gating mechanism based on state control to reduce the timing redundancy of communication between agents and determines the interaction relationship between agents and the importance weight of a communication message through a series connection of hard-and self-attention mechanisms,realizing targeted communication message processing.In addition,by minimizing the difference between the fusion message generated from a real communication message of each agent and a fusion message generated from the buffered message,the correctness of the final action choice of the agent is ensured.Our evaluation using a challenging set of StarCraft Ⅱ benchmarks indicates that the SCTC can significantly improve the learning performance and reduce the communication overhead between agents,thus ensuring better cooperation between agents.

外文关键词：

Multi-agent deep reinforcement learningState controlTargeted interactionCommunication mechanism

作者：

Li-yang Zhao、Tian-qing Chang、Lei Zhang、Jie Zhang、Kai-xuan Chu、De-peng Kong

展开 >

作者单位：

Department of Weaponry and Control,Army Academy of Armored Forces,Beijing,100072,China

Unit 92942,Beijing,100161,China

出版年：

2024

DOI：

10.1016/j.dt.2022.07.005

防务技术

中国兵工学会

防务技术

CSTPCD

影响因子：0.358

ISSN：2214-9147

年,卷(期)：2024.(1)

参考文献量40