中国科学:信息科学(英文版)2024,Vol.67Issue(5) :111-131.DOI:10.1007/s11432-022-3666-2

Perception field based imitation learning for unlabeled multi-agent pathfinding

Wenjie CHU Ailun YU Wei ZHANG Haiyan ZHAO Zhi JIN
中国科学:信息科学(英文版)2024,Vol.67Issue(5) :111-131.DOI:10.1007/s11432-022-3666-2

Perception field based imitation learning for unlabeled multi-agent pathfinding

Wenjie CHU 1Ailun YU 1Wei ZHANG 1Haiyan ZHAO 1Zhi JIN1
扫码查看

作者信息

  • 1. School of Computer Science,Peking University,Beijing 100871,China;Key Laboratory of High-Confidence Software Technologies(Peking University),Ministry of Education,Beijing 100871,China
  • 折叠

Abstract

This paper proposes an imitation learning method to learn a universal agent policy for unlabeled multi-agent pathfinding(unlabeled MAPF)in grid environments.The method transforms the unlabeled MAPF problem into a series of temporal-independent homogeneous classification problems for each agent.Based on this transformation,a neural network is designed to imitate a distance-optimal expert algorithm.The neural network consists of two successive modules:perception field learner and field integrating classifier.The former refines and encodes the current system state into a perception field for each agent by combining a set of learnable field-generating functions.The latter takes an agent's perception field as input and decides the agent's next action based on a triplet cross-attention mechanism.We evaluate our method on a diverse set of unlabeled MAPF tasks.Compared with state-of-the-art counterparts,the experimental results manifest the superiority of the proposed method in both generalization ability and scalability.

Key words

unlabeled multi-agent pathfinding/perception field/triplet cross-attention/multi-agent imita-tion learning/learning-based planning

引用本文复制引用

基金项目

National Natural Science Foundation of China(62192731)

National Natural Science Foundation of China(62192730)

National Natural Science Foundation of China(62190200)

出版年

2024
中国科学:信息科学(英文版)
中国科学院

中国科学:信息科学(英文版)

CSTPCDEI
影响因子:0.715
ISSN:1674-733X
参考文献量48
段落导航相关论文