中国科学:信息科学(英文版)2024,Vol.67Issue(8) :31-44.DOI:10.1007/s11432-023-4029-9

Autonomous multi-drone racing method based on deep reinforcement learning

Yu KANG Jian DI Ming LI Yunbo ZHAO Yuhui WANG
中国科学:信息科学(英文版)2024,Vol.67Issue(8) :31-44.DOI:10.1007/s11432-023-4029-9

Autonomous multi-drone racing method based on deep reinforcement learning

Yu KANG 1Jian DI 2Ming LI 3Yunbo ZHAO 3Yuhui WANG4
扫码查看

作者信息

  • 1. Department of Automation,University of Science and Technology of China,Hefei 230026,China;Institute of Advanced Technology,University of Science and Technology of China,Hefei 230088,China;Institute of Artificial Intelligence,Hefei Comprehensive National Science Center,Hefei 230088,China
  • 2. Institute of Advanced Technology,University of Science and Technology of China,Hefei 230088,China
  • 3. Department of Automation,University of Science and Technology of China,Hefei 230026,China
  • 4. College of Automation Engineering,Nanjing University of Aeronautics and Astronautics,Nanjing 210016,China
  • 折叠

Abstract

Racing drones have attracted increasing attention due to their remarkable high speed and excel-lent maneuverability.However,autonomous multi-drone racing is quite difficult since it requires quick and agile flight in intricate surroundings and rich drone interaction.To address these issues,we propose a novel autonomous multi-drone racing method based on deep reinforcement learning.A new set of reward functions is proposed to make racing drones learn the racing skills of human experts.Unlike previous methods that required global information about tracks and track boundary constraints,the proposed method requires only limited localized track information within the range of its own onboard sensors.Further,the dynamic re-sponse characteristics of racing drones are incorporated into the training environment,so that the proposed method is more in line with the requirements of real drone racing scenarios.In addition,our method has a low computational cost and can meet the requirements of real-time racing.Finally,the effectiveness and superiority of the proposed method are verified by extensive comparison with the state-of-the-art methods in a series of simulations and real-world experiments.

Key words

racing drone/autonomous multi-drone racing/sim-to-real/deep reinforcement learning/Markov game

引用本文复制引用

基金项目

National Key Research and Development Program of China(2018AAA0100801)

National Natural Science Foundation of China(62033012)

出版年

2024
中国科学:信息科学(英文版)
中国科学院

中国科学:信息科学(英文版)

CSTPCDEI
影响因子:0.715
ISSN:1674-733X
参考文献量1
段落导航相关论文