Sequencing of multi-robot behaviors using reinforcement learning

扫码查看

原文链接

NETL
NSTL
维普
万方数据

外文摘要：Given a collection of parameterized multi-robot controllers associated with individual behaviors designed for particular tasks,this paper considers the problem of how to sequence and instantiate the behaviors for the purpose of completing a more complex,overarching mission.In addition,uncertainties about the environment or even the mission specifications may require the robots to learn,in a cooperative manner,how best to sequence the behaviors.In this paper,we approach this problem by using reinforcement learning to approximate the solution to the computationally intractable sequencing problem,combined with an online gradient descent approach to selecting the individual behavior parameters,while the transitions among behaviors are triggered automatically when the behaviors have reached a desired performance level relative to a task performance cost.To illustrate the effectiveness of the proposed method,it is implemented on a team of differential-drive robots for solving two different missions,namely,convoy protection and object manipulation.

外文关键词：

Multi-robot systemsReinforcement learningDistributed control

作者：

Pietro Pierpaoli、Thinh T.Doan、Justin Romberg、Magnus Egerstedt

展开 >

作者单位：

School of Electrical and Computer Engineering,Georgia Institute of Technology,Atlanta,GA 30332,USA

Department of Electrical and Computer Engineering,Virginia Tech,Blacksburg,VA 24061,USA

Samueli School of Engineering,University of California,Irvine,CA 92697,USA

基金：

Army Research Lab

项目编号：

DCIST CRA W911NF-17-2-0181

出版年：

2021

控制理论与技术(英文版)

华南理工大学

控制理论与技术(英文版)

CSCDEI

影响因子：0.307

ISSN：2095-6983

年,卷(期)：2021.19(4)

参考文献量30