Curriculum learning with Hindsight Experience Replay for sequential object manipulation tasks

扫码查看

原文链接

NSTL
Elsevier

外文摘要：? 2021 Elsevier LtdLearning complex tasks from scratch is challenging and often impossible for humans as well as for artificial agents. Instead, a curriculum can be used, which decomposes a complex task – the target task – into a sequence of source tasks. Each source task is a simplified version of the next source task with increasing complexity. Learning then occurs gradually by training on each source task while using knowledge from the curriculum's prior source tasks. In this study, we present a new algorithm that combines curriculum learning with Hindsight Experience Replay (HER), to learn sequential object manipulation tasks for multiple goals and sparse feedback. The algorithm exploits the recurrent structure inherent in many object manipulation tasks and implements the entire learning process in the original simulation without adjusting it to each source task. We test our algorithm on three challenging throwing tasks in simulation and show significant improvements compared to vanilla-HER.

外文关键词：

Curriculum learningHindsight Experience ReplayMulti-goal reinforcement learningObject manipulation tasksSparse reward function

作者：

Manela B.、Biess A.

展开 >

作者单位：

Department of Industrial Engineering and Management Ben-Gurion University of the Negev

出版年：

2022

DOI：

10.1016/j.neunet.2021.10.011

Neural Networks

EISCI

ISSN：0893-6080

年,卷(期)：2022.145

被引量6
参考文献量35