信息与电子工程前沿(英文)2024,Vol.25Issue(11) :1446-1465.DOI:10.1631/FITEE.2300668

Domain adaptation in reinforcement learning:a comprehensive and systematic study

Amirfarhad FARHADI Mitra MIRZAREZAEE Arash SHARIFI Mohammad TESHNEHLAB
信息与电子工程前沿(英文)2024,Vol.25Issue(11) :1446-1465.DOI:10.1631/FITEE.2300668

Domain adaptation in reinforcement learning:a comprehensive and systematic study

Amirfarhad FARHADI 1Mitra MIRZAREZAEE 1Arash SHARIFI 1Mohammad TESHNEHLAB2
扫码查看

作者信息

  • 1. Department of Computer Engineering,Science and Research Branch,Islamic Azad University,Tehran 1477893855,Iran
  • 2. Department of Control Engineering,K.N.Toosi University of Technology,Tehran 1999143344,Iran
  • 折叠

Abstract

Reinforcement learning(RL)has shown significant potential for dealing with complex decision-making problems.However,its performance relies heavily on the availability of a large amount of high-quality data.In many real-world situations,data distribution in the target domain may differ significantly from that in the source domain,leading to a significant drop in the performance of RL algorithms.Domain adaptation(DA)strategies have been proposed to address this issue by transferring knowledge from a source domain to a target domain.However,there have been no comprehensive and in-depth studies to evaluate these approaches.In this paper we present a comprehensive and systematic study of DA in RL.We first introduce the basic concepts and formulations of DA in RL and then review the existing DA methods used in RL.Our main objective is to fill the existing literature gap regarding DA in RL.To achieve this,we conduct a rigorous evaluation of state-of-the-art DA approaches.We aim to provide comprehensive insights into DA in RL and contribute to advancing knowledge in this field.The existing DA approaches are divided into seven categories based on application domains.The approaches in each category are discussed based on the important data adaptation metrics,and then their key characteristics are described.Finally,challenging issues and future research trends are highlighted to assist researchers in developing innovative improvements.

Key words

Reinforcement learning/Domain adaptation/Machine learning

引用本文复制引用

出版年

2024
信息与电子工程前沿(英文)
浙江大学

信息与电子工程前沿(英文)

CSTPCD
影响因子:0.371
ISSN:2095-9184
段落导航相关论文