Learning to select the recombination operator for derivative-free optimization

扫码查看

原文链接

万方数据
维普

外文摘要：Extensive studies on selecting recombination operators adaptively,namely,adaptive operator selection(AOS),during the search process of an evolutionary algorithm(EA),have shown that AOS is promising for improving EA's performance.A variety of heuristic mechanisms for AOS have been proposed in recent decades,which usually contain two main components:the feature extraction and the policy setting.The feature extraction refers to as extracting relevant features from the information collected during the search process.The policy setting means to set a strategy(or policy)on how to select an operator from a pool of operators based on the extracted feature.Both components are designed by hand in existing studies,which may not be efficient for adapting optimization problems.In this paper,a generalized framework is proposed for learning the components of AOS for one of the main streams of EAs,namely,differential evolution(DE).In the framework,the feature extraction is parameterized as a deep neural network(DNN),while a Dirichlet distribution is considered to be the policy.A reinforcement learning method,named policy gradient,is used to train the DNN.As case studies,the proposed framework is applied to two DEs including the classic DE and a recently-proposed DE,which result in two new algorithms named PG-DE and PG-MPEDE,respectively.Experiments on the Congress of Evolutionary Computation(CEC)2018 test suite show that the proposed new algorithms perform significantly better than their counterparts.Finally,we prove theoretically that the considered classic methods are the special cases of the proposed framework.

外文关键词：

evolutionary algorithmdifferential evolutionadaptive operator selectionreinforcement learningdeep learning

作者：

Haotian Zhang、Jianyong Sun、Thomas B?ck、Zongben Xu

展开 >

作者单位：

School of Mathematics and Statistics,Xi'an Jiaotong University,Xi'an 710049,China

Leiden Institute of Advanced Computer Science,Leiden University,Leiden 2333 CA,Netherland

基金：

National Natural Science Foundation of ChinaKey Research and Development Project of Shaanxi Province

项目编号：

620761972022GXLH-01-15

出版年：

2024

DOI：

10.1007/s11425-023-2252-9

中国科学:数学(英文版)

中国科学院

中国科学:数学(英文版)

CSTPCD

影响因子：0.36

ISSN：1674-7283

年,卷(期)：2024.67(6)