基于改进的指针网络深度强化学习算法求解旅行商问题

扫码查看

原文链接

国家科技期刊平台
NETL
NSTL
万方数据

中文摘要：旅行商问题是组合优化问题中的经典问题，而深度强化学习的发展为该类问题的求解提供了新思路。在基于指针网络的深度强化学习算法求解旅行商问题中，策略网络和价值网络的编码器都采用了复杂的长短期记忆网络结构，这在求解大规模旅行商问题时会造成训练时间过长的现象。鉴于输入节点间位置顺序的无关性，本文对指针网络中编码器的循环神经网络进行了修改，将策略网络和价值网络编码器中的长短期记忆网络都替换为一维卷积神经网络，最终提出了一种改进的基于指针网络的深度强化学习算法，其在相同求解问题规模上所需要的训练时间比原模型减少12％～15％，实验结果充分验证了本文改进算法的有效性。

外文标题：Improved Deep Reinforcement Learning Algorithm Based on Pointer Network for Traveling Salesman Problem

外文摘要：Traveling salesman problem is a classic problem in combinatorial optimization. The development of deep rein-forcement learning provides a new way to solve this problem. In the deep reinforcement learning algorithm based on the point-er network for the traveling salesman problem, the encoders of the strategy network and the value network both employ the complex long short-term memory network structure, which leads a long training time to the large-scale traveling salesman problem. Considering the independence of the position order among the input nodes, this paper modifies the recurrent neural network of the encoder in the pointer network and replaces the long short-term memory network of encoders in the strategy network and the value network with the one-dimensional convolutional neural network. An improved deep reinforcement learning algorithm based on the pointer network is proposed, which reduces the training time by 12％to 15％compared with the original model on the same scale of resolving the problem. The experimental results verify the effectiveness of the im-proved algorithm.

外文关键词：

traveling salesman problemdeep reinforcement learningpointer networkconvolutional neural networklong short-term memorypolicy gradient

作者：

唐娇娇、左烔菲、陈逢林

展开 >

作者单位：

安庆师范大学数理学院,安徽安庆 246133

关键词：

旅行商问题深度强化学习指针网络卷积神经网络长短期记忆网络策略梯度

基金：

安徽省教育厅重点项目安徽省教育厅教研项目

项目编号：

KJ2019A05802020xsxxkc259

出版年：

2024

DOI：

10.13757/j.cnki.cn34-1328/n.2024.02.011

安庆师范大学学报(自然科学版)

安庆师范学院

安庆师范大学学报(自然科学版)

影响因子：0.252

ISSN：1007-4260

年,卷(期)：2024.30(2)

参考文献量2