基于制动特征自学习的磁浮列车强化学习制动控制

Reinforcement Learning Braking Control of Maglev Trains Based on Self-Learning of Hybrid Braking Features

刘鸿恩 ¹胡闽胜 ²胡海林²

扫码查看

作者信息

1. 江西理工大学永磁磁浮技术与轨道交通研究院,江西赣州 341000;江西省磁悬浮技术重点实验室,江西赣州 341000;北京全路通信信号研究设计院集团有限公司,北京 100073
2. 江西理工大学永磁磁浮技术与轨道交通研究院,江西赣州 341000;江西省磁悬浮技术重点实验室,江西赣州 341000
折叠

摘要

精准、平稳停车是磁浮列车自动驾驶制动控制的重要目标.中低速磁浮列车停站制动过程受到电-液混合制动状态强耦合等影响,基于制动特性机理模型的传统制动控制方法难以保障磁浮列车的停车精度和舒适性.本文提出一种基于混合制动特征自学习的磁浮列车强化学习制动控制方法.首先,采用长短期记忆网络建立磁浮列车混合制动特征模型,结合磁浮列车运行环境和状态数据进行动态制动特征自学习;然后,根据动态特征学习结果更新强化学习的奖励函数与学习策略,提出基于深度强化学习的列车制动优化控制方法;最后,采用中低速磁浮列车现场运行数据开展仿真实验.实验结果表明:本文所提出的制动控制方法较传统方法的舒适性和停车精度分别提高41.18％和22％,证明了本文建模与制动优化控制方法的有效性.

Abstract

Accurate and smooth parking is an essential goal for automatic driving braking control of maglev trains.The strong coupling of the electro-hydraulic hybrid braking state affects the medium and low-speed maglev trains during the stopping braking process,and the traditional braking control method based on the theoretical model of braking features makes it difficult to guarantee the parking accuracy and comfort of the maglev train.This paper proposed a reinforcement learning braking control method for maglev trains based on self-learning of hybrid braking features.First,a long short-term memory(LSTM)network was used to establish a hybrid braking feature model for maglev trains,and the self-learning of dynamic braking features was performed based on the operating environment and status data of maglev trains.Then,the reward function and learning strategy of reinforcement learning were updated according to the learning results of dynamic features,and a train braking optimization control method based on deep reinforcement learning was proposed.Finally,simulation experiments were carried out by using on-site operation data of medium and low-speed maglev trains.The experimental results show that the braking control method proposed in this paper improves comfort and parking accuracy by 41.18％and 22％,respectively,compared with the traditional method.It proves the effectiveness of the modeling and braking optimization control method in this paper.

关键词

磁浮列车/制动控制/多目标优化控制/强化学习/制动特征自学习

Key words

maglev train/braking control/multi-objective optimal control/reinforcement learning/self-learning of braking features

引用本文复制引用

基金项目

国家自然科学基金(52262050)

江西省自然科学基金(20224BAB202025)

出版年

2024

西南交通大学学报

西南交通大学

西南交通大学学报

CSTPCD北大核心

影响因子：0.973

ISSN：0258-2724

参考文献量10

段落导航