基于PER-DDPG算法的城市轨道交通越区切换研究

Research on Handover of Urban Rail Transit Based on PER-DDPG Algorithm

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：针对传统IEEE802.11 越区切换方式存在较高的切换延时以及乒乓切换等问题,提出深度强化学习(Deep Q-Network,DQN)越区切换算法.通过对列车运行的特征状态信息进行提取输入,考虑列车运行速度及场强、切换阈值等动态信息构建越区切换模型.同时针对算法时间成本复杂度及稳定性,采用优先经验回放深度确定性策略梯度(Prioritized Experience Replay-Deep Deterministic Policy Gradient,PER-DDPG)算法,将列车状态空间信息传输至PER-DDPG网络中进行优化分析.结果表明基于PER-DDPG算法优化后的列车越区切换模型使用该算法时间计算成本降低,数据包传输延时约降低55%.

外文摘要：In order to solve the problems of the traditional IEEE802.11 off-zone switching mode,such as high switching delay and ping-pong switching,a deep Q-Network off-zone switching algorithm is proposed.A crossover algorithm of deep Q-Network is proposed.By extracting and inputting the characteristic state information of train op-eration,a crossover switching model is constructed considering the dynamic information of train running speed,field strength and switching threshold.Meanwhile,Prioritized Experience Replay-Deep Deterministic Policy Gradi-ent is prioritized to the complexity and stability of the algorithm's time cost.The spatial information of train status is transmitted to PER-DDPG network for optimization analysis.The results show that the time calculation cost of the optimized train crossover model based on PER-DDPG algorithm is reduced.the packet transmission delay is re-duced by about 55%.

外文关键词：

communication-based train controlCBTCsystemIEEE802.11 standardPER-DDPG

作者：

张军平、王小鹏、王冶力

展开 >

作者单位：

兰州交通大学研究院,甘肃兰州 730070

兰州交通大学电子与信息工程学院,甘肃兰州 730070

太原中铁轨道交通建设运营有限公司,山西太原 030032

关键词：

基于通信列车控制 CBTC系统 IEEE802.11标准优先经验回放机制深度确定策略梯度算法

基金：

甘肃省教育厅优秀研究生创新之星项目甘肃省自然科学基金甘肃省科技计划

项目编号：

2021CXZX-50721JR11RA06120YF8GA036

出版年：

2024

山西电子技术

山西省电子工业科学研究院山西省电子学会

山西电子技术

影响因子：0.197

ISSN：1674-4578

年,卷(期)：2024.(3)