首页|基于分层强化学习的矿区无人驾驶车辆路径规划算法

基于分层强化学习的矿区无人驾驶车辆路径规划算法

扫码查看
在复杂而危险的矿区环境中,矿区无人驾驶车辆的路径规划涉及如何使车辆智能地选择最佳路径,以实现安全和高效运行.然而,传统的路径规划算法难以有效应对矿区内多变的路况和环境.提出了一种基于分层强化学习的矿区无人驾驶车辆路径规划算法,该算法通过分层强化学习技术训练图指针网络,求解矿区无人驾驶车辆路径规划问题.为将矿区无人驾驶车辆节点的向量映射成低维稠密向量,首先对图嵌入层的上下文向量进行均值化处理,用于保持网络的全局属性.再将交叉熵损失函数的范式加入分层强化学习的基准函数中,用于衡量 2 个不同驾驶车辆间的差异分布程度.试验结果表明:该算法在复杂的矿区环境下能够实现高效、安全、智能的路径选择,且模型收敛速度、时间花费上的优化效果优于传统算法和专业求解器,并具有良好的适应性和泛化能力.研究结果对于提高矿区无人驾驶的自主性、效率和安全性具有重要意义.
Path Planning Algorithm of Unmanned Vehicle in Mining Area Based on Hierarchical Reinforcement Learning
In complex and dangerous mining environments,path planning for unmanned vehicles in mining areas involves how to enable the vehicle to intelligently choose the best path to achieve safety and efficient operation.However,the traditional path planning algorithm is difficult to effectively deal with the changing road conditions and environment in the mining area.A path planning algorithm for unmanned vehicles in mining area based on hierarchical reinforcement learning is proposed.The al-gorithm trains graph pointer network by hierarchical reinforcement learning technology to solve the path planning problem of unmanned vehicles in mining area.In order to map the vector of the unmanned vehicle nodes in mining area into a low-dimen-sional dense vector,firstly,the context vector of the graph embedding layer is normalized to maintain the global properties of the network.Then,the cross entropy loss function is added to the benchmark function of hierarchical reinforcement learning to measure the difference distribution degree between two different driving vehicles.The experimental results show that this algo-rithm can realize efficient,safe and intelligent path selection in complex mining environment,and the optimization effect of model convergence speed and time cost exceeds the traditional algorithm and professional solver,and has good adaptability and generalization ability.The study results are of great significance for improving the autonomy,efficiency and safety of autonomous driving in mining areas.

mining area driverless vehiclehierarchical reinforcement learningpath planninggraph pointer network

魏晓娟、李纪云、巩闯

展开 >

河南职业技术学院现代信息技术学院,河南 郑州 450046

华北水利水电大学电子工程学院,河南 郑州 450046

矿区无人驾驶车辆 分层强化学习 路径规划 图指针网络

2024

金属矿山
中钢集团马鞍山矿山研究院 中国金属学会

金属矿山

CSTPCD北大核心
影响因子:0.935
ISSN:1001-1250
年,卷(期):2024.(11)