基于深度强化学习的行人检测研究

Research on Deep Reinforcement Learning-based Pedestrian Detection

扫码查看

原文链接

维普
万方数据

中文摘要：针对目前深度强化学习(Deep Reinforcement Learning,DRL)在目标检测中智能体初始化训练窗口固定单一、多目标和小目标图像错检率、漏检率高的问题,提出一种结合YOLOv5s和DQN算法的行人检测方法.该方法能够通过YOLOv5s搜索到含有目标的数量和区域,将回归的初步包围框设定为智能体的初始化窗口,提升尺度适应性.改进传统强化学习模型的奖励函数,使奖惩反馈更精准,提高模型检测精度和速度.与现有的基于深度学习、深度强化学习的目标检测模型对比实验,实测结果表明所提出的行人检测方法能够有效地提高检测精确度.

外文摘要：To address the current problems of fixed single initialization training window of intelligences in target detection by deep reinforcement learning(DRL),high error detection rate and leakage rate of multi-target and small target images,a pedestrian detection method combining YOLOv5s and DQN algorithm is proposed in this paper.The method is able to search the number and area containing targets by YOLOv5s,set the initial enclosing frame of regression as the initialization window of the intelligent body,and improve the scale adaptation.The reward function of the traditional reinforcement learn-ing model is improved to make the reward and punishment feedback more accurate and improve the model detection ac-curacy and speed.Comparison experiments with existing target detection models based on deep learning and deep rein-forcement learning are conducted to obtain empirical results showing that the proposed pedestrian detection method can effectively improve the detection accuracy.

外文关键词：

pedestrian detectiondeep reinforcement learningYOLOv5reward function

作者：

李新羽、徐野

展开 >

作者单位：

沈阳理工大学自动化与电气工程学院,辽宁沈阳 110159

沈阳建筑大学智能建造实验室,辽宁沈阳 110168

关键词：

行人检测深度强化学习 YOLOv5 奖励函数

基金：

国家自然科学基金项目

项目编号：

61373159

出版年：

2024

工业控制计算机

中国计算机学会工业控制计算机专业委员会江苏省计算技术研究所有限责任公司

工业控制计算机

影响因子：0.258

ISSN：1001-182X

年,卷(期)：2024.37(3)

参考文献量13