A traffic sign recognition model based on improved YOLOv7

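Abstract (translated from Chinese): Objective With the rapid development of autonomous and assisted driving, traffic sign recognition has become increasingly important. Current traffic sign recognition algorithms, however, achieve low accuracy and are especially prone to false and missed detections when the background is complex, the lighting is poor, or the signs are small targets. To address these problems, an improved traffic sign recognition model based on YOLOv7 (you only look once version 7) is proposed. Method First, the spatial pyramid pooling fast cross stage partial concat (SPPFCSPC) method replaces the spatial pyramid pooling cross stage partial concat (SPPCSPC) method used by YOLOv7, improving the algorithm's feature extraction capability. Second, a weighted bi-directional feature pyramid network (BiFPN) is adopted to strengthen multi-scale feature fusion. Next, the normalized Wasserstein distance (NWD), a new measure of the distance between boxes, is adopted to address the excessive sensitivity of the traditional intersection over union (IoU) metric when detecting small-target traffic signs. Finally, the content-aware reassembly of features (CARAFE) operator adaptively generates upsampling kernels from the input features, which effectively enlarges the model's receptive field, makes better use of the information around the target, and reduces false and missed detections of traffic signs. Result Experimental results show that, with fewer parameters, the improved algorithm reaches mAP@0.5 and mAP@0.5:0.9 values of 92.50% and 72.21% on the TT100K traffic sign dataset, which are 3.24 and 1.83 percentage points higher than the original YOLOv7 algorithm. The effectiveness of the improvements was also verified on the CCTSDB traffic sign dataset, which has small-target characteristics, and on a collated dataset of foreign traffic signs. Conclusion Experimental verification together with subjective and objective evaluation demonstrates the feasibility of the improved algorithm, which effectively recognizes small-target traffic signs in a variety of environments and further raises the mean average precision of YOLOv7 on traffic sign recognition while reducing the number of parameters.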
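The weighted fusion idea behind BiFPN can be illustrated with a minimal sketch. The abstract does not state which weighting scheme the authors use, so the code below assumes the "fast normalized fusion" form commonly associated with BiFPN, in which each input gets a non-negative learnable weight and the weights are normalized by their sum. The function name `fast_normalized_fusion`, the flat-list feature representation, and the `eps` value are illustrative assumptions, not details from the paper.

```python
def fast_normalized_fusion(features, weights, eps=1e-4):
    """Fuse same-sized feature vectors with normalized non-negative weights.

    features: list of equal-length lists (inputs already resized to one scale)
    weights:  one scalar per input (learnable in a real network; fixed here)
    eps:      small constant that keeps the division stable (assumed value)
    """
    if len(features) != len(weights):
        raise ValueError("need exactly one weight per input feature")
    # Clip negative weights to zero so every contribution is non-negative.
    clipped = [max(w, 0.0) for w in weights]
    total = sum(clipped) + eps
    length = len(features[0])
    return [
        sum((w / total) * f[i] for w, f in zip(clipped, features))
        for i in range(length)
    ]


# Two toy "feature maps" flattened to lists; the second one is weighted 3x.
low_res = [1.0, 1.0, 1.0, 1.0]
high_res = [3.0, 3.0, 3.0, 3.0]
fused = fast_normalized_fusion([low_res, high_res], [1.0, 3.0])
print(fused)
```

With equal weights this reduces to a plain average, which is the undifferentiated fusion that the weighting is meant to improve on.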

English title: Improved traffic sign recognition model for YOLOv7

English abstract: Objective Traffic sign recognition has become an important research direction given the rapid development of driverless and assisted driving. These systems place additional demands on accurate traffic sign recognition, especially in real driving environments, where the recognition rate is easily degraded by external conditions. When identifying small-target traffic signs, most algorithms still achieve very low accuracy, which readily leads to false and missed detections and impairs the driver's judgment of road traffic signs. Improving the accuracy of traffic sign detection is therefore necessary to reduce accidents and improve driving safety. This paper proposes a traffic sign recognition method based on an improved YOLOv7 algorithm.

Method First, drawing on the idea of spatial pyramid pooling fast (SPPF) and building on the spatial pyramid pooling cross stage partial concat (SPPCSPC) module of the original YOLOv7 model, the input feature map is divided into blocks, pooling operations of different sizes are applied within each block, the pooled results are concatenated according to the positions of the original blocks, and a convolution is applied. The result is a new spatial pyramid pooling structure called spatial pyramid pooling fast cross stage partial concat (SPPFCSPC). SPPFCSPC replaces SPPCSPC in the original model and pools the input feature map at multiple scales, which optimizes training and improves the accuracy with which targets are identified. Second, ordinary feature fusion often resizes features of different resolutions and sums them without distinguishing their contributions. To address this, a bi-directional feature pyramid network (BiFPN) is used in the neck to attach a learnable weight to each input during fusion, so that the network learns the importance of each input feature, merges the multiscale features of the target effectively, and improves the detection of small targets. Third, because small-target detection requires high localization performance, the normalized Wasserstein distance (NWD), a measure of the distance between boxes, is adopted to overcome the high sensitivity of the traditional intersection over union (IoU) metric to small targets; it is used in anchor-based detectors to improve the non-maximum suppression module and the loss function. Specifically, each bounding box is modeled as a two-dimensional Gaussian distribution, which better matches the characteristics of small targets, and the IoU between the predicted and ground-truth boxes is replaced by the similarity between the two distributions, with NWD serving as the measure of that similarity. NWD can be applied to any detector that uses the IoU metric by substituting it directly for IoU, and it improves the recognition of traffic signs with few features in real traffic scenes. Finally, the lightweight upsampling operator content-aware reassembly of features (CARAFE) upsamples the feature map to the required size using kernels generated adaptively from the input features. This supports feature fusion across scales, effectively enlarges the receptive field of the model, makes better use of the information around the target, strengthens detection, and reduces missed detections.

Result The experimental results show that the improved YOLOv7 model trained on the Tsinghua-Tencent 100K (TT100K) traffic sign dataset reached mAP@0.5 and mAP@0.5:0.9 values of 92.5% and 72.21%, respectively, compared with 89.26% and 70.38% for the original YOLOv7 algorithm, an improvement of 3.24 and 1.83 percentage points. The improvements were further verified on the CSUST Chinese Traffic Sign Detection Benchmark (CCTSDB), which contains small targets, and on a collated dataset of foreign traffic signs. Compared with the original algorithm, the improved algorithm increased accuracy by 3.15% and 2.24% on CCTSDB and by 2.28% and 1.25% on the foreign traffic sign dataset. The improved algorithm thus increased recognition accuracy on all three traffic sign datasets.

Conclusion Experimental verification together with subjective and objective evaluation demonstrates the feasibility and effectiveness of the improved YOLOv7 traffic sign recognition model. The improved model raises the recognition rate for both ordinary and small-target traffic signs in various harsh environments while reducing the number of parameters, and thus meets the recognition accuracy requirements of driverless and assisted driving systems to a certain extent.
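The contrast between IoU and NWD for small boxes can be made concrete with a short sketch. It assumes the commonly published form of NWD: a box (cx, cy, w, h) is modeled as a Gaussian with mean (cx, cy) and covariance diag(w²/4, h²/4), the squared 2-Wasserstein distance between two such Gaussians is the squared Euclidean distance between the vectors (cx, cy, w/2, h/2), and NWD = exp(−sqrt(W₂²)/C). The function names and the constant `c=12.8` are illustrative assumptions; the abstract does not give the value used by the authors.

```python
import math


def nwd(box_a, box_b, c=12.8):
    """Normalized Wasserstein distance between two (cx, cy, w, h) boxes.

    c is a dataset-dependent normalizing constant; 12.8 is only an
    illustrative value, not one taken from the paper.
    """
    cxa, cya, wa, ha = box_a
    cxb, cyb, wb, hb = box_b
    # Squared 2-Wasserstein distance between the two Gaussian box models.
    w2_sq = (
        (cxa - cxb) ** 2
        + (cya - cyb) ** 2
        + ((wa - wb) / 2) ** 2
        + ((ha - hb) / 2) ** 2
    )
    return math.exp(-math.sqrt(w2_sq) / c)


def iou(box_a, box_b):
    """Intersection over union of two (cx, cy, w, h) boxes."""
    ax1, ay1 = box_a[0] - box_a[2] / 2, box_a[1] - box_a[3] / 2
    ax2, ay2 = box_a[0] + box_a[2] / 2, box_a[1] + box_a[3] / 2
    bx1, by1 = box_b[0] - box_b[2] / 2, box_b[1] - box_b[3] / 2
    bx2, by2 = box_b[0] + box_b[2] / 2, box_b[1] + box_b[3] / 2
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = box_a[2] * box_a[3] + box_b[2] * box_b[3] - inter
    return inter / union if union > 0 else 0.0


# The same 3-pixel horizontal shift applied to a small and a large box.
small_a, small_b = (10, 10, 6, 6), (13, 10, 6, 6)
large_a, large_b = (100, 100, 60, 60), (103, 100, 60, 60)

print("IoU small:", iou(small_a, small_b), "IoU large:", iou(large_a, large_b))
print("NWD small:", nwd(small_a, small_b), "NWD large:", nwd(large_a, large_b))
```

In this toy case the 3-pixel shift cuts the IoU of the small box far more than that of the large box, while NWD gives both pairs the same score because it depends only on the centre offset and the size difference. NWD also stays positive for boxes that do not overlap at all, where IoU is zero.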

English keywords:

traffic sign recognition; spatial pyramid pooling fast cross stage partial concat (SPPFCSPC); bi-directional feature pyramid network (BiFPN); normalized Wasserstein distance (NWD); content-aware reassembly of feature (CARAFE); small goals

Authors:

Meng Bo (孟勃), Shi Weida (史伟大)

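Author affiliations:

School of Computer Science, Northeast Electric Power University, Jilin 132012

Robot Vision and Virtual Reality Laboratory, Northeast Electric Power University, Jilin 132012

Electric Power Robotics Laboratory, Northeast Electric Power University, Jilin 132012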

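Keywords (translated from Chinese):

traffic sign recognition; spatial pyramid pooling fast cross stage partial concat (SPPFCSPC); weighted bi-directional feature pyramid network (BiFPN); normalized Wasserstein distance (NWD); content-aware reassembly of features (CARAFE); small targets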

Publication year:

2024

DOI:

10.11834/jig.230501

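Journal of Image and Graphics (中国图象图形学报)

Institute of Remote Sensing Applications, Chinese Academy of Sciences; China Society of Image and Graphics; Institute of Applied Physics and Computational Mathematics, Beijing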

Indexed in: CSTPCD; Peking University Core Journals (北大核心)

Impact factor: 1.111

ISSN: 1006-8961

Year, volume (issue): 2024, 29(9)