A Method of Real-time Detection for Road Traffic Participants Based on an Improved YOLOv5 Algorithm
Rapidly and accurately detecting traffic participants from road surveillance images is of great signifi-cance for intelligent transportation systems to monitor road targets.With the aim of solving the issues low detection accuracy and disability of detecting overlapping targets of the original YOLOv5 algorithm for various traffic partici-pants,a real-time detection method of road traffic participants based on an improved YOLOv5 algorithm is pro-posed.To improve the capacity of shallow network to extract image characteristics,the fused mobile inverted bottle-neck convolution(FusedMBC)is adopted to replace the original convolution structure to speed up the reasoning speed of the shallow neural network,and the self-attention mechanism is used to learn the texture features of traffic participants To enhance the ability of backbone network to perceive spatial features of images,the coordinate atten-tion mechanism(CA)is introduced,which makes the backbone network pay more attention to the semantic charac-teristics of traffic participants in the images.To enable conventional convolution to capture visual layouts and en-hance the sensitivity of activation space,the funnel activation function(FReLU)is adopted as the activation func-tion of the convolution layer,and the feature vector can be modeled at the pixel level.To enhance the ability of ex-tracting spatial features for dense targets,a coordinate attention mechanism is introduced to the feature fusion net-work,which captures the spatial and channel feature information of densely fused targets through attention mecha-nism,the network can accurately locate each target.Through data enhancement preprocessing on images of traffic participants based on the data set DAIR-V2X about vehicle-road cooperative and autonomous driving,a test set of 2 000 images is developed to verify the property of the model.Experimental results show that:①The improved YO-LOv5 algorithm has a mean average precision of 82.4%,an average recall rate of 93%,and an average detection speed of 204 frames/s.②In comparison to the original YOLOv5,its average detection accuracy and average detec-tion speed are increased by 5.8%and 33.3%,respectively.These results verify that the proposed method can detect traffic participants quickly and accurately,which can help to improve the ability of supervising traffic participants for intelligent transportation systems.