3D object detection algorithm with multi-channel cross attention fusion
To solve the problems that the existing single-stage 3D object detection algorithm utilizes point cloud down-sampling features in a single way and the degree of aggregation of features for the long-range contextual information cannot meet the requirement of enhancing the algorithm performance,we propose a single-stage 3D object detection al-gorithm based on multi-channel cross attention fusion.First,the channel-wise cross attention module is designed to fuse the down sampled features,which can enhance the expression ability of multi-scale features for the long-range spatial information under different receptive field based on the cross attention mechanism.Then,a cascade feature excitation module is proposed to combine the original downsampling features to cascade channel-wise cross attention weighted features to enhance the algorithm's learning ability for key spatial features.Extensive experiments were conducted on the public autonomous driving dataset KITTI and compared with mainstream algorithms.As a single-stage algorithm,the detection accuracy was 91.34%,79.85%and 75.98%for the three difficulty levels of car categories,which were 4.83%,3.26%and 3.32%better than the baseline algorithm.The experimental results demonstrate the effectiveness and ad-vancement of the algorithm and the proposed modules for 3D object detection task.
3D point cloudautonomous drivingLiDARdeep learning3D object detectionpillarcross attentionsingle-stage algorithm