Multi-level Fusion Based Weakly Supervised Object Detection Network
Due to the lack of precise bounding box annotations,weakly supervised object detectors rely on the pretrained image classification model to classify candidate regions.However,the pretrained model often produces high responses for discriminative regions rather than complete objects,resulting in the problems of part domination,instance missing and untight boxes.To address these issues,a multi-level fusion based weakly supervised object detection network is proposed.The detection performance is improved from the perspectives of enhancing the weak discriminative spatial feature learning,enriching intra-class sample features and weighting reliable pseudo-labels.Firstly,a power function is utilized to weight and fuse the activation values within the neighborhood by the power pooling layer to reduce information loss of weak discriminative features.Secondly,the feature vectors of candidate regions are randomly fused by the feature mixing method to enrich the diversity of training sample features.Finally,the confidence of predictions and pseudo-labels is fused via the confidence-based sample re-weighting strategy to adjust the influence of pseudo-labels on training.Experiments on three benchmarks demonstrate the superiority of the proposed network.