Balanced single-shot object detection using cross-context attention-guided network
In real-world application scenarios, object detection usually faces two technical challenges: achieving high accuracy and high speed. Although the latest detection frameworks based on anchor-free detection have achieved outstanding performance, their model complexity and slow speed prevent them from being widely deployed in real-world scenarios. In this paper, inspired by the cross-context attention mechanism of the human visual system, we propose a lightweight yet effective single-shot detection framework, the Cross-context Attention-guided Network (CCAGNet), to balance accuracy and speed. CCAGNet uses an attention-guided mechanism to highlight the interaction of object-synergy regions and suppress non-object-synergy regions by combining a Cross-context Attention Mechanism (CCAM), a Receptive Field Attention Mechanism (RFAM), and a Semantic Fusion Attention Mechanism (SFAM). The main contribution of our work is a novel attention mechanism that simultaneously takes channel, spatial, cross-region, and adjacent-region context information into consideration. Extensive experiments demonstrate the feasibility and effectiveness of our method on public benchmark datasets. To the best of our knowledge, CCAGNet achieves state-of-the-art performance on both PascalVOC and MSCOCO, with an excellent trade-off between accuracy and speed among single-shot detectors. In particular, the Average Precision (AP) on small-object detection on MSCOCO is improved by 17.0%. (c) 2021 Published by Elsevier Ltd.
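To make the abstract's idea of attention-guided feature reweighting concrete, the following is a minimal, generic sketch of a block that emphasizes informative regions and suppresses uninformative ones via channel and spatial gating. The module name, structure, and parameters here are illustrative assumptions only; they do not reproduce the paper's actual CCAM, RFAM, or SFAM designs.

```python
# Minimal sketch of attention-guided feature reweighting (channel + spatial
# gating). Illustrative assumption of the general idea; NOT the paper's
# CCAM/RFAM/SFAM modules.
import torch
import torch.nn as nn


class AttentionGuidedBlock(nn.Module):
    """Reweights a feature map so informative (object-synergy) regions are
    emphasized and non-informative regions are suppressed."""

    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        # Channel attention: global context -> per-channel gate.
        self.channel_gate = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(channels, channels // reduction, kernel_size=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels // reduction, channels, kernel_size=1),
            nn.Sigmoid(),
        )
        # Spatial attention: per-location gate from pooled channel statistics.
        self.spatial_gate = nn.Sequential(
            nn.Conv2d(2, 1, kernel_size=7, padding=3),
            nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = x * self.channel_gate(x)               # emphasize informative channels
        avg_map = x.mean(dim=1, keepdim=True)      # spatial statistics across channels
        max_map = x.amax(dim=1, keepdim=True)
        gate = self.spatial_gate(torch.cat([avg_map, max_map], dim=1))
        return x * gate                            # suppress low-attention locations


if __name__ == "__main__":
    feat = torch.randn(1, 256, 40, 40)             # e.g. a backbone feature map
    print(AttentionGuidedBlock(256)(feat).shape)   # torch.Size([1, 256, 40, 40])
```

In a single-shot detector such a block would typically be inserted on the backbone or neck feature maps before the detection heads, adding only a small amount of computation, which is in line with the paper's stated goal of balancing accuracy and speed.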
Keywords: Cross-context attention-guided network; Cross-context attention mechanism; Receptive field attention mechanism; Semantic fusion attention mechanism; Accuracy and speed balance