首页|双注意力随机选择全局上下文细粒度识别网络

双注意力随机选择全局上下文细粒度识别网络

扫码查看
针对细粒度图像识别任务中易忽视微小潜在性特征且外观差异细微等问题,提出一种基于双注意力随机选择全局上下文细粒度识别网络.首先,使用ConvNeXt作为主干网络,提出双注意力随机选择模块,对不同阶段提取到的特征进行通道随机选择和空间随机选择,使网络能够关注到其他潜在微小判别性特征;其次,利用全局上下文注意力模块将深层特征的语义信息融合到中间层,增强中间层定位微小特征的能力;最后,提出一种多分支损失,对中间层、深层和拼接层特征引入分类损失,结合不同分支提取到的特征,诱导网络获得多样性的判别特征.所提网络在Stanford-cars、CUB-200-2011、FGVC-Aircraft 3个公开细粒度数据集和真实场景下车型数据集VMRURS上分别达到了95.2%、92.1%、94.0%和97.0%的识别准确率,其性能相比其他对比方法有较大幅度提升.
Dual-attention random selection global context fine-grained recognition network
To address the difficulties of capturing the potential distinguishable features and subtle appearance differences in fine-grained image recognition tasks,dual-attention random selection global context fine-grained recognition network is proposed.Firstly,the ConvNeXt is taken as the backbone network,a dual-attention random selection module is proposed to perform channel random selection and spatial random selection on the features extracted at different stages,so that the network could focus on other potential subtle distinguishable features.Then,by using the global context attention module,the semantic information of top-level is applied to the middle-level to enhance the ability of the middle-level to locate potential subtle distinguishable features.Finally,the multi-branch loss is proposed,and classification loss is imposed on middle-level,top-level and concat-level characteristics.Combining the features extracted from different branches,the network is induced to obtain diverse distinguishable features.The network achieves the accuracies of 95.2%,92.1%,94.0%and 97.0%respectively on the three open datasets,Stanford-cars,CUB-200-2011,FGVC-Aircraft and dataset VMRURS in real scenes.The presented method in this paper greatly upgrades the comparison performance.

fine-grained recognitionconvnextdual-attention random selectionglobal context attentionmulti-branch loss

徐胜军、荆扬、段中兴、李明海、李海涛、刘福友

展开 >

西安建筑科技大学 信息与控制工程学院,陕西 西安 710055

西安市建筑制造智能化技术重点实验室,陕西 西安 710055

江苏省交通工程建设局,江苏 南京 210004

中交隧道工程局有限公司,北京 100024

展开 >

细粒度识别 ConvNeXt 双注意力随机选择 全局上下文注意力 多分支损失

国家自然科学基金陕西省自然科学基础研究计划陕西省自然科学基础研究计划陕西省重点研发计划陕西省教育厅专项科研项目

522781252023-JC-YB-5322022JQ-6812021SF-42920JK0721

2024

液晶与显示
中科院长春光学精密机械与物理研究所 中国光学光电子行业协会液晶分会 中国物理学会液晶分会

液晶与显示

CSTPCD北大核心
影响因子:0.964
ISSN:1007-2780
年,卷(期):2024.39(4)
  • 44