基于注意力机制的多任务目标计数系统设计

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：提出基于注意力机制的深层神经网络用于目标计数,其任务是为输入图像的目标进行精确数目统计.该网络模型同时引进多任务学习方法,多尺度融合得到密度特征图和注意力特征图进行目标计数.首先,使用交叉特征金字塔网络进行特征提取;其次,将提取的特征分别用于密度特征图及注意力特征图进行交叉融合;最后,通过多任务学习将两个输出特征图逐元素运算,得到精确的密度特征图.提出的网络模型在行人检测数据集(ShanghaiTech)与多类别的行为识别数据集(UCF_CC_50)上进行了训练与测试,实验结果表明,通过在各个分支引入注意力机制,可以有效提高整个模型预测结果的准确率.

外文标题：Design of Multi-Task Object Counting System Based on Attention Mechanism

外文摘要：This article proposes a deep neural network based on the attention mechanism for object counting,whose task is to accurately count the number of targets in the input image.The network model simultaneously introduces a multi-task learning approach,fusing multi-scale features to obtain density and attention feature maps for object counting.Firstly,a cross-feature pyramid network is used for feature extraction.Then,the extracted features are used for density and attention feature maps,which are cross-fused.Finally,through multi-task learning,the two output feature maps are element-wise operated to obtain an accurate density feature map.The network model of this paper is trained and tested on pedestrian detection datasets(ShanghaiTech)and multi-category action recognition datasets(UCF_CC_50).The experimental results show that introducing the attention mechanism in each branch can effectively improve the accuracy of the entire model's predictive results.

外文关键词：

object countingattention mechanismmulti-task learningspatial pyramid pooling fast cross stage partial connection

作者：

李永慧

展开 >

作者单位：

山西科技学院光机电工程学院,山西晋城 048000

关键词：

目标计数注意力机制多任务学习交叉特征金字塔网络

出版年：

2024

DOI：

10.16280/j.videoe.2024.07.012

电视技术

电视电声研究所中国电子科技集团公司第三研究所

电视技术

影响因子：0.496

ISSN：1002-8692

年,卷(期)：2024.48(7)

参考文献量1