基于特征增强和历史帧选择的Transformer视觉跟踪算法

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：为进一步提升跟踪算法在历史帧信息利用和目标特征表达方面的性能,提出基于特征增强和历史帧选择的 Transformer 视觉跟踪算法(feature enhancement and history frame selection based Transformer visual tracking,FEHST).首先,在骨干网络中引入动态预测模块,通过稀疏化策略提高自注意力机制的计算效率,聚焦目标区域特征;其次,提出特征增强模块,将局部信息与全局信息的优势相结合,提升特征的表达能力;最后,采用自适应历史帧选择策略,提升跟踪器对目标动态信息的关注.在LaSOT、TrackingNet、GOT-10K和OTB100等数据集上进行了大量的实验,实验结果显示,在LaSOT、TrackingNet、OTB100上分别取得70.1％、83.0％和71.6％的成功率,在GOT-10K上取得71.4％的平均重叠度,并能以27FPS的速度运行.

外文标题：Feature enhancement and history frame selection based Transformer visual tracking

外文摘要：To enhance the performance of tracking algorithms in utilizing historical frame information and articulating target features,this paper proposes the feature enhancement and history frame selection based Transformer visual tracking(FEHST)algorithm.Firstly,a dynamic prediction module is integrated into the backbone network with a sparsification strategy to enhance the self-attention mechanism's computational efficiency,focusing on the target region's features.Then,a feature enhancement module is introduced,merging local and global information to improve feature representation.Finally,an adaptive history frame selection strategy is adopted to enhance focus on target dynamics and algorithm robustness.Experiments on LaSOT,TrackingNet,GOT-1 0K,and OTB100 datasets are carried out to validate the algorithm,showing success rates of 70.1％,83.0％,and 71.6％,and a 71.4％average overlap on GOT-10K,at 27 FPS.

外文关键词：

computer visionvisual trackingdeep learningattention mechanismhistory frame selectionTransformer

作者：

侯志强、杨晓麟、马素刚、王云龙、余旺盛、王昀琛

展开 >

作者单位：

西安邮电大学计算机学院,西安 710121

西安邮电大学陕西省网络数据分析与智能处理重点实验室,西安 710121

空军工程大学信息与导航学院,西安 710100

关键词：

计算机视觉视觉跟踪深度学习注意力机制历史帧选择 Transformer

基金：

国家自然科学基金项目陕西省自然科学基金项目

项目编号：

620723702023-JC-YB-598

出版年：

2024

DOI：

10.13195/j.kzyjc.2023.1048

控制与决策

东北大学

控制与决策

CSTPCD北大核心

影响因子：1.227

ISSN：1001-0920

年,卷(期)：2024.39(10)