计算机应用与软件2024,Vol.41Issue(10) :314-318.DOI:10.3969/j.issn.1000-386x.2024.10.046

一种融合上下文信息及自适应感受野的多尺度目标检测算法

AN MULTI-SCALE OBJECT DETECTION ALGORITHM COMBINING CONTEXT INFORMATION AND ADAPTIVE RECEPTIVE FIELD

张婷 兰时勇
计算机应用与软件2024,Vol.41Issue(10) :314-318.DOI:10.3969/j.issn.1000-386x.2024.10.046

一种融合上下文信息及自适应感受野的多尺度目标检测算法

AN MULTI-SCALE OBJECT DETECTION ALGORITHM COMBINING CONTEXT INFORMATION AND ADAPTIVE RECEPTIVE FIELD

张婷 1兰时勇1
扫码查看

作者信息

  • 1. 四川大学视觉合成图形图像技术国防重点学科实验室 四川成都 610064
  • 折叠

摘要

目标检测在实际应用各类复杂场景中面临着诸多的挑战,如目标遮挡、光照变化、目标尺度变化等.为了提高多尺度目标检测的性能,提出一种改进的特征金字塔(FPN)的目标检测算法.以特征金字塔网络框架为基础引入上下文信息融合模块,充分利用目标对象与其周围环境的关联属性,增强宽动态尺度范围的目标对象的特征表征,提高不同尺度目标的辨识能力.此外,构建一个跨通道注意机制,自适应调整不同尺度目标特征的通道灵敏度,学习到适应目标尺度的感受野范围.该算法在Pascal VOC数据集训练验证,其平均精确率(mAP)比基准方法提高了3%.

Abstract

Object detection faces many challenges in the practical application of various complex scenes,such as object occlusion,illumination changes,and object size changes in the practical application.In order to improve the performance of multi-scale target detection,this paper proposes an improved feature pyramid network(FPN)target detection algorithm.Based on the FPN framework,the context information fusion was introduced to utilize the relevance of an object to its surrounding environment and enhance feature representation of objects for wide dynamic range images and to improve the ability of detection ability for different scales.In addition,a cross-channel attention mechanism was constructed to adaptively adjust the channel sensitivity of target features at different scales.Experiments on the Pascal VOC dataset show that the proposed method improves the detection performance by 3%compared with the baseline method in terms of mean average precision(mAP).

关键词

目标检测/上下文信息融合/跨通道注意力机制

Key words

Object detection/Context information extraction/Cross-channel attention mechanism

引用本文复制引用

基金项目

四川省科技厅重点研发项目(2021YFG0300)

视觉合成图形图像技术国防重点学科实验室开放研究项目()

出版年

2024
计算机应用与软件
上海市计算技术研究所 上海计算机软件技术开发中心

计算机应用与软件

CSTPCD北大核心
影响因子:0.615
ISSN:1000-386X
段落导航相关论文