首页|基于多注意力机制的红外与可见光图像夜间目标检测

基于多注意力机制的红外与可见光图像夜间目标检测

扫码查看
目标检测一直是计算机视觉领域的研究热点,YOLO 系列目标检测模型已广泛应用于多个领域.然而,目前关于目标检测的图像数据大多是基于单一类型传感器,难以完整地表征成像场景,且检测到的目标所包含有用信息具有局限性,尤其是在低照度、夜晚、雨雾等条件下,目标检测更加困难.为了更好地检测夜间目标,本文提出了一种结合CBAM注意力机制与Transformer 的多注意力机制的红外与可见光图像夜间目标检测方法,通过添加Transformer来获取丰富的局部和上下文信息,通过添加CBAM注意力机制来减少误检.为了验证方法的有效性,本文选取了 5 种当前主流的目标检测算法在公开红外目标检测数据集上进行测试,本文方法与原始YOLO v7 相比,mAP从62.6%提升至 71.5%.本文还制作了一个用于夜间目标检测红外-可见光融合目标检测数据集.在该数据集上与原始YOLOv7 相比,mAP从79.90%提升至 94.80%,效果非常显著.
Nighttime Object Detection in Infrared and Visible Images Based on Multi-Attention Mechanism
Object detection has long been a research hotspot in the field of computer vision,and the YOLO series of object detection models is widely used in numerous fields.However,most current image data for object detection are based on a single type of sensor,which makes it difficult to fully characterize the imaging scene.The detected objects contain limited useful information,especially under conditions of low illumination,night,rain,and fog.To improve nighttime object detection,our study proposed a multi-attention mechanism for infrared and visible images.This mechanism combines the CBAM attention mechanism with a Transformer to obtain rich local and contextual information and reduce false detections.To verify the effectiveness of the method,five current mainstream object detection algorithms were selected and tested on a public infrared object detection dataset.The mAP of the proposed method improved from 62.6%to 71.5%compared to the original YOLOv7.This study also produced an infrared-visible fusion dataset for nighttime object detection.On this dataset,the mAP improved significantly from 79.90%to 94.80%compared to the original YOLOv7.

multi-attention mechanismnight object detectioninfrared and visible imagesYOLOv7

黎瑞虹、付志涛、张韶琛、张健、王雷光

展开 >

昆明理工大学 国土资源工程学院,云南 昆明 650093

西南林业大学 森林生态大数据国家林业和草原重点实验室,云南 昆明 650024

多注意力 夜间目标检测 红外与可见光图像 YOLOv7

2024

红外技术
昆明物理研究所 中国兵工学会夜视技术专业委员会

红外技术

CSTPCD北大核心
影响因子:0.914
ISSN:1001-8891
年,卷(期):2024.46(12)