Convolutional neural networks (CNNs) applied to the task of fusing infrared and visible images have recently achieved strong fusion results. Many of these methods are built on autoencoder architectures, which are trained in a self-supervised manner and rely on hand-designed fusion strategies to fuse features in the testing phase. However, existing autoencoder-based methods rarely make full use of both shallow and deep features, and CNNs are limited by their receptive field, which makes it difficult to establish long-range dependencies and therefore loses global information. In contrast, the Transformer, with the help of its self-attention mechanism, can establish long-range dependencies and effectively obtain global contextual information. As for fusion strategies, most methods are designed in a coarse way and do not specifically consider the characteristics of the different modal images. Therefore, a CNN and a Transformer are combined in the encoder so that the encoder can extract more comprehensive features, and an attention model is applied in the fusion strategy to refine the features in a more fine-grained way. Experimental results show that the proposed fusion algorithm achieves excellent results in both subjective and objective evaluations compared with other image fusion algorithms.
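To make the two core ideas concrete, the following is a minimal PyTorch sketch of a hybrid encoder that pairs convolutional layers (shallow, local features) with a Transformer block (long-range, global context), together with an attention-based fusion strategy that weights the infrared and visible features adaptively instead of using a fixed rule such as averaging. The layer sizes, the single Transformer layer, and the squeeze-and-excitation-style channel attention are illustrative assumptions, not the paper's exact design.

```python
import torch
import torch.nn as nn


class HybridEncoder(nn.Module):
    """CNN stem for local features, followed by a Transformer encoder
    layer that models long-range dependencies across spatial positions."""

    def __init__(self, in_ch=1, dim=64):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(in_ch, dim, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(dim, dim, 3, padding=1), nn.ReLU(inplace=True),
        )
        self.transformer = nn.TransformerEncoderLayer(
            d_model=dim, nhead=4, dim_feedforward=dim * 2, batch_first=True
        )

    def forward(self, x):
        f = self.conv(x)                       # (B, C, H, W) local features
        b, c, h, w = f.shape
        tokens = f.flatten(2).transpose(1, 2)  # (B, H*W, C), one token per pixel
        tokens = self.transformer(tokens)      # global self-attention
        g = tokens.transpose(1, 2).reshape(b, c, h, w)
        return f + g                           # combine shallow (local) and deep (global) features


class AttentionFusion(nn.Module):
    """Channel-attention fusion: per-channel weights are predicted from
    the concatenated infrared/visible features, so each modality
    contributes adaptively rather than by a hand-fixed rule."""

    def __init__(self, dim=64):
        super().__init__()
        self.weight = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(2 * dim, 2 * dim, 1), nn.ReLU(inplace=True),
            nn.Conv2d(2 * dim, 2 * dim, 1), nn.Sigmoid(),
        )
        self.dim = dim

    def forward(self, f_ir, f_vis):
        w = self.weight(torch.cat([f_ir, f_vis], dim=1))
        w_ir, w_vis = w[:, :self.dim], w[:, self.dim:]
        return w_ir * f_ir + w_vis * f_vis


if __name__ == "__main__":
    enc, fuse = HybridEncoder(), AttentionFusion()
    ir = torch.randn(1, 1, 32, 32)   # infrared input
    vis = torch.randn(1, 1, 32, 32)  # visible input
    fused = fuse(enc(ir), enc(vis))
    print(fused.shape)               # torch.Size([1, 64, 32, 32])
```

In this sketch the residual sum `f + g` is one simple way to retain both the shallow convolutional features and the deep Transformer features; a decoder (not shown) would then reconstruct the fused image from the fused feature map.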