基于改进pix2pix的红外图像转换技术

Infrared image conversion technology based on improved pix2pix

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：针对不同波段图像获取代价不同的问题,提出一种基于pix2pix的图像转换方法并进行改进.主要针对生成器和鉴别器两方面进行改进.生成器方面,使用残差结构的生成器替换原来的U-Net生成器以缓解梯度消失问题;引入可变形卷积,提高目标边缘和小目标的生成效果;引入BAM注意力机制,提高了算法对图像中主要目标的特征提取能力以提升生成图像的效果.鉴别器方面:改变PatchGAN中卷积层的层数(原PatchGAN为3层卷积),设置对照实验找到转换效果最好的卷积层数.以可见光图像和红外图像之间的转换为例进行实验.实验结果表明,改进后的算法在生成图像上的均方根误差(MSE)下降了 31.4％、结构相似性(SSIM)提高了 11.2％,可以更好的实现红外图像和可见光图像之间的转换.

外文摘要：In order to solve the problem of different cost of image acquisition in different light segments,an image con-version method based on pix2pix was proposed.It mainly focuses on the generator and discriminator.In terms of gener-ators,the residual structures generator was used instead of the original U-Net generator to alleviate the gradient vanis-hing problem.Deformable convolution is introduced to improve the generation effect of target edges and small tar-gets.The BAM attention mechanism is introduced to improve the feature extraction ability of the algorithm for the main target in the image to improve the image generation effect.In terms of discriminators:change the number of convolu-tional layers in PatchGAN(the original PatchGAN is 3-layer convolution),and set up a control experiment to find the convolutional layer with the best conversion effect.Some KAIST datasets are selected for training and testing.The ex-perimental results show that the Root Mean Square Error(MSE)of the improved algorithm is reduced by 31.4％and the Structural Similarity(SSIM)is increased by 11.2％,which can better realize the conversion between infrared and visible images.

外文关键词：

generative adversarial networkpix2piximage conversionresidual structures

作者：

叶明亮、史春景、郝永平、李大伟

展开 >

作者单位：

沈阳理工大学机械工程学院,辽宁沈阳 110159

沈阳理工大学装备工程学院,辽宁沈阳 110159

关键词：

生成对抗网络 pix2pix 图像转换残差结构

出版年：

2024

DOI：

10.3969/j.issn.1001-5078.2024.07.024

激光与红外

华北光电技术研究所

激光与红外

CSTPCD北大核心

影响因子：0.723

ISSN：1001-5078

年,卷(期)：2024.54(7)

参考文献量9