宜宾学院学报2024,Vol.24Issue(6) :21-26.DOI:10.19504/j.cnki.issn1671-5365.2024.06.04

一种改进CycleGAN的素描头像彩色化算法

A Colorization Algorithm for Sketch Heads with Improved CycleGAN

廖振 林国军 黄丹 胡鑫 游松 兰江海 周旭 金若水
宜宾学院学报2024,Vol.24Issue(6) :21-26.DOI:10.19504/j.cnki.issn1671-5365.2024.06.04

一种改进CycleGAN的素描头像彩色化算法

A Colorization Algorithm for Sketch Heads with Improved CycleGAN

廖振 1林国军 1黄丹 1胡鑫 1游松 1兰江海 1周旭 1金若水1
扫码查看

作者信息

  • 1. 四川轻化工大学自动化与信息工程学院,四川宜宾 644000
  • 折叠

摘要

针对现阶段由素描头像生成的彩色头像图像清晰度低、人脸识别率不高和视觉质量不佳等问题,提出一种改进CycleGAN的素描头像彩色化算法:对U-Net自编码器的第一个特征提取模块进行优化,设计一种多尺度自注意力机制特征提取模块,从多个尺度提取输入图像以减少输入图像的细节信息丢失,将提取的特征用通道堆叠的方式进行特征融合,对融合的特征嵌入SENet自注意力机制,以引导模型对特征重点区域的关注度,最后再降低融合特征的通道维数;对生成头像与真实头像添加L,像素损失和感知损失,以进一步提升生成头像的质量.实验结果表明:较基础模型CycleGAN生成的彩色头像,在CUHK数据集FID值降低了 22.23、Rank-1值提高了 16%,在AR数据集FID值降低了 15.34、Rank-1值提高了 9.3%.

Abstract

Aiming at the problems of low clarity,low face recognition rate and poor visual quality of color avatar images gener-ated from sketch avatars at the present stage,a colorization algorithm for sketch avatars improving CycleGAN was proposed:by optimizing the first feature extraction module of the U-Net self-encoder,a multi-scale self-attention mechanism feature extrac-tion module was designed to extract the input image from multiple scales to reduce the loss of detail information of the input im-age.The extracted features were fused by means of channel stacking,and the fused features were embedded with SENet self-attention mechanism to direct the model's attention to the feature focus area.Finally,the dimension of fused features was reduced.L1 pixel loss and perceptual loss were added to the generated and real avatars to further improve the quality of the generated ava-tars.The experimental results show that compared with the color avatar generated by the base model CycleGAN,the FID value of the CUHK dataset is reduced by 22.23 and Rank-1 value is improved by 16%,and the FID value of the AR dataset is reduced by 15.34 and Rank-1 value is improved by 9.3%.

关键词

CycleGAN/多尺度特征提取/SENet/监督学习/L1像素损失/感知损失

Key words

CycleGAN/multi-scale feature extraction/SENet/supervised learning/L1 pixel loss/perceptual loss

引用本文复制引用

基金项目

四川省科技厅项目(2022YFSY0056)

出版年

2024
宜宾学院学报
宜宾学院

宜宾学院学报

CHSSCD
影响因子:0.185
ISSN:1671-5365
参考文献量2
段落导航相关论文