安庆师范大学学报(自然科学版)2024,Vol.30Issue(3) :78-83.DOI:10.13757/j.cnki.cn34-1328/n.2024.03.012

基于深度传播融合生成对抗网络的文本生成图像算法

Text-to-Image Synthesis Algorithm Based on GANs with Deeply Propagated Fusion

吴海峰 兰强
安庆师范大学学报(自然科学版)2024,Vol.30Issue(3) :78-83.DOI:10.13757/j.cnki.cn34-1328/n.2024.03.012

基于深度传播融合生成对抗网络的文本生成图像算法

Text-to-Image Synthesis Algorithm Based on GANs with Deeply Propagated Fusion

吴海峰 1兰强1
扫码查看

作者信息

  • 1. 安庆师范大学 计算机与信息学院,安徽 安庆 246133
  • 折叠

摘要

基于深度融合生成对抗网络(DF-GAN)多个融合模块相互独立,以致网络融合深度较浅并难以得到最优融合结果的问题,本文提出了一种基于深度传播融合生成对抗网络(DPF-GAN)的文本生成图像算法.该算法通过拼接相邻的仿射模块和融合模块,让前面的融合信息传播至后面的融合模块中,从而促进文本和图像更深层次地融合.实验表明,在CUB-200-2011和COCO数据集上,DPF-GAN生成的图像质量要优于DF-GAN,特别是CUB-200-2011数据集的FID指标减少了11.34%.与递归仿射变换生成对抗网络(RAT-GAN)相比,DPF-GAN的空间复杂度更低且推理速度更快.

Abstract

The multiple fusion modules of deep fusion generative adversarial network(DF-GAN)were independent of each other,which leaded to a shallow fusion depth and made it difficult to obtain the optimal fusion result.Hence,a text-to-im-age synthesis algorithm which based on deep propagated fusion generative adversarial network(DPF-GAN)was proposed to solve these issues.This algorithm connected adjacent affine and fusion modules through concatenation,so that the previous fu-sion information can be propagated to the subsequent fusion modules.This facilitates a deeper integration of text and image.Through experimental results on the CUB-200-2011 dataset and COCO dataset,found that the quality of images which gener-ated by DPF-GAN was better than DF-GAN.The FID score on CUB-200-2011 dataset was decreased by approximately 11.34%compared to DF-GAN.Compared to the Recurrent affine transformation generative adversarial network(RAT-GAN),DPF-GAN offers lower spatial complexity and faster inference speed.

关键词

文本生成图像/生成对抗网络/仿射变换/深度传播融合/单级主干

Key words

text-to-image synthesis/generative adversarial network/affine transformation/deeply propagated fusion/sin-gle level backbone

引用本文复制引用

基金项目

安徽省自然科学基金(2108085MF216)

出版年

2024
安庆师范大学学报(自然科学版)
安庆师范学院

安庆师范大学学报(自然科学版)

影响因子:0.252
ISSN:1007-4260
参考文献量1
段落导航相关论文