首页|基于深度传播融合生成对抗网络的文本生成图像算法

基于深度传播融合生成对抗网络的文本生成图像算法

扫码查看
基于深度融合生成对抗网络(DF-GAN)多个融合模块相互独立,以致网络融合深度较浅并难以得到最优融合结果的问题,本文提出了一种基于深度传播融合生成对抗网络(DPF-GAN)的文本生成图像算法。该算法通过拼接相邻的仿射模块和融合模块,让前面的融合信息传播至后面的融合模块中,从而促进文本和图像更深层次地融合。实验表明,在CUB-200-2011和COCO数据集上,DPF-GAN生成的图像质量要优于DF-GAN,特别是CUB-200-2011数据集的FID指标减少了11。34%。与递归仿射变换生成对抗网络(RAT-GAN)相比,DPF-GAN的空间复杂度更低且推理速度更快。
Text-to-Image Synthesis Algorithm Based on GANs with Deeply Propagated Fusion
The multiple fusion modules of deep fusion generative adversarial network(DF-GAN)were independent of each other,which leaded to a shallow fusion depth and made it difficult to obtain the optimal fusion result.Hence,a text-to-im-age synthesis algorithm which based on deep propagated fusion generative adversarial network(DPF-GAN)was proposed to solve these issues.This algorithm connected adjacent affine and fusion modules through concatenation,so that the previous fu-sion information can be propagated to the subsequent fusion modules.This facilitates a deeper integration of text and image.Through experimental results on the CUB-200-2011 dataset and COCO dataset,found that the quality of images which gener-ated by DPF-GAN was better than DF-GAN.The FID score on CUB-200-2011 dataset was decreased by approximately 11.34%compared to DF-GAN.Compared to the Recurrent affine transformation generative adversarial network(RAT-GAN),DPF-GAN offers lower spatial complexity and faster inference speed.

text-to-image synthesisgenerative adversarial networkaffine transformationdeeply propagated fusionsin-gle level backbone

吴海峰、兰强

展开 >

安庆师范大学 计算机与信息学院,安徽 安庆 246133

文本生成图像 生成对抗网络 仿射变换 深度传播融合 单级主干

安徽省自然科学基金

2108085MF216

2024

安庆师范大学学报(自然科学版)
安庆师范学院

安庆师范大学学报(自然科学版)

影响因子:0.252
ISSN:1007-4260
年,卷(期):2024.30(3)
  • 1