基于深度传播融合生成对抗网络的文本生成图像算法

扫码查看

原文链接

万方数据
维普

中文摘要：基于深度融合生成对抗网络(DF-GAN)多个融合模块相互独立,以致网络融合深度较浅并难以得到最优融合结果的问题,本文提出了一种基于深度传播融合生成对抗网络(DPF-GAN)的文本生成图像算法.该算法通过拼接相邻的仿射模块和融合模块,让前面的融合信息传播至后面的融合模块中,从而促进文本和图像更深层次地融合.实验表明,在CUB-200-2011和COCO数据集上,DPF-GAN生成的图像质量要优于DF-GAN,特别是CUB-200-2011数据集的FID指标减少了11.34%.与递归仿射变换生成对抗网络(RAT-GAN)相比,DPF-GAN的空间复杂度更低且推理速度更快.

外文标题：Text-to-Image Synthesis Algorithm Based on GANs with Deeply Propagated Fusion

外文摘要：The multiple fusion modules of deep fusion generative adversarial network(DF-GAN)were independent of each other,which leaded to a shallow fusion depth and made it difficult to obtain the optimal fusion result.Hence,a text-to-im-age synthesis algorithm which based on deep propagated fusion generative adversarial network(DPF-GAN)was proposed to solve these issues.This algorithm connected adjacent affine and fusion modules through concatenation,so that the previous fu-sion information can be propagated to the subsequent fusion modules.This facilitates a deeper integration of text and image.Through experimental results on the CUB-200-2011 dataset and COCO dataset,found that the quality of images which gener-ated by DPF-GAN was better than DF-GAN.The FID score on CUB-200-2011 dataset was decreased by approximately 11.34%compared to DF-GAN.Compared to the Recurrent affine transformation generative adversarial network(RAT-GAN),DPF-GAN offers lower spatial complexity and faster inference speed.

外文关键词：

text-to-image synthesisgenerative adversarial networkaffine transformationdeeply propagated fusionsin-gle level backbone

作者：

吴海峰、兰强

展开 >

作者单位：

安庆师范大学计算机与信息学院,安徽安庆 246133

关键词：

文本生成图像生成对抗网络仿射变换深度传播融合单级主干

基金：

安徽省自然科学基金

项目编号：

2108085MF216

出版年：

2024

DOI：

10.13757/j.cnki.cn34-1328/n.2024.03.012

安庆师范大学学报(自然科学版)

安庆师范学院

安庆师范大学学报(自然科学版)

影响因子：0.252

ISSN：1007-4260

年,卷(期)：2024.30(3)

参考文献量1