摘要
近年来,基于Transformer结构的图像修复算法在图像全局结构理解、通用数据集的泛化能力等方面表现出色.然而目前相关研究综述较少,为了进一步推动图像修复问题研究,对Transformer类的图像修复方法进行归纳和分析.首先介绍了Transformer基本原理和框架,其次依据Transformer的结构对采用Transformer的图像修复模型进行分类,分析描述了各方法的改进之处、适用范围和优缺点等,针对不同掩码以及掩码比率,在多种公共数据集上对不同算法进行定量分析和修复效果展示,同时对各种方法的输出多样性进行性能分析.最后总结了相关研究所面临的挑战,并对未来的发展前景和研究方向提出了展望.
Abstract
In recent years,image inpainting algorithms based on Transformer structure have performed well in terms of image global structure understanding,diversified restoration,generalization ability of common datasets,etc.,and the restoration results are more reasonable and diversified.However,there are few reviews of related studies.In order to further promote the study of image inpainting problems,image inpainting methods of Transformer class are summarized and analyzed.Firstly,we introduced the basic principle and framework of Transformer.Secondly,we clas-sified the image inpainting models using Transformer based on the structure of Transformer,and analyzed and de-scribed the improvements,applicability,advantages and disadvantages of each method,etc.In addition,quantitative a-nalysis and inpainting effects of different algorithms were demonstrated on various public datasets for different masks and mask ratios,and performance analysis of the output diversity of various methods was also performed.Finally,the challenges faced by the related research are summarized,and the future prospects and research directions are pro-posed.