面向图像修复的注意力协同调制对抗网络

扫码查看

原文链接

国家科技期刊平台
NETL
NSTL
万方数据

中文摘要：以往的虚拟修复方法在不伤害壁画本体的前提下取得了令人瞩目的结果,但存在已知破损区域位置的假设,而现实中破损区域是未知的且难以准确获取.对于含有复杂结构的破损区域,现有图像修复方法会产生严重的伪影失真.为解决这个问题,提出了一种基于注意力协同调制对抗网络的古代壁画图像盲修复模型.该模型包括破损检测和孔洞修复2个阶段,在破损检测阶段,设计了多路径注意力模块,联合区域候选网络和预测分支模块,估计破损区域的位置掩模;在孔洞修复阶段,采用了融入门控卷积的协同调制生成对抗网络修复破损区域.在古代壁画数据集上进行实验,并与最新的4种方法进行了比较,AP值比性能第二的破损检测模型提高了 15％,FID值比性能第二的孔洞修复模型降低了 4.6％.实验结果证明:所提方法改善了壁画图像修复结果,在定性和定量评估上超过现有流行方法,能够准确检测古代壁画破损区域的位置掩模,生成高逼真的壁画图像修复结果.

外文标题：Research on attention co-modulated adversarial network for image completion

外文摘要：Ancient murals,as a precious cultural heritage,have important historical,cultural,artistic,and scientific values.However,due to natural and human factors,they have suffered varying degrees of damage,including missing parts,pigment shedding,and other issues.These problems seriously affect the conservation and utilization of the murals.Conventional manual repair or duplications are performed on some ancient murals,which may cause secondary damages.Besides,manual repair has some drawbacks,such as being time-consuming,laborious,and inefficient,making it inapplicable for large-scale mural restoration.In contrast,virtual repair methods recover missing information in digital images of ancient murals without damaging the mural bodies,providing a potentially attractive non-contact solution for mural conservation.Although existing virtual repair methods have achieved remarkable results,they assume that the locations of damaged areas are known beforehand.In practice,however,it is difficult to accurately locate the damaged areas.Moreover,for damaged areas with complex structures,these methods may cause semantic errors and serious artifacts due to low modeling efficiency.To address these issues,we propose a novel attention co-modulated adversarial network for blind mural image completion.It automatically detects and repairs damaged areas in mural images,in which both multi-attention modules and gated convolutions are built to extract location masks and fill missing holes respectively.Our network consists of two stages:damage detection and hole inpainting.In the first stage,a multi-path attention module is designed and integrated with both region proposal network and branch prediction modules to estimate the location mask of damaged areas.In the second stage,with the estimated location mask available,the co-modulated generative adversarial network module incorporates with gated convolution to repair the damaged areas.More specifically,the damage detection stage includes two parts:coarse segmentation and fine segmentation.In the coarse segmentation,given a damaged mural image,the features extracted from the convolutional layer of the input damaged image are first divided into G cardinality groups to control the connection between the input and output.Each cardinality group is further divided into R subgroups to learn the features of different channels.In the fine segmentation,the feature pyramid structure is first used to fuse the extracted features by using a 1 × 1 convolution.Then the fused features are upsampled and added to the previous features by residual learning to increase the accuracy of damage detection.Moreover,a gated convolution is introduced to build the collaborative modulation-based generative adversarial network module,making the location mask into a learnable soft gating map.We created a custom mural dataset by collecting 5 500 ancient mural images mainly from the Tang Dynasty,with 4,400 images for training,550 images for validation,and 550 images for testing.We compared our model with four state-of-the-art deep learning models on ancient mural image datasets.Our model increases the average precision(AP)value by 15％compared to Mask R-CNN and reduces the Fréchet Inception Distance(FID)score by 4.6％compared to CoModGAN.Comparative experiments demonstrates our model improves the results of mural image completion and generally outperforms the existing popular methods in qualitative and quantitative evaluations.We propose a blind mural image completion model consisting of damage detection and hole inpainting.We introduce multi-path attention and gated convolution into damage detection and hole inpainting respectively.The model first employs the multi-path attention module to extract rich semantic features for rough segmentation of damaged areas,then adopts the region proposal network and prediction branch for fine segmentation,and finally employs the collaborative modulation-based generative adversarial network module via gated convolution to fill the missing areas for constructing a complete repaired mural image.The ablation study confirms the effectiveness of the multi-path attention module and gated convolution in mural image completion.Our experimental results show our method automatically identifies the precise location mask of damaged areas in mural images,produces visually realistic completed mural images.It outperforms baseline methods in both qualitative and quantitative evaluations.

外文关键词：

image completiondeep learningadversarial networkancient muralspigment shedding

作者：

章勇勤、李若彤、杜林格、苏静怡

展开 >

作者单位：

郑州大学考古与文化遗产学院,郑州 450001

西北大学信息科学与技术学院,西安 710127

东北大学秦皇岛分校计算机与通信工程学院,河北秦皇岛 066004

关键词：

图像补全深度学习对抗网络古代壁画颜料脱落

出版年：

2024

DOI：

10.3969/j.issn.1674-8425(z).2024.11.016

重庆理工大学学报

重庆理工大学

重庆理工大学学报

CSTPCD北大核心

影响因子：0.567

ISSN：1674-8425

年,卷(期)：2024.38(21)