LDH: least dependent hiding for screen-shooting resilient watermarking
Objective With the rapid development of internet and communication technology, remote desktop techniques make it possible to separate confidential information from the screen in space. However, they also introduce security risks for confidential information because of illegal screen shooting. How can illegal screen shooting be deterred and the responsible party identified? Embedding a robust watermark and revealing the hidden message from the shot image is a preferred approach. By photographing files displayed on a screen, a leaker can record information efficiently and with high quality. Such photos not only capture the useful information but also largely destroy any watermark signal the image carries, which makes the leak concealed and difficult to trace. Screen-shooting resilient watermarking is therefore a challenging subject in digital watermarking. In screen shooting, the information displayed on the screen passes to the camera through an optical channel and then through camera capture and postprocessing, which involves optical changes, digital-analog conversion, image scaling, and image distortion. Four main families of methods address this subject: key-point-based, template-based, frequency-domain-based, and deep neural network (DNN)-based methods. Both traditional and DNN-based methods offer partial solutions, but none of them balances computational complexity, image quality, and watermark robustness. The calculation of key points in key-point-based methods is too time-consuming for practical use. Template-based methods often change the cover image substantially, which degrades image quality. Watermarks generated by frequency-domain-based methods have poor robustness and can be easily destroyed. Almost all methods must correct and resize the warped image to its original size before watermark extraction, which is the main reason why the watermarks in these
methods cannot achieve robustness to clipping and scaling in practice. To solve these problems, the least dependent hiding method for screen-shooting resilient watermarking is proposed, which considers computational complexity, image quality, and robustness comprehensively. The decoder-based reveal network only needs to disclose the watermark message from the corresponding location of the container image, which guarantees the semantic consistency of the reveal network and the embedding network. The embedded watermark, such as a user name, time, or IP address, can be extracted under the screen-shooting attack or other attacks. To imitate the information loss in screen shooting, an improved noise layer is designed for training the model.

Method First, the watermark embedding network in the dependent deep hiding (DDH) framework is greatly simplified, and the Sobel operator is added to introduce the edge information of the cover image. The scaling attack operation is added to the noise layer, and the perspective distortion correction preprocessing is removed because it limits the application range of screen-shooting resilient watermarking. The existing noise layer is redefined so that the image disturbance types are randomly selected and the parameters of each disturbance type are randomly varied, which improves the sample balance and diversity of the training data for the reveal network. An investigation of previous DNN-based methods reveals that their watermark residuals visually approximate the edges of the cover images; that is, a strong correlation exists between the edges of the cover image and the invisibility of the watermark. To improve robustness and reduce computational complexity, the edge map of the cover image extracted by the Sobel operator is concatenated with the feature map of the watermark. The watermark embedding network is divided into two parts according to whether the cover image is used in the convolution, because the network part without
cover image participating in it can be computed in advance in practice. Second, the existing noise layer is modified to simulate the image scaling operation in screen shooting, so the widely used perspective distortion correction can be dropped. Following the class-balance principle, a new noise layer design is proposed in which random decision modules are added to the noise layer, making the data augmentation stronger than the original image disturbances. When training the network, learned perceptual image patch similarity (LPIPS) loss, L2 loss, and structural similarity index measure (SSIM) loss are used to constrain the visual similarity of the cover image and the container image, while information entropy loss and weighted cross-entropy loss are used to reconstruct the watermark in the form of a single-channel binary image. Model training and testing are carried out in PyTorch, and least dependent hiding (LDH) is implemented on an NVIDIA GeForce RTX 2080 Ti GPU and an Intel Core i7-9700 3.00 GHz CPU. The whole network is optimized with the Adam optimizer. The initial learning rate is set to 1e-3 and is then reduced by 90% every 20 epochs. During training, the input image resolution is 256 × 256 and the batch size is 2. A pretrained model trained without geometric transformation in the noise layer is used to initialize the model.

Result Experimental results on the DIVerse 2K (DIV2K) dataset show that the proposed noise layer is more effective than the three latest methods. The proposed method achieves the highest peak signal-to-noise ratio (PSNR) and SSIM index: with no image attacks applied, it improves PSNR by 12 dB and SSIM by 0.006 compared with the second-best method, universal deep hiding (UDH). It also ranks second in accuracy and F1 index when no image attacks are applied. Compared with the same network framework using the noise layer proposed in previous work, our algorithm achieves better indicators and higher accuracy for watermark extraction in
both modes, with and without image attacks, which shows that the proposed noise layer indeed helps training and improves the accuracy and robustness of watermark extraction. The watermark can be extracted from screen-shot images captured at distances from 10 cm to more than 50 cm, with a high extraction success rate at usual shooting distances.

Conclusion In this paper, the least dependent hiding method for screen-shooting resilient watermarking is proposed, which comprehensively balances computational complexity, image quality, and robustness. An effective improvement to the noise layer is also designed, which helps our algorithm perform better in image quality and watermark robustness. The proposed algorithm has the advantages of high embedding efficiency, high robustness, and high transparency, which gives it a wider application range than existing methods.
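
The randomized noise layer described in the Method part (disturbance type chosen at random, then its parameter drawn at random) can be illustrated with a minimal pure-Python sketch. The disturbance set (identity, scaling, Gaussian noise, brightness), the parameter ranges, and all function names are assumptions made for illustration; they are not taken from the paper, whose noise layer operates inside a PyTorch training loop.

```python
import random

def scale_nearest(img, factor):
    """Resize a 2D grayscale image (list of rows) by nearest-neighbour sampling."""
    h, w = len(img), len(img[0])
    new_h, new_w = max(1, round(h * factor)), max(1, round(w * factor))
    return [[img[min(h - 1, int(y / factor))][min(w - 1, int(x / factor))]
             for x in range(new_w)] for y in range(new_h)]

def add_gaussian_noise(img, sigma, rng):
    """Add zero-mean Gaussian noise and clip pixel values to [0, 1]."""
    return [[min(1.0, max(0.0, p + rng.gauss(0.0, sigma))) for p in row]
            for row in img]

def adjust_brightness(img, gain):
    """Multiply every pixel by a gain and clip pixel values to [0, 1]."""
    return [[min(1.0, max(0.0, p * gain)) for p in row] for row in img]

def random_noise_layer(img, rng):
    """Random decision: pick one disturbance type uniformly, then draw its parameter.

    The types and ranges below are hypothetical placeholders.
    """
    kind = rng.choice(["identity", "scale", "noise", "brightness"])
    if kind == "scale":
        return kind, scale_nearest(img, rng.uniform(0.5, 2.0))
    if kind == "noise":
        return kind, add_gaussian_noise(img, rng.uniform(0.0, 0.1), rng)
    if kind == "brightness":
        return kind, adjust_brightness(img, rng.uniform(0.7, 1.3))
    return kind, img
```

Drawing the type uniformly keeps the disturbance classes balanced across training samples, while drawing the parameter per sample adds diversity within each class. In training, such a layer would sit between the embedding network and the reveal network.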
Keywords: digital watermark; screen-to-camera channel; fully convolutional network; dependent hiding; noise layer
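
The edge-guided embedding step mentioned in the abstract (a Sobel edge map of the cover image concatenated with the watermark feature map) can be sketched in pure Python as below. This is only an illustration under stated assumptions: images are plain nested lists, border pixels are set to zero, and the helper names `sobel_edge_map` and `concat_channels` are hypothetical. In a PyTorch model the concatenation would typically be a `torch.cat` along the channel dimension.

```python
def sobel_edge_map(gray):
    """Sobel gradient magnitude of a 2D grayscale image; border pixels stay 0."""
    kx = [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]]   # horizontal gradient kernel
    ky = [[-1, -2, -1], [0, 0, 0], [1, 2, 1]]   # vertical gradient kernel
    h, w = len(gray), len(gray[0])
    edges = [[0.0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            gx = sum(kx[j][i] * gray[y + j - 1][x + i - 1]
                     for j in range(3) for i in range(3))
            gy = sum(ky[j][i] * gray[y + j - 1][x + i - 1]
                     for j in range(3) for i in range(3))
            edges[y][x] = (gx * gx + gy * gy) ** 0.5
    return edges

def concat_channels(feature_maps, edge_map):
    """Channel-wise concatenation: append the edge map as one extra channel."""
    return feature_maps + [edge_map]
```

Because the edge map depends only on the cover image, the layers that process the watermark alone never see it, which is consistent with the abstract's point that the cover-independent part of the network can be computed in advance.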