Adversarial example classification algorithm based on generative self-supervised learning

Abstract: Adversarial examples are often regarded as a threat to the robustness of deep learning models, and defense techniques such as adversarial training have been developed to mitigate their impact on label prediction. However, existing adversarial training tends to reduce the generalization accuracy of the classification network, degrading its performance on the original examples. To address this, an adversarial example classification algorithm based on generative self-supervised learning is proposed. A generative model is trained through self-supervised learning to capture the latent features of image data, and this model is then used to screen the features of adversarial examples. The information that is useful for classification is fed back to the classification model. Finally, joint learning completes end-to-end global training, further improving the generalization accuracy of the classification model. Experimental results on the MNIST, CIFAR10, and CIFAR100 datasets show that, compared with standard training, the proposed algorithm increases classification accuracy by 0.06%, 1.34%, and 0.89%, respectively, reaching 99.70%, 84.34%, and 63.65%. These results show that the algorithm overcomes the inherent drawback of traditional adversarial training, namely reduced generalization performance, and further improves the accuracy of the classification network.
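As a reading aid, the standard-training baselines implied by the reported figures can be recovered by subtraction. The sketch below assumes the reported gains are absolute percentage points, which the abstract does not state explicitly; the names `reported` and `implied_baseline` are illustrative and not taken from the paper.

```python
from decimal import Decimal

# (accuracy reached, gain over standard training), both in percent,
# copied from the abstract. Decimal avoids binary floating-point noise.
reported = {
    "MNIST": (Decimal("99.70"), Decimal("0.06")),
    "CIFAR10": (Decimal("84.34"), Decimal("1.34")),
    "CIFAR100": (Decimal("63.65"), Decimal("0.89")),
}

# Assumption: gains are absolute percentage points, so the
# standard-training baseline is simply accuracy minus gain.
implied_baseline = {
    name: accuracy - gain for name, (accuracy, gain) in reported.items()
}

for name, baseline in implied_baseline.items():
    print(f"{name}: implied standard-training accuracy {baseline}%")
```

If the gains were instead relative improvements, the baselines would differ slightly, so these values are an inference and not results reported by the authors.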

Keywords:

adversarial examples; self-supervised learning; image classification; generative model

Authors:

Yang Fan (阳帆), Wei Xian (魏宪), Guo Jielong (郭杰龙), Zheng Jianzhang (郑建漳), Lan Hai (兰海)

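Author affiliations:

School of Advanced Manufacturing, Fuzhou University, Quanzhou, Fujian 362251

Fujian Institute of Research on the Structure of Matter, Chinese Academy of Sciences, Fuzhou, Fujian 350108

Quanzhou Institute of Equipment Manufacturing, Haixi Institutes, Chinese Academy of Sciences, Quanzhou, Fujian 362216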

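Funding:

Fujian Provincial Science and Technology Program; Fujian Provincial Science and Technology Program; Quanzhou Science and Technology Program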

Project numbers:

2021T3068; 2021T3003; 2021C065L

Publication year:

2024

DOI:

10.19304/J.ISSN1000-7180.2023.0114

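Microelectronics & Computer (微电子学与计算机)

The 771st Research Institute of the Ninth Academy, China Aerospace Science and Technology Corporation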

CSTPCD

Impact factor: 0.431

ISSN: 1000-7180

Year, volume (issue): 2024, 41(2)

Number of references: 24