武汉大学自然科学学报(英文版)2023,Vol.28Issue(1) :35-44.DOI:10.1051/wujns/2023281035

Adversarial Example Generation Method Based on Sensitive Features

WEN Zerui SHEN Zhidong SUN Hui QI Baiwen
武汉大学自然科学学报(英文版)2023,Vol.28Issue(1) :35-44.DOI:10.1051/wujns/2023281035

Adversarial Example Generation Method Based on Sensitive Features

WEN Zerui 1SHEN Zhidong 2SUN Hui 3QI Baiwen3
扫码查看

作者信息

  • 1. Key Laboratory of Aerospace Information Security and Trusted Computing,Ministry of Education,School of Cyber Science and Engineering,Wuhan University,Wuhan 430079,Hubei,China
  • 2. Key Laboratory of Aerospace Information Security and Trusted Computing,Ministry of Education,School of Cyber Science and Engineering,Wuhan University,Wuhan 430079,Hubei,China;Engineering Research Center of Cyberspace,Yunnan University,Kunming 650504,Yunnan,China
  • 3. Zhongnan Hospital,Wuhan University,Wuhan 430072,Hubei,China
  • 折叠

Abstract

As deep learning models have made remarkable strides in numerous fields,a variety of adversarial attack methods have emerged to interfere with deep learning models.Adversarial examples apply a minute perturbation to the original image,which is incon-ceivable to the human but produces a massive error in the deep learning model.Existing attack methods have achieved good results when the network structure is known.However,in the case of unknown network structures,the effectiveness of the attacks still needs to be im-proved.Therefore,transfer-based attacks are now very popular because of their convenience and practicality,allowing adversarial samples generated on known models to be used in attacks on unknown models.In this paper,we extract sensitive features by Grad-CAM and pro-pose two single-step attacks methods and a multi-step attack method to corrupt sensitive features.In two single-step attacks,one corrupts the features extracted from a single model and the other corrupts the features extracted from multiple models.In multi-step attack,our method improves the existing attack method,thus enhancing the adversarial sample transferability to achieve better results on unknown models.Our method is also validated on CIFAR-10 and MINST,and achieves a l%-3%improvement in transferability.

Key words

deep learning model/adversarial example/transferability/sensitive characteristics/AI security

引用本文复制引用

基金项目

Key R&D Projects in Hubei Province(2022BAA041)

Key R&D Projects in Hubei Province(2021BCA124)

Open Foundation of Engineering Research Center of Cyberspace(KJAQ202112002)

出版年

2023
武汉大学自然科学学报(英文版)
武汉大学

武汉大学自然科学学报(英文版)

CSTPCDCSCD北大核心
影响因子:0.066
ISSN:1007-1202
参考文献量26
段落导航相关论文