Denoising Autoencoder-Based Defensive Distillation as an Adversarial Robustness Algorithm Against Data Poisoning Attacks
Deep neural networks (DNNs) have demonstrated promising performance in handling complex real-world scenarios, in some cases surpassing human-level performance. Despite these results, DNNs are not robust against adversarial attacks; they are particularly vulnerable to data poisoning attacks, in which attackers tamper with the initial training data. Defensive distillation has shown promising results in robustifying image classification deep learning (DL) models against adversarial attacks at inference time, but distilled models remain vulnerable to data poisoning. This work combines a data denoising and reconstruction framework with a defensive distillation methodology to defend against such attacks. We leverage a denoising autoencoder (DAE) to build a data reconstruction and filtering pipeline with a carefully chosen reconstruction threshold. To assess the proposed method's performance, we injected carefully crafted adversarial examples into the initial training data. Our experimental findings demonstrate that the proposed methodology significantly reduces the vulnerability of the defensive distillation framework to data poisoning attacks.
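The abstract describes filtering poisoned training samples by thresholding a denoising autoencoder's reconstruction error before distillation. The sketch below is a minimal illustration of that idea, not the authors' implementation: the `DenoisingAutoencoder` architecture, the `filter_poisoned_samples` helper, the image shape, and the 95th-percentile threshold choice are all assumptions made for the example.

```python
import torch
import torch.nn as nn

# Hypothetical DAE for flattened 28x28 grayscale images; the paper's actual
# architecture and training procedure are not specified in the abstract.
class DenoisingAutoencoder(nn.Module):
    def __init__(self, input_dim=784, hidden_dim=128):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(input_dim, hidden_dim), nn.ReLU())
        self.decoder = nn.Sequential(nn.Linear(hidden_dim, input_dim), nn.Sigmoid())

    def forward(self, x):
        return self.decoder(self.encoder(x))


def filter_poisoned_samples(dae, images, threshold):
    """Reconstruct each image and keep only samples whose per-sample
    mean squared reconstruction error falls below the threshold."""
    dae.eval()
    with torch.no_grad():
        flat = images.view(images.size(0), -1)
        recon = dae(flat)
        errors = ((recon - flat) ** 2).mean(dim=1)  # per-sample error
    keep = errors < threshold
    return images[keep], keep


# Usage sketch: train the DAE on noisy copies of trusted clean data first,
# then set the threshold from clean-data errors (here: 95th percentile)
# and filter the possibly poisoned training set before distillation.
if __name__ == "__main__":
    dae = DenoisingAutoencoder()
    batch = torch.rand(16, 1, 28, 28)          # stand-in for training images
    clean_errors = torch.rand(1000) * 0.05     # stand-in for errors on clean data
    threshold = torch.quantile(clean_errors, 0.95).item()
    kept_images, mask = filter_poisoned_samples(dae, batch, threshold)
    print(f"kept {kept_images.size(0)} of {batch.size(0)} samples")
```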
Keywords: Deep Neural Network, Denoising Autoencoder, Defensive Distillation, Adversarial Attacks and Robustness, Data Poisoning
Bakary Badjie, Jose Cecilio, Antonio Casimiro
LASIGE, Departamento de Informática, Faculdade de Ciências da Universidade de Lisboa, Lisboa