中国科学:信息科学(英文版)2024,Vol.67Issue(3) :58-74.DOI:10.1007/s11432-022-3671-9

Mitigate noisy data for smart IoT via GAN based machine unlearning

Zhuo MA Yilong YANG Yang LIU Xinjing LIU Jianfeng MA
中国科学:信息科学(英文版)2024,Vol.67Issue(3) :58-74.DOI:10.1007/s11432-022-3671-9

Mitigate noisy data for smart IoT via GAN based machine unlearning

Zhuo MA 1Yilong YANG 1Yang LIU 1Xinjing LIU 1Jianfeng MA2
扫码查看

作者信息

  • 1. School of Cyber Engineering,Xidian University,Xi'an 710071,China
  • 2. School of Cyber Engineering,Xidian University,Xi'an 710071,China;State Key Laboratory of Integrated Services Networks(ISN),Xi'an 710071,China
  • 折叠

Abstract

With the development of IoT applications,machine learning dramatically improves the utility of variable IoT systems such as autonomous driving.Although the pretrain-finetune framework can cope well with data heterogeneity in complex IoT scenarios,the data collected by sensors often contain unexpected noisy data,e.g.,out-of-distribution(OOD)data,which leads to the reduced performance of fine-tuned models.To resolve the problem,this paper proposes MuGAN,a method that can mitigate the side-effect of OOD data via the generative adversarial network(GAN)-based machine unlearning.MuGAN follows a straightforward but effective idea to mitigate the performance loss caused by OOD data,i.e.,"flashbacking"the model to the condition where OOD data are excluded from model training.To achieve the goal,we design an adversarial game,where a discriminator is trained to identify whether a sample belongs to the training set by observing the confidence score.Meanwhile,a generator(i.e.,the target model)is updated to fool the discriminator into believing that the OOD data are not included in the training set but others do.The experimental results show that benefiting from the high unlearning rate(more than 90%)and retention rate(99%),MuGAN succeeds in lowering the model performance degradation caused by OOD data from 5.88%to 0.8%.

Key words

machine unlearning/generative adversarial network/out of distribution data/Internet of Thing/neural network

引用本文复制引用

基金项目

国家重点研发计划(2022YFB3103500)

国家自然科学基金(U21A20464)

国家自然科学基金(61872283)

Natural Science Basic Research Program of Shaanxi Province(2021JC-22)

陕西省重点研发计划(2022GY-029)

高等学校学科创新引智计划(111计划)(B16037)

出版年

2024
中国科学:信息科学(英文版)
中国科学院

中国科学:信息科学(英文版)

CSTPCDEI
影响因子:0.715
ISSN:1674-733X
参考文献量49
段落导航相关论文