中国科学:信息科学(英文版)2024,Vol.67Issue(8) :234-248.DOI:10.1007/s11432-023-4010-4

Identifying malicious traffic under concept drift based on intraclass consistency enhanced variational autoencoder

Xiang LUO Chang LIU Gaopeng GOU Gang XIONG Zhen LI Binxing FANG
中国科学:信息科学(英文版)2024,Vol.67Issue(8) :234-248.DOI:10.1007/s11432-023-4010-4

Identifying malicious traffic under concept drift based on intraclass consistency enhanced variational autoencoder

Xiang LUO 1Chang LIU 1Gaopeng GOU 1Gang XIONG 1Zhen LI 1Binxing FANG2
扫码查看

作者信息

  • 1. Institute of Information Engineering,Chinese Academy of Sciences,Beijing 100085,China;School of Cyber Security,University of Chinese Academy of Sciences,Beijing 100049,China
  • 2. School of Computer Science and Technology,Harbin Institute of Technology,Shenzhen,Shenzhen 518055,China
  • 折叠

Abstract

Accurate identification of malicious traffic is crucial for implementing effective defense counter-measures and has led to extensive research efforts.However,the continuously evolving techniques employed by adversaries have introduced the issues of concept drift,which significantly affects the performance of existing methods.To tackle this challenge,some researchers have focused on improving the separability of malicious traffic representation and designing drift detectors to reduce the number of false positives.Nevertheless,these methods often overlook the importance of enhancing the generalization and intraclass consistency in the representation.Additionally,the detectors are not sufficiently sensitive to the variations among different malicious traffic classes,which results in poor performance and limited robustness.In this paper,we propose intraclass consistency enhanced variational autoencoder with Class-Perception detector(ICE-CP)to identify malicious traffic under concept drift.It comprises two key modules during training:intraclass consistency enhanced(ICE)representation learning and Class-Perception(CP)detector construc-tion.In the first module,we employ a variational autoencoder(VAE)in conjunction with Kullback-Leibler(KL)-divergence and cross-entropy loss to model the distribution of each input malicious traffic flow.This approach simultaneously enhances the generalization,interclass consistency,and intraclass differences in the learned representation.Consequently,we obtain a compact representation and a trained classifier for non-drifting malicious traffic.In the second module,we design the CP detector,which generates a centroid and threshold for each malicious traffic class separately based on the learned representation,depicting the boundaries between drifting and non-drifting malicious traffic.During testing,we utilize the trained classi-fier to predict malicious traffic classes for the testing samples.Then,we use the CP detector to detect the potential drifting samples using the centroid and threshold defined for each class.We evaluate ICE-CP and some advanced methods on various real-world malicious traffic datasets.The results show that our method outperforms others in identifying malicious traffic and detecting potential drifting samples,demonstrating outstanding robustness among different concept drift settings.

Key words

concept drift/malicious traffic identification/variational autoencoder/intrusion detection/cy-berspace security

引用本文复制引用

基金项目

National Key Research and Development Program of China(2021YFB3101400)

出版年

2024
中国科学:信息科学(英文版)
中国科学院

中国科学:信息科学(英文版)

CSTPCDEI
影响因子:0.715
ISSN:1674-733X
段落导航相关论文