Anomaly Detection of Network Traffic Based on Autoencoder

As network traffic grows more complex and its data structures gain dimensions, existing traffic anomaly detection schemes suffer from high false positive rates, low efficiency, and poor practicality. To address these problems, this paper proposes an autoencoder-based network traffic anomaly detection model. The model first uses the random forest algorithm to extract features from network traffic and select the optimal feature set, then uses a hierarchical clustering algorithm to divide the feature vector set into several subsets, which reduces the feature dimension. The autoencoder then processes the feature subsets in parallel and computes the root-mean-square error (RMSE); the maximum average RMSE over multiple experiments is defined as the normal-traffic threshold. Anomalous traffic is identified from the average RMSE of the test data and this threshold. Experimental results show that the model's recall is, on average, 4.3 percentage points higher than that of traditional anomaly detection methods, and its running time is about 37% lower.
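
The thresholding rule described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the names `rmse`, `average_rmse`, `fit_threshold`, `is_anomalous`, and `reconstruct` are hypothetical, the trained autoencoder is replaced by a stand-in that always returns a fixed "normal" profile, the data are synthetic, and flagging a batch when its average RMSE exceeds the threshold is an assumed reading of how the two values are compared.

```python
import math
import random
from statistics import mean


def rmse(x, x_hat):
    """Root-mean-square error between a feature vector and its reconstruction."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, x_hat)) / len(x))


def average_rmse(batch, reconstruct):
    """Mean reconstruction RMSE over a batch of feature vectors."""
    return mean(rmse(x, reconstruct(x)) for x in batch)


def fit_threshold(normal_runs, reconstruct):
    """Largest average RMSE observed over several runs on normal traffic."""
    return max(average_rmse(run, reconstruct) for run in normal_runs)


def is_anomalous(batch, reconstruct, threshold):
    """Assumed decision rule: flag the batch if its average RMSE exceeds the threshold."""
    return average_rmse(batch, reconstruct) > threshold


# Stand-in for a trained autoencoder (assumption): it always reconstructs
# the same "normal" profile, so traffic far from that profile has high RMSE.
NORMAL_PROFILE = [0.5, 0.5, 0.5, 0.5]


def reconstruct(x):
    return list(NORMAL_PROFILE)


rng = random.Random(0)  # fixed seed so the synthetic data are reproducible


def sample_batch(center, spread, n=200):
    """Synthetic feature vectors drawn around `center` (illustrative data only)."""
    return [[rng.gauss(center, spread) for _ in NORMAL_PROFILE] for _ in range(n)]


# Several runs on normal traffic define the threshold.
normal_runs = [sample_batch(0.5, 0.05) for _ in range(5)]
threshold = fit_threshold(normal_runs, reconstruct)

# A shifted batch plays the role of attack traffic.
attack_test = sample_batch(0.9, 0.05)
print("threshold:", threshold)
print("attack batch flagged:", is_anomalous(attack_test, reconstruct, threshold))
```

In a fuller pipeline, `reconstruct` would be the forward pass of an autoencoder trained on normal traffic, applied to each feature subset. How the per-subset RMSE values are combined is not stated in the abstract, so the sketch leaves it out.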

anomaly detection; autoencoder; hierarchical clustering; random forest algorithm

Lü Meijing (吕美静), Nian Mei (年梅), Zhang Jun (张俊), Fu Lusen (付鲁森)

College of Computer Science and Technology, Xinjiang Normal University, Urumqi, Xinjiang 830054

Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi, Xinjiang 830011

2024

Computer and Modernization
Jiangxi Computer Society; Jiangxi Institute of Computing Technology

CSTPCD
Impact factor: 0.472
ISSN: 1006-2475
Year, volume (issue): 2024, (12)