首页|基于BERT和XGBoost的Webshell检测方案

基于BERT和XGBoost的Webshell检测方案

扫码查看
Web服务的后门程序Webshell是黑客攻击的常用手段.传统的检测方法在检测经过变种、混淆加密的Web-shell后门时存在漏检率和误报率较高的缺陷.为解决这一问题,融合BERT和XGBoost的特性设计了一种新的检测方法,能极大地提升Webshell后门程序的检测准确率.在检测中把经过预处理的Webshell样本文件使用BERT模型提取词向量特征,并使用集成学习算法XGBoost进行分类训练,得到一个较优的检测模型,最后利用该模型能有效的实现各种Webshell恶意程序检测.相对于基于传统机器学习的检测模型,我们提出的综合Webshell检测方法在精确度、查全率和F1 值等各项指标上均展现出优异的性能,其检测的准确性高达97.75%.
Webshell detection scheme based on BERT and XGBoost
Webshell,the backdoor program of Web services,is a common means of hacker attack.The traditional detection meth-ods have the defects of high missed detection rate and false positive rate when detecting the Webshell backdoor which is mutated and confused-encrypted.To solve this problem,this paper integrated BERT and XGBoost features to design a new detection method,which could greatly improve the detection accuracy of Webshell backdoor program.In the detection,the word vector fea-tures were extracted from the preprocessed Webshell sample files using BERT model,and the integrated learning algorithm XG-Boost was used for classification training,so as to obtain an optimal detection model.Finally,the model could effectively detect various Webshell malicious programs.Compared with the detection model based on traditional machine learning algorithm,the proposed fusion Webshell detection method had better performance in the aspects of precision,recall and F1 value,and the de-tection accuracy reached 97.75%.

WebshellBERTXGBoostfeature extraction

张育铭、李浩华、郭现峰

展开 >

西南民族大学计算机科学与工程学院,四川 成都 610041

Webshell BERT XGBoost 特征提取

四川省教育厅重点项目四川省科技厅项目中央高校基本科研业务费专项西南民族大学项目

18ZA05122017JY02302021NYYXS54

2024

西南民族大学学报(自然科学版)
西南民族大学

西南民族大学学报(自然科学版)

CSTPCD
影响因子:0.441
ISSN:2095-4271
年,卷(期):2024.50(2)
  • 15