基于BERT的多模型融合的Web攻击检测方法

doi:10.19678/j.issn.1000-3428.0068295

摘要/Abstract

摘要：

传统Web攻击检测方法准确率不高, 不能有效防范Web攻击。针对该问题, 提出一种基于变换器的双向编码器表示(BERT)的预训练模型、文本卷积神经网络(TextCNN)和双向长短期记忆网络(BiLSTM)多模型融合的Web攻击检测方法。先将HTTP请求进行预处理, 再通过BERT进行训练得到具备上下文依赖的特征向量, 并用TextCNN模型进一步提取其中的高阶语义特征, 作为BiLSTM的输入, 最后利用Softmax函数进行分类检测。在HTTP CSIC 2010和恶意URL检测两个数据集上对所提方法进行验证, 结果表明, 与支持向量机(SVM)、逻辑回归(LR)等传统的机器学习方法和现有较新的方法相比, 基于BERT的多模型融合的Web攻击检测方法在准确率、精确率、召回率和F1值指标上均表现更优(准确率和F1值的最优值都在99%以上), 能准确检测Web攻击。

关键词: Web攻击检测, 基于变换器的双向编码器表示, 多模型融合, HTTP请求, 文本卷积神经网络, 双向长短期记忆网络

Abstract:

Traditional Web attack detection methods have a low accuracy and cannot effectively prevent Web attacks.In this regard, we propose a detection method for Web attacks based on the multi-model fusion of converter-based Bidirectional Encoder Representations from Transformer(BERT) pre-training model, Text Convolutional Neural Network(TextCNN), and Bidirectional Long Short-Term Memory(BiLSTM) network. Initially, an HTTP request is preprocessed, followed by BERT training to obtain context-dependent feature vectors. Then, the TextCNN model is used to further extract higher-order semantic features as BiLSTM inputs, and the Softmax function is used for classification detection. The proposed BERT-based multi-model fusion Web attack detection method is verified using two datasets: HTTP CSIC 2010 and malicious URL detection. Compared with traditional machine learning methods, such as the Support Vector Machine(SVM), Logistic Regression(LR), and existing newer methods, the BERT-based multi-model fused Web attack detection method has better accuracy, precision, recall, and F1 value indicators, with a maximum accuracy and F1 score of more than 99%, and can better detect Web attacks.

Key words: Web attacks detection, Bidirectional Encoder Representations from Transformers(BERT), multi-model fusion, HTTP request, Text Convolutional Neural Network(TextCNN), Bidirectional Long Short-Term Memory(BiLSTM) network

袁平宇, 邱林. 基于BERT的多模型融合的Web攻击检测方法[J]. 计算机工程, 2024, 50(11): 197-206.

YUAN Pingyu, QIU Lin. Web Attacks Detection Method Based on BERT with Multi-Model Fusion[J]. Computer Engineering, 2024, 50(11): 197-206.

https://www.ecice06.com/CN/Y2024/V50/I11/197

图/表 14

图1 模型总体框架

Fig.1 Overall framework of the model

图2 数据集样例

Fig.2 Dataset sample

图3 URL路径样例

Fig.3 URL path sample

图4 BERT模型

Fig.4 BERT model

图5 Transformer-encoder框架

Fig.5 Transformer-encoder framework

图6 TextCNN模型

Fig.6 TextCNN model

图7 BiLSTM模型

Fig.7 BiLSTM model

图8 学习率对准确率的影响

Fig.8 The influence of learning rate on accuracy

图9 混淆矩阵

Fig.9 Confusion matrix

图10 t-SNE可视化

Fig.10 t-SNE visualization

参考文献 27

1	YANG C H, WU J P, LEE F Y, et al. Detection and mitigation of SYN flooding attacks through SYN/ACK packets and black/white lists. Sensors, 2023, 23 (8): 3817. doi: 10.3390/s23083817
2	ZHANG H, LI Y D, LV Z H, et al. A real-time and ubiquitous network attack detection based on deep belief network and support vector machine. CAA Journal of Automatica Sinica, 2020, 7 (3): 790- 799. doi: 10.1109/JAS.2020.1003099
3	周桥, 翟江涛, 荚东升, 等. 基于卷积门控循环神经网络的Web攻击检测方法. 广西师范大学学报(自然科学版), 2023, 41 (6): 51- 61. URL
	ZHOU Q, ZHAI J T, GU D S, et al. Web attack detection method based on convolutional gated recurrent neural network. Journal of Guangxi Normal University(Natural Science Edition), 2023, 41 (6): 51- 61. URL
4	ZHANG C, GUO R Z, MA X Y, et al. W-TextCNN: a TextCNN model with weighted word embeddings for Chinese address pattern classification. Computers, Environment and Urban Systems, 2022, 95, 101819. doi: 10.1016/j.compenvurbsys.2022.101819
5	XU G X, ZHANG Z X, ZHANG T, et al. Aspect-level sentiment classification based on attention-BiLSTM model and transfer learning. Knowledge-Based Systems, 2022, 245, 108586.
6	ANDERSON J P. Computer security threat monitoring and surveillance[EB/OL]. [2023-10-10]. http://shodh.inflibnet.ac.in/bitstream/123456789/3388/7/07_refrences.pdf.
7	YE Z W, SUN Y H, SUN S, et al. Research on network intrusion detection based on support vector machine optimized with grasshopper optimization algorithm[C]//Proceedings of the 10th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications(IDAACS). Washington D. C., USA: IEEE Press, 2019: 378-383.
8	DEVAN P, KHARE N. An efficient XGBoost—DNN-based classification model for network intrusion detection system. Neural Computing and Applications, 2020, 32 (16): 12499- 12514. doi: 10.1007/s00521-020-04708-x
9	WEN L B. Cloud computing intrusion detection technology based on BP-NN. Wireless Personal Communications, 2022, 126 (3): 1917- 1934. doi: 10.1007/s11277-021-08569-y
10	GAUTAM S, HENRY A, ZUHAIR M, et al. A composite approach of intrusion detection systems: hybrid RNN and correlation-based feature optimization. Electronics, 2022, 11 (21): 3529. doi: 10.3390/electronics11213529
11	许丹丹, 徐阳, 张思聪, 等. 基于DCNN-GRU模型的XSS攻击检测方法. 计算机应用于软件, 2022, 39 (2): 324- 329. doi: 10.3969/j.issn.1000-386x.2022.02.051
	XU D D, XU Y, ZHANG S C, et al. XSS attack detection method based on DCNN-GRU model. Computer Application in Software, 2022, 39 (2): 324- 329. doi: 10.3969/j.issn.1000-386x.2022.02.051
12	SEYYAR Y E, YAVUZ A G, UNVER H M. An attack detection framework based on BERT and deep learning. IEEE Access, 2022, 10, 68633- 68644. doi: 10.1109/ACCESS.2022.3185748
13	ELUBEYD H, YILTAS-KAPLAN D. Hybrid deep learning approach for automatic DoS/DDoS attacks detection in software-defined networks. Applied Sciences, 2023, 13 (6): 3828. doi: 10.3390/app13063828
14	侯钰涛, 阿布都克力木·阿布力孜, 哈里旦木·阿布都克里木. 中文预训练模型研究进展. 计算机科学, 2022, 49 (7): 148- 163. doi: 10.11896/jsjkx.211200018
	HOU Y T, Abulizi Abudukelimu, Abudukelimu Halidanmu. Research progress on Chinese pre-training models. Computer Science, 2022, 49 (7): 148- 163. doi: 10.11896/jsjkx.211200018
15	江魁, 余志航, 陈小雷, 等. 基于BERT-CNN的Webshell流量检测系统设计与实现. 计算机应用, 2023, 43 (S1): 126- 132.
	JIANG K, YU Z H, CHEN X L, et al. Design and implementation of a Webshell traffic detection system based on BERT-CNN. Computer Applications, 2023, 43 (S1): 126- 132.
16	张玉帅, 赵欢, 李博. 基于BERT和BiLSTM的语义槽填充. 计算机科学, 2021, 48 (1): 247- 252. doi: 10.11896/jsjkx.191200088
	ZHANG Y S, ZHAO H, LI B. Semantic slot filling based on BERT and BiLSTM. Computer Science, 2021, 48 (1): 247- 252. doi: 10.11896/jsjkx.191200088
17	李德玉, 罗锋, 王素格. 融合CNN和标签特征的中文文本情绪多标签分类. 山西大学学报(自然科学版), 2020, 43 (1): 65- 71. doi: 10.13451/j.sxu.ns.2018138
	LI D Y, LUO F, WANG S G. A multi-label emotion classification method for Chinese text based on CNN and tag features. Journal of Shanxi University(Natural Science Edition), 2020, 43 (1): 65- 71. doi: 10.13451/j.sxu.ns.2018138
18	KIM Y. Convolutional neural networks for sentence classification[C]//Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing(EMNLP). [S. l.]: Association for Computational Linguistics, 2014: 1746-1751.
19	郝欣达. 基于深度学习的WAF防火墙的设计与实现[D]. 北京: 北京邮电大学, 2020.
	HAO X D. Design and implementation of WAF firewall based on deep learning[D]. Beijing: Beijing University of Posts and Telecommunications, 2020. (in Chinese)
20	DAWADI B R, ADHIKARI B, SRIVASTAVA D K. Deep learning technique-enabled Web application firewall for the detection of Web attacks. Sensors, 2023, 23 (4): 2073. URL
21	常江. 基于带注意力机制LSTM的Web攻击检测技术研究[D]. 太原: 中北大学, 2022.
	CHANG J. Research on Web attack detection technology based on LSTM with attention mechanism[D]. Taiyuan: North Central University, 2022. (in Chinese)
22	TEKEREK A. A novel architecture for Web-based attack detection using convolutional neural network. Computers & Security, 2021, 100, 102096.
23	STIAWAN D, BARDADI A, AFIFAH N, et al. An improved LSTM-PCA ensemble classifier for SQL injection and XSS attack detection. Computer Systems Science and Engineering, 2023, 46 (2): 1759- 1774.
24	MAC H, TRUONG D, NGUYEN L, et al. Detecting attacks on Web applications using autoencoder[C]//Proceedings of the 9th International Symposium on Information and Communication Technology. New York, USA: ACM Press, 2018: 416-421.
25	刘吉会, 何成万. 基于ECA规则和动态污点分析的SQL注入攻击在线检测. 计算机应用, 2023, 43 (5): 1534- 1542.
	LIU J H, HE C W. Online detection of SQL injection attacks based on ECA rules and dynamic taint analysis. Journal of Computer Applications, 2023, 43 (5): 1534- 1542.
26	巫家宏, 杨振国, 刘文印. 基于多尺度特征融合的恶意HTTP请求检测方法. 计算机应用研究, 2021, 38 (3): 871-874, 880.
	WU J H, YANG Z G, LIU W Y. Malicious HTTP request detection method based on multi-scale feature fusion. Computer Application Research, 2021, 38 (3): 871-874, 880.
27	MAATEN L, HINTON G. Visualizing data using t-SNE. Journal of Machine Research, 2008, 9, 2625- 2679.

[1]	党小超, 刘涧, 董晓辉, 祝忠彦, 李芬芳. 面向不平衡数据的机械设备故障命名实体识别[J]. 计算机工程, 2024, 50(9): 104-112.
[2]	屈潇雅, 李兵, 温立强. 面向行政执法案件文本的事件抽取研究[J]. 计算机工程, 2024, 50(9): 63-71.
[3]	徐晓滨, 张云硕, 施凡, 常雷雷, 陶志刚. 基于特征匹配度与异类子模型融合的安全性评估方法[J]. 计算机工程, 2024, 50(8): 113-122.
[4]	周昭辰, 方清茂, 吴晓红, 胡平, 何小海. 基于MacBERT与对抗训练的机器阅读理解模型[J]. 计算机工程, 2024, 50(5): 41-50.
[5]	虞秋辰, 周若华, 袁庆升. 基于Ghost-SE-Res2Net的多模型融合语音唤醒词检测方法[J]. 计算机工程, 2024, 50(3): 52-59.
[6]	邵良杉, 赵松泽. 基于多模型融合的不完整数据分数插补算法[J]. 计算机工程, 2023, 49(9): 79-88, 98.
[7]	陈梦萱, 陈艳平, 扈应, 黄瑞章, 秦永彬. 基于词义增强的生物医学命名实体识别方法[J]. 计算机工程, 2023, 49(10): 305-312.
[8]	王曙燕, 原柯. 基于RoBERTa-WWM的大学生论坛情感分析模型[J]. 计算机工程, 2022, 48(8): 292-298,305.
[9]	李世宝, 李贺, 赵庆帅, 殷乐乐, 刘建航, 黄庭培. 融合外部语义知识的中文文本蕴含识别[J]. 计算机工程, 2021, 47(1): 44-49.
[10]	马喆康, 迪力亚尔·帕尔哈提, 早克热·卡德尔, 吐尔根·依布拉音, 西尔艾力·色提, 艾山·吾买尔. 一种集成深度学习模型的旅游问句文本分类算法[J]. 计算机工程, 2020, 46(11): 70-76.

选择文件类型/文献管理软件名称

选择包含的内容