作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2021, Vol. 47 ›› Issue (7): 101-108. doi: 10.19678/j.issn.1000-3428.0058517

• 网络空间安全 • 上一篇    下一篇

基于层次时空特征与多头注意力的恶意加密流量识别

蒋彤彤1, 尹魏昕2, 蔡冰3, 张琨1   

  1. 1. 南京理工大学 计算机科学与工程学院, 南京 210094;
    2. 国家计算机网络与信息安全管理中心江苏分中心 网络安全处, 南京 210019;
    3. 国家计算机网络与信息安全管理中心江苏分中心 技术保障处, 南京 210019
  • 收稿日期:2020-06-02 修回日期:2020-07-03 发布日期:2020-07-10
  • 作者简介:蒋彤彤(1996-),女,硕士研究生,主研方向为网络安全、深度学习;尹魏昕、蔡冰,高级工程师;张琨,教授、博士、博士生导师。
  • 基金资助:
    江苏省研究生科研与实践创新计划(SJCX18_0149);南京理工大学自主科研专项(1181060420);南京理工大学横向课题(1191061083)。

Encrypted Malicious Traffic Identification Based on Hierarchical Spatiotemporal Feature and Multi-Head Attention

JIANG Tongtong1, YIN Weixin2, CAI Bing3, ZHANG Kun1   

  1. 1. School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China;
    2. Department of Network Security, Jiangsu Branch of National Computer Network and Information Security Management Center, Nanjing 210019, China;
    3. Department of Technical Support, Jiangsu Branch of National Computer Network and Information Security Management Center, Nanjing 210019, China
  • Received:2020-06-02 Revised:2020-07-03 Published:2020-07-10

摘要: 为实现互联网全面加密环境下的恶意加密流量精确检测,针对传统识别方法较依赖专家经验且对加密流量特征的区分能力不强等问题,提出一种基于层次时空特征与多头注意力(HST-MHSA)模型的端到端恶意加密流量识别方法。基于流量层次结构,结合长短时记忆网络和TextCNN有效整合加密流量的多尺度局部特征和双层全局特征,并引入多头注意力机制进一步增强关键特征的区分度。在公开数据集CICAndMal2017上的实验结果表明,HST-MHSA模型的流量识别F1值相较基准模型最高提升了16.77个百分点,漏报率比HAST-Ⅱ和HABBiLSTM模型分别降低了3.19和2.18个百分点,说明其对恶意加密流量具有更强的表征和识别能力。

关键词: 加密流量识别, 多头注意力机制, 恶意流量识别, 卷积神经网络, 长短时记忆网络

Abstract: To implement the full encryption of Internet,the accurate detection of encrypted malicious traffic is required,but traditional detection methods rely heavily on expert experience and perform poorly in distiguishment of encrypted traffic feature is not strong the representation of encrypted traffic.To address the problem,an end-to-end malicious encrypted traffic identification method based on Hierarchical Spatiotemporal feature and Multi-Head Self-Attention(HST-MHSA) model is proposed.By utilizing the hierarchical structure of traffic,the advantages of LSTM and TextCNN to integrate the multi-scale local features and two-layer global features of encrypted traffic are combined.In addition,the multi-head attention mechanism is introduced to further enhance the discrimination of the key features.Experimental results on the public dataset CICAndMal2017 show that the F1 value of HST-MHSA model is at most 16.77 percentage points higher than that of the benchmark model,and its Missed Alarm Rate(MAR) is 3.19 and 2.18 percentage points lower than that of the hierarchical model HAST-Ⅱ and HABBiLSTM model respectively,displaying its stronger ability to represent and identify encrypted malicious traffic.

Key words: encrypted traffic identification, multi-head attention mechanism, malicious traffic identification, Convolutional Neural Network(CNN), Long Short-Term Memory(LSTM) network

中图分类号: