作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2018, Vol. 44 ›› Issue (5): 291-295. doi: 10.19678/j.issn.1000-3428.0046769

• 开发研究与工程应用 • 上一篇    下一篇

马尔科夫模型在网络流量分类中的应用与研究

赵英 a,韩春昊 b   

  1. 北京化工大学 a.信息中心; b.信息科学与技术学院,北京 100029
  • 收稿日期:2017-04-13 出版日期:2018-05-15 发布日期:2018-05-15
  • 作者简介:赵英(1966—),男,教授、博士,主研方向为网络安全、并行计算;韩春昊(通信作者),硕士研究生。
  • 基金资助:

    中央高校基本科研业务费专项资金(PT1612)。

Application and Research of Markov Model in Network Traffic Classification

ZHAO Ying a,HAN Chunhao b   

  1. a.Information Center; b.College of Informatica Science and Technology,Beijing University of Chemical Technology,Beijing 100029,China
  • Received:2017-04-13 Online:2018-05-15 Published:2018-05-15

摘要:

传统的端口号与深度包检测分类技术已不能满足网络中各类应用的分类要求,无法进行准确分类。为此,提出一种基于半监督学习的马尔科夫模型网络流量分类算法。利用流之间的相关性构建马尔科夫模型,采用密度计算的方法估计聚类的中心点,通过KL距离计算中心点与样本之间的相似度,将样本划分到不同的应用类型中。使用马尔科夫模型提取特征参数,用以识别流量应用类型,并提高准确度,解决传统的基于半监督学习的流量分类方法依赖不稳定聚类算法的问题。实验结果表明,使用该方法机器学习得到的网络流量分类器可以取得理想的分类效果。

关键词: 网络流量分类, 马尔科夫模型, 相似度计算, 半监督学习, 流相关性, 样本密度, 聚类算法, 相对熵

Abstract:

With the development of information science and technology,the traditional port number and depth packet detection classification technology can not meet the classification requirements of various applications in the network,and can not be classified accurately.A Markov model network traffic classification algorithm based on semi-supervised learning is proposed.The Markov model is constructed by the correlation between flows.The center of the clustering is estimated by density calculation.The center point is calculated by KL distance.The similarity between samples is divided into different application types.The feature of the Markov model is used to identify the traffic application type and improve the accuracy.The problem of the traditional traffic classification method based on semi-supervised learning depends on the unstable clustering algorithm.Experimental results show that the network traffic classifier can achieve the ideal classification effect.

Key words: network traffic classification, Markov model, similarity calculation, semi-supervised learning, correlation of flow, sample density, clustering algorithm, relative entropy

中图分类号: