作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2012, Vol. 38 ›› Issue (16): 164-166. doi: 10.3969/j.issn.1000-3428.2012.16.042

• 人工智能及识别技术 • 上一篇    下一篇

基于贝叶斯学习的集成流量分类方法

汪为汉 a,唐学文 b,邓一贵 b   

  1. (重庆大学 a. 计算机学院;b. 信息与网络管理中心,重庆 400030)
  • 收稿日期:2011-10-18 修回日期:2011-12-12 出版日期:2012-08-20 发布日期:2012-08-17
  • 作者简介:汪为汉(1986-),男,硕士研究生,主研方向:流量分类,机器学习;唐学文,高级工程师;邓一贵,博士

Integrated Traffic Classification Method Based on Bayes Learning

WANG Wei-han a, TANG Xue-wen b, DENG Yi-gui b   

  1. (a. College of Computer; b. Center of Information and Network Management, Chongqing University, Chongqing 400030, China)
  • Received:2011-10-18 Revised:2011-12-12 Online:2012-08-20 Published:2012-08-17

摘要: NB方法条件独立性假设和BAN方法小训练集难以建模。为此,提出一种基于贝叶斯学习的集成流量分类方法。构造单独的NB和BAN分类器,在此基础上利用验证集得到各分类器的权重,通过加权平均组合各分类器的输出,实现网络流量分类。以Moore数据集为实验数据,并与NB方法和BAN方法相比较,结果表明,该方法具有更高的分类准确率和稳定性。

关键词: 流量分类, 朴素贝叶斯, 贝叶斯网络增广朴素贝叶斯, 实例选择, 加权

Abstract: It is difficult to model with the conditional independence assumptions of Naive Bayes(NB) method and the small training set of Bayes Network Augmented Na?ve Bayes(BAN) approach. In order to solve this problem, a new classification method is proposed in this paper. This is a combined traffic classification based on instance-based learning. It constructs a separate NB and BAN classifiers and obtains each classifier weight according to the validation set. It obtains the classification of network traffic through weighted average combination of classifier output. Using Moore data set as the experimental data, results show that the ensemble learning method rather than NB method and BAN method has higher classification accuracy and stability.

Key words: traffic classification, Na?ve Bayes(NB), Bayes Network Augmented Na?ve Bayes(BAN), instance selection, weighing

中图分类号: