作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2010, Vol. 36 ›› Issue (16): 68-70. doi: 10.3969/j.issn.1000-3428.2010.16.025

• 网络与通信 • 上一篇    下一篇

IP流量分类算法中特征选择作用分析

黄君毅,吴 静,张 晖   

  1. (西南科技大学信息工程学院,绵阳 621010)
  • 出版日期:2010-08-20 发布日期:2010-08-17
  • 作者简介:黄君毅(1983-),男,硕士研究生,主研方向:网络流量分类,机器学习;吴 静,副教授、硕士;张 晖,教授、博士
  • 基金资助:
    国家“863”计划基金资助项目(2007AA01Z151)

Analysis of Feature Selection Effect on IP Traffic Classification Algorithms

HUANG Jun-yi, WU Jing, ZHANG Hui   

  1. (College of Information Engineering, Southwest University of Science and Technology, Mianyang 621010)
  • Online:2010-08-20 Published:2010-08-17

摘要: 基于流的特征并使用机器学习技术进行网络流量分类是目前网络流量分类的主流技术。由于许多流的特征可用于流分类,其中有许多是不相关和冗余的特征,因此特征选择对算法性能的优化具有重要的作用。将基于过滤的特征选择方法应用于C4.5、Bayesnet、NBD、NBK等分类算法,实验结果表明该方法在无损于分类准确性的同时能够改进计算性能。

关键词: 特征选择, IP流量分类, 机器学习

Abstract: The current study is to use Machine Learning(ML) techniques and classify Internet traffic based on per-flow features. Since a lot flow features can be used for flow classification and there are many irrelevant and redundant features among them, feature selection plays a vital role in algorithm performance optimization. This paper uses two filter-based feature selection methods for classification algorithms such as C4.5, Bayesnet, NBD, NBK. Experimental results show the approach can improve computational performance without negative impact on classification accuracy.

Key words: feature selection, IP traffic classification, Machine Learning(ML)

中图分类号: