
Computer Engineering, 2012, Vol. 38, Issue (16): 96-99. doi: 10.3969/j.issn.1000-3428.2012.16.024

• Networks and Communications •

Flow Characteristic Selection Algorithm Based on Information Metric

GUO Lei 1, WANG Ya-di 1, CHEN Shu-qiao 2, ZHU Ke 2, HAN Ji-hong 1   

  1. School of Electronic Technology, Information Engineering University, Zhengzhou 450004, China; 2. National Digital Switching System Engineering and Technological R&D Center, Zhengzhou 450001, China
  • Received: 2011-11-16 Online: 2012-08-20 Published: 2012-08-17

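  • About the authors: GUO Lei (born 1982), male, Ph.D. candidate, main research interest: network information security; WANG Ya-di and CHEN Shu-qiao, professors; ZHU Ke, associate professor; HAN Ji-hong, professor
  • Supported by:
    National High Technology Research and Development Program of China ("863" Program) (2009AA01A346)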

Abstract: This paper proposes a flow characteristic selection algorithm based on information metrics. The algorithm consists of two steps: coarse-grained selection and fine-grained selection. Coarse-grained selection computes the mutual information between each characteristic in the characteristic set and the different service categories, and chooses the characteristics that are most representative for flow classification. Fine-grained selection then computes the consistency between the characteristics already selected and eliminates the redundant ones. Experimental results show that, when the characteristics selected by the proposed algorithm are used for data flow classification, both precision and recall are higher than those of similar algorithms, and the time complexity is lower.

Key words: Deep Flow Inspection (DFI), characteristic selection, information metric, flow classification, mutual information, gain ratio
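
The two-step selection described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the formulas, so the sketch assumes discretized characteristics, ranks them by plain mutual information I(X;Y) = H(X) + H(Y) - H(X,Y) in the coarse-grained step, and uses symmetric uncertainty as a stand-in for the "consistency" measure in the fine-grained step. The function names, the parameters top_k and redundancy_threshold, and the toy characteristic names are all hypothetical.

```python
from collections import Counter
from math import log2


def entropy(values):
    """Shannon entropy, in bits, of a sequence of discrete values."""
    n = len(values)
    return -sum((c / n) * log2(c / n) for c in Counter(values).values())


def mutual_information(xs, ys):
    """I(X;Y) = H(X) + H(Y) - H(X,Y) for two discrete sequences."""
    return entropy(xs) + entropy(ys) - entropy(list(zip(xs, ys)))


def symmetric_uncertainty(xs, ys):
    """Mutual information normalized to [0, 1].

    Assumption: used here in place of the paper's unspecified
    'consistency' measure between two characteristics.
    """
    denom = entropy(xs) + entropy(ys)
    return 2.0 * mutual_information(xs, ys) / denom if denom > 0 else 0.0


def select_features(features, labels, top_k, redundancy_threshold):
    """features maps a name to a list of discrete values, one per flow."""
    # Coarse-grained step: rank by mutual information with the class
    # labels and keep the top_k most informative characteristics.
    ranked = sorted(
        features,
        key=lambda name: mutual_information(features[name], labels),
        reverse=True,
    )
    candidates = ranked[:top_k]

    # Fine-grained step: drop a candidate that is nearly identical, in
    # information terms, to a characteristic that was already kept.
    kept = []
    for name in candidates:
        if all(
            symmetric_uncertainty(features[name], features[k]) < redundancy_threshold
            for k in kept
        ):
            kept.append(name)
    return kept


# Toy data: six flows, three service categories (all values hypothetical).
labels = ["web", "web", "p2p", "p2p", "voip", "voip"]
features = {
    "pkt_size_bin": [0, 0, 1, 1, 2, 2],   # fully determines the category
    "pkt_size_copy": [0, 0, 1, 1, 2, 2],  # redundant duplicate
    "duration_bin": [0, 0, 1, 1, 1, 1],   # partly informative
    "noise": [0, 1, 0, 1, 0, 1],          # independent of the category
}
selected = select_features(features, labels, top_k=3, redundancy_threshold=0.9)
print(selected)
```

On this toy data the coarse-grained step discards the uninformative "noise" characteristic, and the fine-grained step discards the duplicate, which is the behavior the abstract attributes to the two steps. Real flow characteristics are often continuous and would need to be discretized first; that step is outside this sketch.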
