作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2012, Vol. 38 ›› Issue (17): 171-173. doi: 10.3969/j.issn.1000-3428.2012.17.047

• 人工智能及识别技术 • 上一篇    下一篇

一种基于SVM后验概率的网络流量识别方法

刘三民1,2,王彩霞3,孙知信4   

  1. (1. 安徽工程大学计算机与信息学院,安徽 芜湖 241000; 2. 南京航空航天大学计算机科学与技术学院,南京 210016; 3. 安徽商贸职业技术学院电子信息工程系,安徽 芜湖 241002 4. 南京邮电大学宽带无线通信与传感网技术教育部重点实验室,南京 210003)
  • 收稿日期:2011-10-17 修回日期:2011-12-21 出版日期:2012-09-05 发布日期:2012-09-03
  • 作者简介:刘三民(1978-),男,讲师、博士研究生,主研方向:模式识别,流量检测;王彩霞,讲师、硕士;孙知信,教授、博士后
  • 基金资助:
    国家自然科学基金资助项目(60973140);江苏省自然科学基金资助项目(BK2009425);安徽省高等学校青年教师科研资助计划基金资助项目(2012SQRL220)

A Network Flow Identification Method Based on SVM Posteriori Probability

LIU San-min 1,2, WANG Cai-xia 3, SUN Zhi-xin 4   

  1. (1. College of Computer and Information, Anhui Polytechnic University, Wuhu 241000, China; 2. College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China; 3. Department of Electronic Information Engineering, Anhui Business College of Vocational Technology, Wuhu 241002, China; 4. Key Laboratory of Broadband Wireless Communication and Sensor Network Technology, Ministry of Education, Nanjing University of Posts and Telecommunications, Nanjing 210003, China)
  • Received:2011-10-17 Revised:2011-12-21 Online:2012-09-05 Published:2012-09-03

摘要: 为解决网络样本标注的难题,实现多种网络流量环境中的主动学习,提出一种基于支持向量机后验概率的网络流量识别方法。结合支持向量机输出和Sigmoid函数拟合样本所属类别后验概率,用其中较大的2类概率信息熵值衡量样本影响分,借助支持向量机和不确定性采样策略实现主动学习过程,形成流量识别模型。实验结果表明,该方法能取得较好的识别效果。

关键词: 流量识别, 主动学习, 支持向量机, 熵, 不确定性采样, 后验概率

Abstract: In order to solve the crux for sample’s label and implement active learning in network environment, the network flow identification method is presented by using Support Vector Machine(SVM) with posteriori probability. The sample’s posteriori probability is got by the output of SVM and Sigmoid function. It uses the larger of the two 2 class probability information entropy to measure the sample Effect Score(ES). By means of SVM and uncertainty sampling strategy, it realizes the active learning process, and traffic identification’s model is formed. Experimental results show that the method can achieve better identification result.

Key words: flow identification, active learning, Support Vector Machine(SVM), entropy, uncertainty sample, posteriori probability

中图分类号: