作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2008, Vol. 34 ›› Issue (12): 154-156. doi: 10.3969/j.issn.1000-3428.2008.12.054

• 安全技术 • 上一篇    下一篇

基于内容的垃圾短信过滤

李 辉1,2, 张 琦3,卢湖川3   

  1. (1. 大连理工大学管理学院,大连 116023;2.中国移动辽宁公司,沈阳 110179; 3. 大连理工大学,大连 116023)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2008-06-20 发布日期:2008-06-20

Junk SMS Filtering Based on Context

LI Hui 1,2, ZHANG Qi 3, LU Hu-chuan 3   

  1. (1. School of Management, Dalian University of Technology, Dalian 116023; 2. Liaoning Mobile Communications Limited Liability Company, Shenyang 110179; 3. Dalian University of Technology, Dalian 116023)
  • Received:1900-01-01 Revised:1900-01-01 Online:2008-06-20 Published:2008-06-20

摘要: 研究一种基于最小风险贝叶斯决策的垃圾短信过滤方法。对于以文本信息为主的短信,采用信息增益的方法进行特征选择,使用基于最小风险贝叶斯决策方法进行分类。通过自建短信语料库对该方法进行了实验。实验结果表明,该方法能够准确地对短信进行分类,降低合法短信的分类错误率,分类正确率达到99.3%,符合了短信分类要求。

关键词: 垃圾短信, 短信过滤, 文本分类, 朴素贝叶斯

Abstract: This paper analyzes a junk message filtering system based on the minimum risk Bayesian filtering algorithm, adopts Information Gain(IG) to select the feature, uses the minimum risk-based Bayesian filtering algorithm to classify. The experimental result, which data set is constructed by the real SMS from mobile company, shows that the method has a good performance on classification and low error rate on legit messages. The legit messages recall has achieve 99.3%. It’s suitable for SMS classification.

Key words: junk message, SMS filtering, text classification, naï, ve Bayesian

中图分类号: