摘要: 介绍了各类垃圾邮件过滤技术,分析了已经应用于垃圾邮件内容过滤领域的一些分类算法存在的某些不足,创新地将一种新的分类算法(SECTILE)应用于垃圾邮件的分类过滤中去,并设计了一个多层次垃圾邮件过滤系统。该系统整合了多项垃圾邮件过滤技术(黑名单/白名单技术、基于规则的过滤、基于内容的过滤),实验和分析结果表明,该系统提高了垃圾邮件过滤的效率和准确性。
关键词:
垃圾邮件过滤,
SECTILE,
文本分类
Abstract: This paper introduces simply kinds of the current spam filtering technology and analyzes the disadvantages of some algorithms which have been implemented on spam filtering field. A new algorithm(SECTILE) is chosen to implement content based filtering. Three kinds of filtering technology (black/white list based, rule based, content based) are implemented on the spam filtering system it designs. Experiment and analysis prove that the efficiency and accuracy of spam filtering are improved.
Key words:
Spam filtering,
SECTILE,
Text categorization
张 羿;周建国;晏蒲柳. 垃圾邮件过滤系统的研究与实现[J]. 计算机工程, 2006, 32(18): 106-108,.
ZHANG Yi;ZHOU Jianguo;YAN Puliu. Research and Implementation of Spam Filtering System[J]. Computer Engineering, 2006, 32(18): 106-108,.