作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2006, Vol. 32 ›› Issue (5): 122-124.

• 安全技术 • 上一篇    下一篇

基于签名的近似垃圾邮件检测算法

詹 川1,卢显良2,侯孟书2,刘志辉3   

  1. 1.重庆工商大学商务策划学院,重庆 400067;2. 电子科技大学计算机科学与工程学院,成都 610054;3. 山东工商学院数学与信息科学学院,烟台 264005)
  • 出版日期:2006-03-05 发布日期:2006-03-05

A Signature-based Approximate Spam Detection Algorithm

ZHAN Chuan1, LU Xianliang2, HOU Mengshu2, LIU Zhihui3   

  1. 1. Business Planning College, Chongqing Technology & Business University, Chongqing 400067; 2. School of Computer Science and Engineering, UEST of China, Chengdu 610054; 3. Mathematics & Information Science College, SDIBT, Yantai 264005
  • Online:2006-03-05 Published:2006-03-05

摘要: 针对垃圾邮件短小、一定时间内在网络上重复、大量地散发的特点,提出了基于签名的近似垃圾邮件检测算法(ASD)。该算法以句为基本单位,求取邮件所含的全部句子的摘要,垃圾邮件的近似检测转变为两个摘要集近似度的比较。通过与近似文本查询算法DSC、DSC-SS、I-Match 的比较,ASD 算法在近似垃圾邮件查询中,表现出样本集的存储空间大小适中、运算时间短、鲁棒性高、高准确率、高召回率的特征。

关键词: 近似垃圾邮件检测;垃圾邮件过滤;签名;文本近似度;查询

Abstract: In the term of characteristics of spammers sending spam in bulk over a relatively short period of time, this paper presents a signature-based approximate spam detection(ASD) algorithm. ASD detects similarity of E-mails by comparing sets of digests of all sentences in E-mails. The paper compares ASD algorithm with DSC, DSC-SS, I-Match algorithms. To approximate spam detection, ASD has a good performance in samples storage, computing time, robustness, precision and recall

Key words: Approximate spam detection; Spam filtering; Signature; Similarity of documents; Query