作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2011, Vol. 37 ›› Issue (8): 52-54. doi: 10.3969/j.issn.1000-3428.2011.08.018

• 软件技术与数据库 • 上一篇    下一篇

基于查询词扩展的中文垃圾短信检索

刘金岭   

  1. (淮阴工学院计算机工程学院,江苏 淮安 223003)
  • 出版日期:2011-04-20 发布日期:2012-10-31
  • 作者简介:刘金岭(1958-),男,教授,主研方向:数据仓库,文本数据挖掘

Chinese Junk SMS Retrieval Based on Query Words Expansion

LIU Jin-ling   

  1. (School of Computer Engineering, Huaiyin Institute of Technology, Huaian 223003, China)
  • Online:2011-04-20 Published:2012-10-31

摘要: 在垃圾短信检索中所使用的关键词与短信文本集中的词不匹配,从而影响检索效果。为此,提出一种基于上下文查询词扩展的检索方法,该方法根据关键词出现的上下文信息进行查询词扩展选择,同时考虑查询扩展词与整个查询语句及查询词的位置关系。选取3 000条短信文本进行实验,结果表明该方法能提高平均查准率。

关键词: 垃圾短信, 关键词, 查询扩展词, 检索

Abstract: The keywords used in information retrieval and text messages in junk SMS retrieval do not match, so it affects information retrieval. This paper proposes a new method of query expansion words on the basic of context and global information. At the same time, the expansion words are selected according to their relation with the whole query. Additionally, the position information between words is considered. The paper selects 3 000 text messages to be tested, results show that the method improves the average precision.

Key words: junk SMS, key words, query words expansion, retrieval

中图分类号: