Author Login Chief Editor Login Reviewer Login Editor Login Remote Office

Computer Engineering ›› 2011, Vol. 37 ›› Issue (8): 52-54.

• Networks and Communications • Previous Articles     Next Articles

Chinese Junk SMS Retrieval Based on Query Words Expansion

LIU Jin-ling   

  1. (School of Computer Engineering, Huaiyin Institute of Technology, Huaian 223003, China)
  • Online:2011-04-20 Published:2012-10-31

基于查询词扩展的中文垃圾短信检索

刘金岭   

  1. (淮阴工学院计算机工程学院,江苏 淮安 223003)
  • 作者简介:刘金岭(1958-),男,教授,主研方向:数据仓库,文本数据挖掘

Abstract: The keywords used in information retrieval and text messages in junk SMS retrieval do not match, so it affects information retrieval. This paper proposes a new method of query expansion words on the basic of context and global information. At the same time, the expansion words are selected according to their relation with the whole query. Additionally, the position information between words is considered. The paper selects 3 000 text messages to be tested, results show that the method improves the average precision.

Key words: junk SMS, key words, query words expansion, retrieval

摘要: 在垃圾短信检索中所使用的关键词与短信文本集中的词不匹配,从而影响检索效果。为此,提出一种基于上下文查询词扩展的检索方法,该方法根据关键词出现的上下文信息进行查询词扩展选择,同时考虑查询扩展词与整个查询语句及查询词的位置关系。选取3 000条短信文本进行实验,结果表明该方法能提高平均查准率。

关键词: 垃圾短信, 关键词, 查询扩展词, 检索

CLC Number: