作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2010, Vol. 36 ›› Issue (4): 30-32. doi: 10.3969/j.issn.1000-3428.2010.04.011

• 软件技术与数据库 • 上一篇    下一篇

基于主题的中文短信文本分类研究

刘金岭   

  1. (淮阴工学院计算机工程系,淮安 223003)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2010-02-20 发布日期:2010-02-20

Study on Chinese Short Message Text Classification Based on Theme

LIU Jin-ling   

  1. (Dept. of Computer Engineering, Huaiyin Institute of Technology, Huaian 223003)
  • Received:1900-01-01 Revised:1900-01-01 Online:2010-02-20 Published:2010-02-20

摘要: 根据中文短信文本分类的特点,提出同义概念归并、上下位概念的聚焦以及短信文本重点词汇的确定方法,利用主题句选取算法获取短信文本的主题,采用KNN算法将短信文本的主题进行分类。仿真实验结果表明,该算法能够有效提高短信文本的分类速度。

关键词: 短信文本, KNN算法, 主题句

Abstract: According to characteristics of Chinese short message text categorization, some contents are proposed, such as the synonymy concept merging, the superior concept and sub-concept semantic focusing and using of topic sentences. The algorithm getting theme of short text is used to obtain the text theme. KNN algorithm is also used to classify the short text subject. Simulation experimental results show this algorithm can improve the classification speed of the short text.

Key words: short message text, KNN algorithm, theme sentence

中图分类号: