作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2012, Vol. 38 ›› Issue (20): 140-143. doi: 10.3969/j.issn.1000-3428.2012.20.036

• 人工智能及识别技术 • 上一篇    下一篇

中文产品评论中评价对象的识别研究

徐叶强,朱艳辉,王文华,杜 锐,鲁 琳,邓 程,刘洪婧   

  1. (湖南工业大学计算机与通信学院,湖南 株洲 412008)
  • 收稿日期:2011-12-29 修回日期:2012-02-20 出版日期:2012-10-20 发布日期:2012-10-17
  • 作者简介:徐叶强(1982-),男,硕士,主研方向:文本分类,信息检索;朱艳辉,教授;王文华、杜 锐、鲁 琳、邓 程,硕士研究生;刘洪婧,讲师
  • 基金资助:
    湖南省自然科学基金资助项目(10JJ3002);教育部人文社会科学研究青年基金资助项目(09YJCZH019);中国包装总公司技术创新科研基金资助项目(2008-XK13);湖南工业大学研究生创新基金资助项目(CX1112)

Research on Recognition of Evaluation Object in Chinese Product Review

XU Ye-qiang, ZHU Yan-hui, WANG Wen-hua, DU Rui, LU Lin, DENG Cheng, LIU Hong-jing   

  1. (Institute of Computer & Communication, Hunan University of Technology, Zhuzhou 412008, China)
  • Received:2011-12-29 Revised:2012-02-20 Online:2012-10-20 Published:2012-10-17

摘要: 在中文产品评论中利用无监督的识别评价对象,准确率和召回率较低。为此,提出一种中文产品评论中的评价对象识别方法。对特殊词、评价对象非完整性、评价对象非稳定性等情况过滤噪声,利用评价对象在评论文本中与评价短语规则出现频率较高的特征,进行置信度排序。实验结果表明,对于14 799篇数码类评论文章,该方法的准确率、召回率和F值分别为0.605、0.780、0.681。

关键词: 无监督, 评价对象, 完整性, 稳定性, 产品评论

Abstract: Aiming at the ineffectiveness of the precision and recall of the unsupervised identification of comment target in the Chinese product reviews, a new method of the comment target recognition is proposed. The integrality and stability of the comment target and their calculation methods are put forward to filter the uncommented words, and the confidence level is scheduled with the consideration of the characteristics of the comment target in comment text, namely, its comment phrase rule and high occurrence frequency. Experimental results show that for the 14 799 digital comments, the precision, recall and F value reach 0.605, 0.780 and 0.681.

Key words: unsupervised, evaluation object, integrality, stability, product review

中图分类号: