作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2010, Vol. 36 ›› Issue (23): 194-196. doi: 10.3969/j.issn.1000-3428.2010.23.064

• 人工智能及识别技术 • 上一篇    下一篇

汉语词语情感倾向自动判断研究

张靖1,金浩2   

  1. (1. 攀枝花学院网络中心, 四川 攀枝花 617000; 2. 南京大学计算机科学与技术系, 南京 210093)
  • 出版日期:2010-12-05 发布日期:2010-12-14
  • 作者简介:张靖(1972-),男,副教授、硕士,主研方向:软件工程,文本挖掘;金浩,博士
  • 基金资助:
    四川省教育厅科技基金资助项目(07ZB155);攀枝花市科技基金资助项目(2009CYR13)

Study on Chinese Word Sentiment Polarity Automatic Estimation

ZHANG Jing1,JIN Hao2   

  1. (1. Campus Network Center, Panzhihua University, Panzhihua 617000, China; 2. Department of Computer Science and Technology, Nanjing University, Nanjing 210093, China)
  • Online:2010-12-05 Published:2010-12-14

摘要: 汉语词语情感倾向自动判断避免了个人判断的影响,并提高了主观性词典创建效率。 讨论和分析汉语词语情感倾向判断技术,使用情感特征集合进行倾向性描述,建立基于二元语法依赖关系的情感倾向互信息特征模型。采用机器学习方式得到分类器,对词语的情感倾向进行自动判别,并进行比较和优化,性能得以提高,最好的SVM准确率达到95.47%,F值达到93.90%。采用特征集合描述情感倾向性,在建立的互信息特征模型上,使用机器学习方法自动判断词语情感倾向是有效的。

关键词: 自动判断, 特征选择, 机器学习, 情感分析, 倾向

Abstract: The Chinese word sentiment polarity automatic judgment can avoid artificial error and improve the efficiency of the subjective lexicon creation. The technology of the Chinese word sentiment polarity judgment is discussed and analyzed. The polarity is described by using the sentiment characteristics set. The model of the sentiment polarity mutual information characteristics is created based on the bigram dependency of POS tagging. The classifier is available by machine learning to automatically judge, compare and optimize the word sentiment polarity. All of these help to improve the properties, the highest accuracy of SVM reaches 95.47%, and the F value is up to 93.90%. So it is effective to describe the sentiment polarity by using characteristic set and to automatically judge the word sentiment polarity by machine learning and based on the mutual characteristics model.

Key words: automatic estimation, feature selection, machine learning, sentiment analysis, polarity

中图分类号: