作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程

• 人工智能及识别技术 • 上一篇    下一篇

具有权重因子的细粒度情感词库构建方法

黄高峰1a,周学广1a,李 娟1b ,刘 华2   

  1. (1. 海军工程大学a. 信息安全系; b. 计算机工程系,武汉430033; 2. 75753 部队,广州510600)
  • 收稿日期:2013-12-05 出版日期:2014-11-15 发布日期:2014-11-13
  • 作者简介:黄高峰(1979 - ),男,讲师、CCF 会员,主研方向:网络舆情分析,自然语言处理;周学广,教授;李 娟,副教授、博士研究生; 刘 华,工程师、硕士。
  • 基金资助:
    国家自然科学基金资助项目(611100042)。

Construction Method of Fine-grained Emotion Thesaurus with Weight Factor

HUANG Gaofeng 1a ,ZHOU Xueguang 1a ,LI Juan 1b ,LIU Hua 2   

  1. (1a. Information Security Department; 1b. Computer Engineering Department, University of Engineering,Wuhan 430033,China; 2. 75753 Troops,Guangzhou 510600,China)
  • Received:2013-12-05 Online:2014-11-15 Published:2014-11-13

摘要: 情感词库在文本情感分析中发挥重要作用,但在分析细粒度情感如人类情绪状态时却无法正确区分。针对 该问题,提出一种基于义原相似度计算的细粒度情感词库构建方法。对词语之间的义原相似度进行计算分析,构建7 类细粒度情感词库,并在此基础上给出细粒度情感词在词库中的权重计算方法,最终得到7 类具有权重值的细粒度 情感词库。实验结果表明,应用引入权重的细粒度情感词库后,文本情感倾向判别的准确率可提升5% 左右。

关键词: 义原相似度, 情绪, 细粒度情感, 权重计算, 权重因子, 词库构建

Abstract: Emotion thesaurus plays an important role in the text sentiment analysis,but it is particularly inadequate in the analysis of fine-grained emotions such as human emotions. To solve this problem,this paper presents a fine-grained emotion thesaurus construction method via the calculation of sememe similarity,and finishes the construction of seven sorts of thesaurus. Based on this work,this paper researches on the calculation method of the weight of fine-grained emotion words, and proposes a new weight calculation method of emotion words. Finally, this paper finishes the construction of seven sorts of thesaurus with weight value. Experimental results show that the introduction of the finegrained emotion thesaurus with weights can make the accuracy rate of the text emotional tendencies increased by about 5% .

Key words: sememe similarity, emotion, fine-grained emotion, weight calculation, weight factor, thesaurus construction

中图分类号: