Abstract: Emotion thesauri play an important role in text sentiment analysis, but they cannot correctly distinguish fine-grained emotions such as human emotional states. To address this problem, this paper proposes a method for constructing a fine-grained emotion thesaurus based on sememe similarity. The sememe similarity between words is computed and analyzed to build seven categories of fine-grained emotion thesaurus. On this basis, a method is given for calculating the weight of each fine-grained emotion word in the thesaurus, yielding seven categories of weighted fine-grained emotion thesaurus. Experimental results show that using the weighted fine-grained emotion thesaurus improves the accuracy of text emotional tendency classification by about 5%.
Key words: sememe similarity; emotion; fine-grained emotion; weight calculation; weight factor; thesaurus construction
Chinese Library Classification (CLC) number:
HUANG Gaofeng, ZHOU Xueguang, LI Juan, LIU Hua. Construction Method of Fine-grained Emotion Thesaurus with Weight Factor[J]. Computer Engineering (计算机工程).