作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2020, Vol. 46 ›› Issue (12): 113-119,141. doi: 10.19678/j.issn.1000-3428.0055734

• 人工智能与模式识别 • 上一篇    下一篇

一种混合语种文本的多维度多情感分析方法

李妍慧, 郑超美, 王炜立, 杨昕   

  1. 南昌大学 信息工程学院, 南昌 330031
  • 收稿日期:2019-08-14 修回日期:2019-11-26 发布日期:2019-12-13
  • 作者简介:李妍慧(1994-),女,硕士研究生,主研方向为人工智能、数据分析;郑超美(通信作者),教授;王炜立,博士;杨昕,硕士研究生。
  • 基金资助:
    江西省自然科学基金(20161BAB212040)。

A Multi-Dimensional and Multi-Emotion Analysis Method for Mixed Language Texts

LI Yanhui, ZHENG Chaomei, WANG Weili, YANG Xin   

  1. College of Information and Engineering, Nanchang University, Nanchang 330031, China
  • Received:2019-08-14 Revised:2019-11-26 Published:2019-12-13

摘要: 针对中英混合微博文本情感分析问题,提出一种新的多维度多情感分析方法。将中英混合语言文本分别翻译成中文和英文,提取文本在不同语言维度的特征,基于中英混合语言、中文、英文分别提取对应的语义信息,综合上述3种从不同语言维度获取的语义信息,构建一个多情感分类模型并进行训练微调,根据情感概率得到最终的分类结果。实验结果表明,相比单一的处理方法,该方法能够更全面地提取中英混合文本的语义特征,并精准地判别文本中的多种情感,具有较好的分类效果。

关键词: 中英混合微博, 多情感分析, 多维度方法, BERT模型, 迁移学习

Abstract: To address the emotion analysis of Chinese-English code-mixed microblog texts,this paper proposes a new multi-dimensional and multi-emotion analysis method.The Chinese-English code-mixed texts are translated into Chinese and English respectively,and features of the texts in different language dimensions are extracted.Then the corresponding semantic information is extracted based on the Chinese-English mixed language,Chinese and English.By combining the three kinds of semantic information extracted from different language dimensions,a multi-emotion classification model is constructed and trained for fine tuning.Finally,the classification result is obtained based on the emotional probability.Experimental results show that compared with simple processing methods,the proposed method more comprehensively extracts the semantic features of Chinese-English code-mixed texts,accurately distinguishes multiple emotions in the text,and has a better classification effect.

Key words: Chinese-English mixed microblogs, multi-emotion analysis, multi-dimensional method, BERT model, transfer learning

中图分类号: