作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2012, Vol. 38 ›› Issue (16): 1-4. doi: 10.3969/j.issn.1000-3428.2012.16.001

• 专栏 •    下一篇

基于粗糙集的在线评论情感分析模型

王祖辉,姜 维   

  1. (哈尔滨工业大学信息管理与信息系统研究所,哈尔滨 150001)
  • 收稿日期:2012-01-13 修回日期:2012-03-05 出版日期:2012-08-20 发布日期:2012-08-17
  • 作者简介:王祖辉(1980-),男,博士研究生,主研方向:电子商务,评论情感分析,数据挖掘;姜 维,讲师、博士
  • 基金资助:

    国家自然科学基金资助项目(70801022);中央高校基本科研业务费专项基金资助项目(HIT.NSRIF.2010083);中国博士后科学基金资助项目(20090450973);黑龙江省教育厅科学技术研究基金资助项目(12511435)

Online Reviews Sentiment Analysis Model Based on Rough Sets

WANG Zu-hui, JIANG Wei   

  1. (Research Center of Information Management and Information System, Harbin Institute of Technology, Harbin 150001, China)
  • Received:2012-01-13 Revised:2012-03-05 Online:2012-08-20 Published:2012-08-17

摘要:

针对在线评论情感分析的复杂特征抽取问题,提出一种基于粗糙集的在线评论情感分析模型。分析传统词袋性特征,指出固定搭配特征在情感极性判别中的作用,采用粗糙集方法挖掘在线评论中的固定搭配特征,将其融合于SVM与Naive Bayes等情感分析模型中。实际酒店的在线评论情感分析结果表明,增加粗规则后,SVM模型与Naive Bayes模型获得的评论情感判别精度都有所提高。

关键词: 情感分析, 粗糙集, 特征提取, 词袋特征, 固定搭配特征, 支持向量机

Abstract:

In order to extract the complicated features for online reviews sentiment analysis, an online reviews sentiment analysis model based on rough sets is presented. It analyzes the traditional word bags features, and the regular collocation features are confirmed to play important role in identifying the reviews sentiment. The regular collocation features as rough rules extracted are incorporated into the sentiment analysis model, and the results on real hotel online reviews show the performance improvement under both Support Vector Machine(SVM) model and Naive Bayes model.

Key words: sentiment analysis, rough set, feature extraction, word bag feature, regular collocation feature, Support Vector Machine(SVM)

中图分类号: