作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程

• 人工智能及识别技术 • 上一篇    下一篇

基于词向量的产品评论有用度评估方法

郑华飞,周向东   

  1. (复旦大学 计算机科学技术学院,上海 200433)
  • 收稿日期:2016-03-15 出版日期:2017-04-15 发布日期:2017-04-14
  • 作者简介:郑华飞(1990—),男,硕士研究生,主研方向为自然语言处理、信息检索、文本挖掘;周向东,教授。
  • 基金资助:
    国家自然科学基金(61370157);上海市科技计划项目(14511107403)。

Product Review Helpfulness Assessment Method Based on Word Vector

ZHENG Huafei,ZHOU Xiangdong   

  1. (School of Computer Science,Fudan University,Shanghai 200433,China)
  • Received:2016-03-15 Online:2017-04-15 Published:2017-04-14

摘要: 产品评论的快速增长以及质量的参差不齐,使得消费者获得有用的产品评论变得困难。为此,提出一种新的产品评论有用度评估方法。引入词向量作为评论文本的深度特征表示,结合结构特征、情感特征、元数据特征等训练回归模型,自动地对评论进行有用度评估并基于有用度对评论进行排序。在Amazon真实数据集上的实验结果表明,该方法在回归性能和排序性能上均优于UGR+LEN+STR方法和基准方法。另外通过挖掘特定领域的词向量特征,该向量模型在RMSE,NDCG等评价指标上可有效地改善评估效果。

关键词: 有用度, 深度学习, 神经网络语言模型

Abstract: Rapid increasing volume of product reviews as well as their variant qualities makes it time-consuming and tedious for individuals to perceive valuable reviews.Therefore,this paper proposes an approach to automatically assess helpfulness of online product reviews based on word vector.The approach introduces word vector as a deep text feature and incorporates it with structure feature,sentiment feature and meta feature to learn a regression model for automatically helpfulness assessing.A ranking procedure based on helpfulness is performed.Compared with UGR+LEN+STR’s approach and the baseline,experimental results conducted on Amazons dataset show that this approach achieves promising performances both on regression and ranking.Furthermore,this paper explores domain-specific word vector model,which can improve assessing effect on RMSE,NDCG and other evaluation indexs.

Key words: product reviews, word vector, helpfulness, deep learning, neural network language model

中图分类号: