作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2012, Vol. 38 ›› Issue (11): 254-257,261. doi: 10.3969/j.issn.1000-3428.2012.11.077

• 开发研究与设计技术 • 上一篇    下一篇

基于用户行为的产品垃圾评论者检测研究

邱云飞,王建坤,邵良杉,刘大有   

  1. (辽宁工程技术大学软件学院,辽宁 葫芦岛 125105)
  • 收稿日期:2011-12-05 出版日期:2012-06-05 发布日期:2012-06-05
  • 作者简介:邱云飞(1976-),男,副教授、博士,主研方向:数据挖掘;王建坤,硕士研究生;邵良杉、刘大有,教授、博士生导师
  • 基金资助:
    国家自然科学基金资助项目(70971059);辽宁省高等学校创新团队支持计划基金资助项目(2009T045);辽宁省科技攻关计划基金资助项目(2007308003)

Research on Product Review Spammer Detection Based on Users’ Behavior

QIU Yun-fei, WANG Jian-kun, SHAO Liang-shan, LIU Da-you   

  1. QIU Yun-fei, WANG Jian-kun, SHAO Liang-shan, LIU Da-you
  • Received:2011-12-05 Online:2012-06-05 Published:2012-06-05

摘要: 为找到垃圾评论的制造者,提出一种基于用户行为的产品垃圾评论者检测方法。从垃圾评论者的行为目的出发,将其发表垃圾评论的5种行为模式作为垃圾评论者的检测指标,从卓越亚马逊网站获取1 470个评论用户,按单指标选取、5个指标集成选取的方法确定最可能和最不可能成为垃圾评论者的评论用户各25个,并对这50个评论者进行人工标记,根据标记结果设计有监督的线性回归模型。实验结果表明,该模型从1 470个评论者中发现88个用户为垃圾评论者,对垃圾评论者的检测效果优于基于用户有用性投票的基准方法。

关键词: 用户行为, 线性回归模型, 垃圾评论者检测, 短文本, 产品评论, 垃圾评论

Abstract: In order to find the review spammers, this paper proposes a user review spammer detecting method that is based on users’ behavior. Starting from the purpose of review spammers, it makes spammer’s five behavior patterns as the index of spammer detection. Based on the 1 470 reviewers, it gets from the JOYO Amazon website, according to a single indicator selection and 5 indicators integration selection, finally gets the 25 of most likely spammers and 25 of the most unlikely spammers, and markes artificially the 50 of suspicious reviewers, according to the artificial results trained a supervised linear regression model which is based on the 5 indicators. Experimental results show that the model of the spammer detection finds 88 review spammers in the 1 470 reviewers, and is more effective than those based on user voting usefulness of the baseline method.

Key words: user behavior, linear regression model, review spammer detection, short text, product review, review spam

中图分类号: