Computer Engineering (计算机工程)

• Special Column •

基于支持向量机的炒作微博识别方法

董雨辰,刘 琰,罗军勇,张 进   

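  1. (State Key Laboratory of Mathematical Engineering and Advanced Computing, Zhengzhou 450001)
  • Received: 2014-04-11 Publication date: 2015-03-15 Release date: 2015-03-13
  • About the authors: DONG Yuchen (b. 1988), male, master's student; main research interests: network information security and network situational awareness. LIU Yan (corresponding author), associate professor, Ph.D.; LUO Junyong, professor; ZHANG Jin, master's student.
  • Funding:

    National Natural Science Foundation of China (61309007); National "863" Program of China (2012AA012902); National Science and Technology Support Program of China (2012BAH47B01).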

Hype Microblog Recognition Method Based on Support Vector Machine

DONG Yuchen, LIU Yan, LUO Junyong, ZHANG Jin

  1. (State Key Laboratory of Mathematical Engineering and Advanced Computing, Zhengzhou 450001, China)
  • Received: 2014-04-11 Online: 2015-03-15 Published: 2015-03-13

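Abstract:

Microblogs are both a center and a channel for the spread of public opinion, and they take part in the formation, development, and guidance of that opinion. Factors such as we-media publishing and the participation of opinion leaders have, to some extent, given rise to microblog rumors, false hype, and social mobilization. Targeting the propagation characteristics of hype microblogs, this paper analyzes the covert planning carried out by hype groups and mines the differences between ordinary and hype microblogs in propagation network structure, forwarding increment statistics, and other respects. Information on all users who commented on, forwarded, or liked a target microblog is obtained through the social networking site's application programming interface (API) and used to build the microblog's propagation network. Three attributes, namely community modularity, average shortest path, and network diameter, measure how tightly knit the network is, and a support vector machine (SVM) classifies the sampled microblogs on that basis to identify hype microblogs. Experimental results show that the method depends little on the attribute information of the users who spread a microblog, is sensitive to the structural features of the propagation network, and achieves relatively high accuracy in recognizing hype microblogs.

Keywords: social network, hype group, hype microblog, community modularity, network diameter, average shortest path, support vector machine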
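The three structural measures named in the abstract (community modularity, average shortest path, and network diameter) can be illustrated with a minimal standard-library sketch. It assumes the propagation network is an undirected, connected graph built from user-to-user interaction pairs, and that a community partition is already available; the abstract does not say how communities are detected, so the partition here is simply passed in. All function names and the toy edge list are hypothetical and are not taken from the paper.

```python
from collections import deque


def build_graph(edges):
    """Build an undirected adjacency map from (user, user) interaction pairs."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def bfs_distances(adj, source):
    """Hop distances from source to every reachable node (unweighted BFS)."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist


def path_features(adj):
    """Return (average shortest path length, diameter) over reachable pairs.

    Assumes at least two connected nodes.
    """
    total, pairs, diameter = 0, 0, 0
    for s in adj:
        for t, d in bfs_distances(adj, s).items():
            if t != s:
                total += d
                pairs += 1
                diameter = max(diameter, d)
    return total / pairs, diameter


def modularity(adj, communities):
    """Newman modularity Q of a given partition of an undirected graph."""
    two_m = sum(len(nbrs) for nbrs in adj.values())  # 2 * number of edges
    q = 0.0
    for community in communities:
        members = set(community)
        degree_sum = sum(len(adj[u]) for u in members)
        # Each internal edge is counted from both endpoints: this is 2 * L_c.
        internal = sum(1 for u in members for v in adj[u] if v in members)
        q += internal / two_m - (degree_sum / two_m) ** 2
    return q


# Synthetic toy network (not real microblog data): two triangles and a bridge.
edges = [("a", "b"), ("b", "c"), ("a", "c"),
         ("d", "e"), ("e", "f"), ("d", "f"), ("c", "d")]
adj = build_graph(edges)
avg_path, diameter = path_features(adj)
q = modularity(adj, [["a", "b", "c"], ["d", "e", "f"]])
print(avg_path, diameter, round(q, 3))  # 1.8 3 0.357
```

Each microblog's propagation network would yield one such three-value feature vector. All-pairs BFS is quadratic in network size, so a large propagation network would likely need sampling or a graph library.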

Abstract:

Microblogs are both a center and a channel for the spread of public opinion, and they are involved in the formation, development, and guidance of that opinion. Factors such as we-media publishing and the participation of opinion leaders contribute, to some extent, to microblog rumors, false hype, and social mobilization. Focusing on the propagation characteristics of hype microblogs, this paper analyzes the covert planning behind hype groups and mines the differences between ordinary and hype microblogs in propagation network structure and in the incremental statistics of forwards. A hype microblog recognition method based on Support Vector Machine (SVM) is proposed, which uses the community modularity, average shortest path, and network diameter of the propagation network as features. Experimental results show that the method depends little on user profile information, is sensitive to the structure of the propagation network, and achieves relatively high recognition accuracy.
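As a rough illustration of the classification step, the sketch below trains a soft-margin linear SVM by per-sample subgradient descent on the L2-regularised hinge loss, using only the standard library. This is a stand-in technique: the abstract does not state which SVM implementation, kernel, or parameters the authors used. The three-value feature vectors are synthetic, pre-centred placeholders rather than measurements from the paper, and the labels +1 and -1 merely stand for the two classes (hype and ordinary) without implying how the real features separate them. The names `train_linear_svm` and `predict` are hypothetical.

```python
def train_linear_svm(samples, labels, lam=0.01, lr=0.1, epochs=200):
    """Fit w, b for a linear SVM by subgradient descent on the hinge loss."""
    w = [0.0] * len(samples[0])
    b = 0.0
    for _ in range(epochs):
        for x, y in zip(samples, labels):
            margin = y * (sum(wi * xi for wi, xi in zip(w, x)) + b)
            # The L2 regulariser shrinks w on every step.
            w = [wi * (1.0 - lr * lam) for wi in w]
            if margin < 1.0:
                # Hinge loss is active: move the boundary toward this sample.
                w = [wi + lr * y * xi for wi, xi in zip(w, x)]
                b += lr * y
    return w, b


def predict(w, b, x):
    """Return +1 or -1 according to the side of the separating hyperplane."""
    score = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1 if score >= 0 else -1


# Synthetic, pre-centred feature vectors (placeholders, not real data):
# (community modularity, average shortest path, network diameter).
samples = [
    (1.0, 1.0, 1.0), (1.5, 1.2, 0.8), (0.9, 1.4, 1.1),
    (-1.0, -1.0, -1.0), (-1.5, -1.2, -0.8), (-0.9, -1.4, -1.1),
]
labels = [1, 1, 1, -1, -1, -1]

w, b = train_linear_svm(samples, labels)
print([predict(w, b, x) for x in samples])
```

In practice the raw features would need to be standardised before training, and a library SVM with a tuned kernel would be the more usual choice.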

Key words: social network, hype group, hype microblog, community modularity, network diameter, average shortest path, Support Vector Machine (SVM)

CLC Number: