作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程

• 人工智能及识别技术 • 上一篇    下一篇

基于深度游走模型的标签传播社区发现算法

冯曦 1,朱福喜 1,2,刘世超 1   

  1. (1.武汉大学 计算机学院,武汉 430072; 2.汉口学院 计算机科学与技术学院,武汉 430212)
  • 收稿日期:2016-12-01 出版日期:2018-03-15 发布日期:2018-03-15
  • 作者简介:冯曦(1991—),女,硕士研究生,主研方向为社会网络分析、数据挖掘;朱福喜,教授、博士生导师;刘世超,博士。
  • 基金资助:
    国家自然科学基金(61272277)。

Label Propagation Community Discovery Algorithm Based on DeepWalk Model

FENG Xi  1,ZHU Fuxi  1,2,LIU Shichao  1   

  1. (1.School of Computer,Wuhan University,Wuhan 430072,China;2.School of Computer Science and Technology,Hankou University,Wuhan 430212,China)
  • Received:2016-12-01 Online:2018-03-15 Published:2018-03-15

摘要: 针对传统标签传播算法准确率较低的问题,提出一种基于深度游走模型的改进标签传播算法。以社会网络作为深度游走模型的输入,通过深度随机游走的方式对网络中的节点进行采样得到随机序列,并基于SkipGram模型对其进行神经网络训练。运用层次Softmax对SkipGram模型进行求解,得到节点的特征向量后在邻居节点之间计算节点相似度,将其作为标签传播概率的权重进行标签的传播迭代,最终得到社区发现的结果。在6个真实网络数据集和合成数据集上进行实验,结果表明,与传统标签传播算法相比,该改进算法具有较高的准确率,尤其对于节点个数在100以上的真实网络,Q值提高10%以上。

关键词: 深度游走模型, 随机序列, 特征向量, SkipGram模型, 节点相似度, 传播迭代

Abstract: Aiming at the problem of low accuracy in traditional Label Propagation Algorithm(LPA),an improved label propagation algorithm based on DeepWalk model is proposed.Firstly,the algorithm takes the social network as the input of the DeepWalk model,samples the nodes in the network to get random sequences by means of a deep random walk,and uses SkipGram model to train the samples in neural network.Secondly,computes the kernel part of SkipGram model by hierarchical Softmax and obtains the feature vector of the nodes,and then calculates the similarity between the nodes.Finally,takes the similarity of the nodes as the weight during the label propagation procedure,and then gets the results of community detection.Experimental results on 6 real network dataset and synthetic dataset show that,compares with the traditional label propagation algorithm,the improved algorithm gets the higher accuracy,and especially when the nodes’ number is more than 100 in real network dataset,the Q shows 10% rise in improved algorithm.

Key words: DeepWalk model, random sequence, feature vector, SkipGram model, node similarity, propagation iteration

中图分类号: