
Computer Engineering ›› 2010, Vol. 36 ›› Issue (24): 258-260. doi: 10.3969/j.issn.1000-3428.2010.24.093

• Development Research and Design Technology •

Improved PageRank Algorithm Based on Anchor Texts Similarity

WANG Zhong-fei, WANG Biao

  1. (Department of Mathematics, Baoji University of Arts and Sciences, Baoji 721013, China)
  • Published: 2010-12-20 Released online: 2010-12-14
  • About the authors: WANG Zhong-fei (born 1983), female, teaching assistant and master's student, main research interests: ranking algorithms and network security; WANG Biao, teaching assistant and master's student

Abstract: This paper analyzes the PageRank algorithm, a key technology of the Google search engine. It identifies three problems with the algorithm and reviews the improvements that have been proposed for each of them. It then proposes an improved PageRank algorithm that incorporates anchor text similarity, and uses Nutch to compare the traditional PageRank algorithm with the improved one experimentally. The experimental results show that the improved algorithm raises the precision of the search results and helps reduce topic drift.

Key words: PageRank algorithm, anchor text, similarity, topic drift
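
The abstract does not state the formula used, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes that each out-link is weighted by the bag-of-words cosine similarity between its anchor text and the text of the target page, so that links whose anchor text matches the target's topic pass on more rank. The names `cosine_similarity`, `anchor_weighted_pagerank`, `page_text`, and `epsilon`, along with the sample pages, are hypothetical.

```python
from collections import Counter
from math import sqrt


def cosine_similarity(text_a, text_b):
    """Cosine similarity between the bag-of-words vectors of two strings."""
    a = Counter(text_a.lower().split())
    b = Counter(text_b.lower().split())
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def anchor_weighted_pagerank(links, page_text, damping=0.85, iterations=50, epsilon=0.01):
    """PageRank variant whose out-link weights follow anchor text similarity.

    links:     {source: [(target, anchor_text), ...]}
    page_text: {page: text describing the page}  (assumed input)
    epsilon:   small positive constant so every link keeps a nonzero weight
    """
    pages = sorted(set(links) | {t for outs in links.values() for t, _ in outs})
    n = len(pages)

    # Normalize the similarity scores of each page's out-links so they sum to 1.
    weights = {}
    for src in pages:
        raw = [
            (tgt, cosine_similarity(anchor, page_text.get(tgt, "")) + epsilon)
            for tgt, anchor in links.get(src, [])
        ]
        total = sum(w for _, w in raw)
        weights[src] = [(tgt, w / total) for tgt, w in raw]

    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Rank held by pages without out-links is spread uniformly.
        dangling = sum(rank[p] for p in pages if not weights[p])
        new_rank = {p: (1.0 - damping) / n + damping * dangling / n for p in pages}
        for src in pages:
            for tgt, w in weights[src]:
                new_rank[tgt] += damping * rank[src] * w
        rank = new_rank
    return rank


# Hypothetical three-page graph: the link A -> B has on-topic anchor text,
# while the link A -> C has anchor text unrelated to page C.
links = {
    "A": [("B", "python tutorial"), ("C", "cheap flights")],
    "B": [("A", "home")],
    "C": [("A", "home")],
}
page_text = {
    "A": "home page",
    "B": "python tutorial for beginners",
    "C": "gardening tips",
}
ranks = anchor_weighted_pagerank(links, page_text)
```

In this sketch, page B ends up ranked above page C even though both receive exactly one link from A, which is the kind of effect that could reduce topic drift: an off-topic link contributes little rank. Setting every weight to `1 / out_degree` instead would recover the traditional uniform PageRank.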
