作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2009, Vol. 35 ›› Issue (22): 35-37. doi: 10.3969/j.issn.1000-3428.2009.22.012

• 软件技术与数据库 • 上一篇    下一篇

搜索引擎PageRank算法的改进

杨劲松,凌培亮   

  1. (同济大学机械工程学院,上海 200092)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2009-11-20 发布日期:2009-11-20

Improvement of PageRank Algorithm for Search Engine

YANG Jin-song, LING Pei-liang   

  1. (College of Mechanical Engineering, Tongji University, Shanghai 200092)
  • Received:1900-01-01 Revised:1900-01-01 Online:2009-11-20 Published:2009-11-20

摘要: 为了解决企业快速决策时信息检索的问题,提出一种改进的PageRank算法。在考虑网页产生时间因素的同时,通过锚文本与网页主题的相似度分析按权重分配网页各正向链接PageRank值,产生的PageRank值更贴合主题搜索引擎的要求,并保持算法的简洁性。实验结果证明该改进算法能有效减少主题漂移现象,恰当提升新网页PageRank值。

关键词: 搜索引擎, 锚文本, 向量空间模型

Abstract: In order to solve the problems in information retrieval when enterprise making rapid decision, this paper proposes an improved PageRank algorithm. Considering the time factor by Web page, it distributes the forward link different PageRank value based on the proportion by the similarity analysis between anchor text and Web page text. The final PageRank value is more suitable for topic-specific search engine and keeps simplicity of algorithm. Experimental result shows that the improved algorithm can effectively reduce the phenomenon of topic-drift and enhance the PageRank value of new Web page.

Key words: search engine, anchor text, Vector Space Model(VSM)

中图分类号: