
Computer Engineering ›› 2009, Vol. 35 ›› Issue (15): 284-inside back cover. doi: 10.3969/j.issn.1000-3428.2009.15.099

• Development Research and Design Technology •

Improved Web Mining Algorithm Based on PageRank

JIAO Jin-tao

  1. (Dept. of Computer Science and Engineering, Wuyi University, Wuyishan 354300)
  • Received: 1900-01-01 Revised: 1900-01-01 Online: 2009-08-05 Published: 2009-08-05

Abstract: This paper proposes an improved Web mining algorithm based on the PageRank algorithm used by Google. In the implementation, Web page usage information and page creation-date information are converted into a click vector and a date vector. The two vectors are weighted and then normalized into a single vector, which is added as a constant term to the improved iterative algorithm. Experimental results show that the improved algorithm increases the accuracy with which the importance of a Web page is judged.

Key words: search engine, Web page, PageRank algorithm
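
The abstract gives only an outline of the method, so the following is a minimal sketch of one plausible reading, not the paper's actual formulation. It assumes that the normalized combination of the click vector and the date vector replaces the uniform constant term in the standard PageRank iteration. The names `bias_vector` and `biased_pagerank`, the mixing weight `alpha`, the damping factor 0.85, and the way a "recency" score is derived from the creation date are all illustrative assumptions that do not come from the paper.

```python
def normalize(values):
    """Scale non-negative values so they sum to 1 (uniform if all are zero)."""
    total = sum(values)
    n = len(values)
    if total == 0:
        return [1.0 / n] * n
    return [v / total for v in values]


def bias_vector(clicks, recency, alpha=0.5):
    """Weight the click and date vectors, then normalize the result.

    alpha is a hypothetical mixing weight; the abstract does not state
    how the two vectors are weighted.
    """
    c = normalize(clicks)
    t = normalize(recency)
    mixed = [alpha * ci + (1.0 - alpha) * ti for ci, ti in zip(c, t)]
    return normalize(mixed)


def biased_pagerank(links, bias, damping=0.85, iterations=100, tol=1e-10):
    """Power iteration in which `bias` is the constant term.

    links[i] is the list of page indices that page i links to.
    bias must be non-negative and sum to 1.
    """
    n = len(links)
    rank = [1.0 / n] * n
    for _ in range(iterations):
        # Rank held by pages without out-links is redistributed via bias.
        dangling = sum(rank[i] for i in range(n) if not links[i])
        new_rank = [(1.0 - damping) * b + damping * dangling * b for b in bias]
        for i, targets in enumerate(links):
            if targets:
                share = damping * rank[i] / len(targets)
                for j in targets:
                    new_rank[j] += share
        delta = sum(abs(a - b) for a, b in zip(new_rank, rank))
        rank = new_rank
        if delta < tol:
            break
    return rank


# Three pages linked in a cycle: 0 -> 1 -> 2 -> 0.
links = [[1], [2], [0]]
# Page 0 received all the clicks; all pages are equally recent.
bias = bias_vector(clicks=[10, 0, 0], recency=[1, 1, 1], alpha=0.5)
ranks = biased_pagerank(links, bias)
```

With a uniform constant vector the three pages in this cycle would score equally; under the assumptions above, the heavily clicked page 0 ends up with the highest score, which is the kind of shift in importance the abstract describes.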

CLC Number: