
Computer Engineering ›› 2006, Vol. 32 ›› Issue (18): 70-71,7. doi: 10.3969/j.issn.1000-3428.2006.18.025

• Software Technology and Database •

Method for Importing Web Content Mining into Patterns Clustering

CHEN Zhengming, MA Guangzhi

  1. (Department of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan 430074)
  • Received: 1900-01-01 Revised: 1900-01-01 Online: 2006-09-20 Published: 2006-09-20

Abstract: When clustering users' access patterns (travel paths), this paper introduces a page similarity factor, so that the similarity between two access patterns is measured from two aspects: the main content a user visits and the path the user follows. It examines in depth the problems that earlier research on this kind of integration overlooked and proposes an effective way to address them. The results show that the improved method reasonably reduces the number of clusters produced and identifies a website's potential user classes more accurately.

Key words: Vector space model, Web content mining, Web usage mining, Fuzzy clustering
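
The abstract does not give the paper's formulas, so the following is only a minimal sketch of the general idea of measuring access-pattern similarity from both content and path. Everything in it is an assumption made for illustration: the names `content_vector`, `path_similarity`, and `pattern_similarity`, the use of cosine similarity over summed term vectors (a simple vector space model), the Jaccard overlap of page transitions as the path measure, and the weighting parameter `alpha`. It is not the authors' method.

```python
import math
from collections import Counter


def cosine_similarity(a, b):
    """Cosine similarity between two sparse term-weight vectors (dicts)."""
    dot = sum(w * b.get(term, 0.0) for term, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def content_vector(path, page_terms):
    """Sum the term weights of every page visited in one access pattern.

    page_terms is a hypothetical mapping: page id -> {term: weight}.
    """
    total = Counter()
    for page in path:
        total.update(page_terms.get(page, {}))
    return total


def path_similarity(p, q):
    """Jaccard overlap of consecutive page transitions (an assumed measure)."""
    edges_p = set(zip(p, p[1:]))
    edges_q = set(zip(q, q[1:]))
    if not edges_p and not edges_q:
        # Paths with no transitions: similar only if they are identical.
        return 1.0 if p == q else 0.0
    return len(edges_p & edges_q) / len(edges_p | edges_q)


def pattern_similarity(p, q, page_terms, alpha=0.5):
    """Blend content similarity and path similarity; alpha is an assumed weight."""
    content = cosine_similarity(
        content_vector(p, page_terms), content_vector(q, page_terms)
    )
    return alpha * content + (1.0 - alpha) * path_similarity(p, q)
```

Under these assumptions, two sessions that follow different links but land on pages with similar text still receive a nonzero similarity, which is one way such a combination could merge clusters that a purely path-based measure would keep apart. A pairwise similarity matrix built this way could then be passed to a fuzzy clustering step.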
