Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering ›› 2007, Vol. 33 ›› Issue (06): 59-61. doi: 10.3969/j.issn.1000-3428.2007.06.021

• Software Technology and Database • Previous Articles     Next Articles

Filter-based Web Access Pattern Mining

TONG Qiang1,3, ZHOU Yuanchun1,3, WU Kaichao1,2,3, YAN Baoping2   

  1. (1. Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100080; 2. Computer Network Information Center, Chinese Academy of Sciences, Beijing 100080; 3. Graduate School of Chinese Academy of Sciences, Beijing 100080)
  • Received:1900-01-01 Revised:1900-01-01 Online:2007-03-20 Published:2007-03-20

基于过滤器的Web访问模式挖掘

佟 强1,3,周园春1,3,吴开超1,2,3,阎保平2   

  1. (1. 中国科学院计算技术研究所,北京 100080;2. 中国科学院计算机网络信息中心,北京 100080;3. 中国科学院研究生院,北京 100080)

Abstract: Due to the complexity and inaccuracy of user identification and session identification in the traditional Web access pattern mining system, this paper proposes the filter based on Web access pattern mining system, which can identify a user and a session accurately, and provides good data for the mining algorithms. It presents the implementation and deployment of the log filter, and proposes the Web access pattern mining algorithm. The method is widely used in the scientific database.

Key words: Data mining, Web log, Access pattern, Frequent set

摘要: 针对传统Web访问模式挖掘系统中用户识别和会话识别的复杂性和不准确性,该文提出了基于过滤器的Web访问模式挖掘系统。它能够准确地识别用户和会话,为挖掘算法提供优质的数据。给出了日志过滤器的实现和部署,提出了Web访问模式的挖掘算法。目前该方法已经广泛地应用于科学数据库系统中。

关键词: 数据挖掘, Web日志, 访问模式, 频集

CLC Number: