计算机工程 ›› 2007, Vol. 33 ›› Issue (01): 95-97.doi: 10.3969/j.issn.1000-3428.2007.01.032

• 软件技术与数据库 • 上一篇    下一篇

一种Web日志会话识别的优化方法

陈子军,王鑫昱,李 伟   

  1. (燕山大学信息学院计算机科学与工程系,秦皇岛066004)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2007-01-05 发布日期:2007-01-05

Method of Web Log Sessions Reconstruction

CHEN Zijun, WANG Xinyu, LI Wei   

  1. (Department of Computer Science and Engineering, Information Institute, Yanshan University, Qinhuangdao 066004)
  • Received:1900-01-01 Revised:1900-01-01 Online:2007-01-05 Published:2007-01-05

摘要: 会话识别是Web日志挖掘的关键步骤,然而很多方法所得到的会话不够精确。该文对此提出优化算法,并对最常用的Timeout方法识别的会话进行优化,通过实验证明会话质量得到了提高。

关键词: Web日志挖掘, 数据预处理, 会话识别

Abstract: Although sessions reconstruction is an important step in Web log mining, the sessions recognized by existing methods are not accurate. An algorithm resolving this problem is proposed. This paper improves the Timeout method with the algorithm. Finally the algorithm enhancing the quality of Web log sessions is proved by experiments.

Key words: Web log mining, Data preprocessing, Sessions reconstruction