Abstract:
Analyzing the access sequences of Web users can reveal their hobbies, interests, and habits, and provides the information needed to upgrade and revise Web sites. This article proposes a data mining method based on the analysis of user access sequences. Taking the duration time of a Web page (the time a user stays on the page) as a parameter, the method reduces the number of Web pages in each session sequence and compresses the size of the frequent traversal sequences. Experimental results show that the algorithm reduces the cost of mining and provides a useful reference for mining the commercial data of Web users.
Key words: duration time of Web page; data mining; sequence
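The abstract does not give the details of the algorithm, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes each session is a list of (page, duration in seconds) pairs, drops pages whose duration time falls below a hypothetical threshold `min_duration`, and then counts contiguous page sequences that appear in at least `min_support` sessions. The function names, the threshold value, the sample data, and the support-counting scheme are all illustrative assumptions.

```python
from collections import Counter


def prune_session(session, min_duration):
    """Keep only pages whose duration time is at least min_duration seconds.

    `session` is a list of (page, duration) pairs; this format is an assumption.
    """
    return [page for page, duration in session if duration >= min_duration]


def frequent_sequences(sessions, length, min_support):
    """Return contiguous page sequences of the given length that occur in
    at least min_support sessions (each session counts at most once)."""
    counts = Counter()
    for pages in sessions:
        # Collect the distinct windows of this session before counting.
        seen = {tuple(pages[i:i + length])
                for i in range(len(pages) - length + 1)}
        counts.update(seen)
    return {seq: n for seq, n in counts.items() if n >= min_support}


# Hypothetical sessions: (page, duration in seconds).
sessions = [
    [("A", 40), ("B", 2), ("C", 35), ("D", 50)],
    [("A", 30), ("C", 45), ("E", 1), ("D", 20)],
    [("A", 25), ("B", 3), ("C", 60)],
]

# Pages viewed for less than 10 seconds are treated as pass-through pages.
pruned = [prune_session(s, min_duration=10) for s in sessions]
result = frequent_sequences(pruned, length=2, min_support=2)
print(pruned)
print(result)
```

In this toy data, pruning shortens every session, so fewer candidate sequences need to be counted, which is the kind of cost reduction the abstract describes.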
YANG Chang-Chun, SUN Jing. Mining of User Access Sequence Constrainted on Duration Time of Web Page[J]. Computer Engineering, 2010, 36(24): 45-47.
杨长春, 孙婧. 网页驻留时间约束的用户访问序列挖掘[J]. 计算机工程, 2010, 36(24): 45-47.