Clustering of Search Engine Query Log

doi:10.3969/j.issn.1000-3428.2009.01.014

Computer Engineering ›› 2009, Vol. 35 ›› Issue (1): 43-45,4. doi: 10.3969/j.issn.1000-3428.2009.01.014

• Software Technology and Database • Previous Articles Next Articles

Clustering of Search Engine Query Log

ZHANG Yu-lian, LI Yan-wei, WANG Quan, YUAN Fu-yong

(College of Information Science and Engineering, Yanshan University, Qinhuangdao 066004)

Received:1900-01-01 Revised:1900-01-01 Online:2009-01-05 Published:2009-01-05

搜索引擎查询日志的聚类

张玉连，李彦威，王权，原福永

(燕山大学信息科学与工程学院，秦皇岛 066004 )

Abstract

Abstract: In recent years, with the search engine technology and the network data mining technology development, how to find the useful information from the search engine query log becomes an important research direction. This paper discusses the excellences and the disadvantages of the clustering algorithm proposed by Beeferman and the improved algorithm which is proposed by Chan. A new improved algorithm based on the user profile of the Webpage is proposed that can weaken the influence of the noises data. And the simulation experiment proves that the new algorithm is better than the Beeferman algorithm and the Chan algorithm.

Key words: user profile, search engine query log, data mining

摘要： 随着搜索引擎技术和网络数据挖掘技术的发展，怎样从搜索引擎查询日志中找到有用的信息成为研究热点。该文在讨论Beeferman提出的算法及Chan对其改进的算法的优缺点后，提出一个基于用户网页兴趣度的改进算法。该算法能进一步减小噪声数据的影响，并通过模拟实验对3种不同的算法进行了对比。

关键词: 用户兴趣, 搜索引擎查询日志, 数据挖掘

CLC Number:

TP391

ZHANG Yu-lian; LI Yan-wei; WANG Quan; YUAN Fu-yong. Clustering of Search Engine Query Log[J]. Computer Engineering, 2009, 35(1): 43-45,4.

张玉连;李彦威;王权;原福永. 搜索引擎查询日志的聚类[J]. 计算机工程, 2009, 35(1): 43-45,4.

/ / Recommend / Download Citations

URL: http://www.ecice06.com/EN/10.3969/j.issn.1000-3428.2009.01.014

http://www.ecice06.com/EN/Y2009/V35/I1/43

[1]	XI Rongkang, CAI Manchun, LU Tianliang. Tor Traffic Analysis Model Based on Data Enhancement and Stream Data Processing [J]. Computer Engineering, 2023, 49(3): 177-184.
[2]	GU Qingzhu, DONG Hongbin. MI Loss Evaluation Model for k-Anonymity in PPDM [J]. Computer Engineering, 2022, 48(4): 143-147.
[3]	WANG Lu, LIU Xiaoqing, HE Zhenying. Frequent Word Sequence Mining Algorithm in Continuous Time Interval [J]. Computer Engineering, 2022, 48(2): 79-85,91.
[4]	ZHANG Pan, GAO Feng, ZHOU Yi, RAO Hanyu, MAO Dong, LI Jing. An Online Real-Time Anomaly Detection Method for Microservice Call Chains [J]. Computer Engineering, 2022, 48(11): 161-169.
[5]	WU Jun, OUYANG Aijia, ZHANG Lin. Redundant Contrast Pattern Filtering Algorithm for Permutation Testing [J]. Computer Engineering, 2022, 48(1): 75-84.
[6]	WU Jun, OUYANG Aijia, ZHANG Lin. Independent Exact Permutation Testing Algorithm for Distinguishing Sequential Pattern Discovery [J]. Computer Engineering, 2021, 47(8): 45-53,61.
[7]	DU Shiqing, WANG Peng, WANG Wei. A MDL-based Pattern Mining Algorithm for Log Sequences [J]. Computer Engineering, 2021, 47(2): 118-125.
[8]	WEI Wenhao, TANG Zekun, LIU Gang. PBK-means Algorithm Based on Distance and Density [J]. Computer Engineering, 2020, 46(9): 68-75.
[9]	SHI Mingyang, WANG Peng, WANG Wei. Algorithm of Supervised Time Series Segmentation and State Recognition [J]. Computer Engineering, 2020, 46(5): 131-138.
[10]	ZHANG Pan, LU Guangyue, Lü Shaoqing, ZHAO Xueli. Attributed Network Representation Learning Based on Matrix Factorization [J]. Computer Engineering, 2020, 46(10): 67-73.
[11]	WANG Huijian, LIU Zheng, LI Yun, LI Tao. Trend Prediction Method of Time Series Trends Based on Neural Network Language Model [J]. Computer Engineering, 2019, 45(7): 13-19,25.
[12]	Xijun ZHANG, Zhanting YUAN, Hong ZHANG, Weijun GAO, Enzhan ZHANG. Research on Preprocessing Method for Traffic Trajectory Big Data [J]. Computer Engineering, 2019, 45(6): 26-31.
[13]	LI Ke,WANG Hai,XU Xiaolong,DU Yu. Mobile Network Cell Information Detection Method Based on Mobile Crowdsensing [J]. Computer Engineering, 2019, 45(2): 92-100.
[14]	CUI Chen,DENG Zhaohong,WANG Shitong. Radial Basis Function Neural Network Model Based on Lasso Sparse Learning [J]. Computer Engineering, 2019, 45(2): 173-177.
[15]	XIE Bin,ZHANG Kun,CAI Ying,JIANG Tongtong,MA Mengyue. Research on Mining Algorithm for Association Co-occurrence Rule of Moving Targets [J]. Computer Engineering, 2018, 44(8): 61-67,73.

Please choose a citation manager

Content to export

Clustering of Search Engine Query Log

搜索引擎查询日志的聚类

PDF

Knowledge

Cited

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics

Comments

模态框（Modal）标题

Please choose a citation manager

Content to export

Clustering of Search Engine Query Log

搜索引擎查询日志的聚类

PDF

Knowledge

Cited

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics

Comments