Large Scale Website Logical Domain Mining Algorithm

Computer Engineering, 2008, Vol. 34, Issue (9): 101-102. doi: 10.3969/j.issn.1000-3428.2008.09.036

ZHENG Jiao-ling (郑皎凌)

(Department of Software Engineering, Chengdu University of Information Technology, Chengdu 610225)

Received: 1900-01-01 Revised: 1900-01-01 Online: 2008-05-05 Published: 2008-05-05

Abstract

Building on the website logical domain theory proposed by Wen-Syan Li et al., this paper proposes a logical domain core model for websites and a logical domain mining algorithm built on that model. The algorithm operates on the graph structure of a website's hyperlinks to obtain the site's logical domains. In a comparative test against Wen-Syan Li's algorithm, it obtains the same number of logical domains while overcoming the efficiency problems caused by that algorithm's heuristic method. In separate tests on 4 large websites, it reaches an average logical domain mining precision of 85%.

Key words: website structure mining, logical domain, logical domain core
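The abstract reports an average mining precision over 4 websites but does not define the measure on this page. The following is a minimal sketch, assuming the standard definition of precision (correctly mined domains divided by all mined domains) and assuming each logical domain can be identified by the path of its entry page. The function name, the site names, and the sample paths are all hypothetical and are not taken from the paper; the numbers are illustrative only and do not reproduce the reported result.

```python
from statistics import mean


def mining_precision(mined, reference):
    """Share of mined logical domains that also appear in the reference set.

    Assumption: a logical domain is identified by its entry-page path.
    """
    mined_set = set(mined)
    if not mined_set:
        # No domains mined: define precision as 0.0 to avoid dividing by zero.
        return 0.0
    return len(mined_set & set(reference)) / len(mined_set)


# Hypothetical data: (mined entry pages, manually labelled entry pages) per site.
sites = {
    "site_a": (["/a", "/b", "/c", "/d"], ["/a", "/b", "/c", "/x"]),
    "site_b": (["/news", "/shop"], ["/news", "/about"]),
}

# Precision is computed per site, then averaged across sites.
per_site = {name: mining_precision(m, r) for name, (m, r) in sites.items()}
average = mean(per_site.values())

print(per_site)
print(average)
```

Averaging per-site precision, rather than pooling all domains from all sites, matches the abstract's wording that the sites were tested separately; whether the paper weights sites equally is an assumption here.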

CLC number: TP391

ZHENG Jiao-ling. Large Scale Website Logical Domain Mining Algorithm[J]. Computer Engineering, 2008, 34(9): 101-102.

http://www.ecice06.com/CN/Y2008/V34/I9/101

