Computer Engineering ›› 2010, Vol. 36 ›› Issue (21): 28-30. doi: 10.3969/j.issn.1000-3428.2010.21.010

• Software Technology and Database •

Temporal XML Document Clustering Method Based on Frequently Changing Structure

LUO Zi-heng, LI Wei, SUN Tao, LI Xiong-fei

  1. (Key Laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun 130012, China)
  • Online: 2010-11-05 Published: 2010-11-03
  • About the authors: LUO Zi-heng (born 1984), male, master's student, research interests: database technology and XML data mining; LI Wei, master's student; SUN Tao, doctoral student; LI Xiong-fei, professor and doctoral supervisor
  • Funding:
    Supported by the Science and Technology Development Plan of Jilin Province (20070321, 20090704)

Abstract: To address the limited ability of XML documents to represent dynamic change, this paper models XML documents with added temporal information and proposes two time-series-based mining algorithms, FCSBF and FCSDF, that discover Frequently Changing Structures (FCS) efficiently in dynamic XML documents. Building on these algorithms, a new clustering method for dynamic XML documents is presented. Experimental results show that the method clusters dynamic XML documents effectively.

Key words: temporal XML document, Frequently Changing Structure (FCS), document clustering
