摘要: 为解决XML文档对动态性表示不足的问题,通过对XML文档加入时间信息进行建模,提出2种基于时间序列的XML文档频繁变化结构挖掘算法FCSBF和FCSDF,实现对动态XML文档频繁变化结构的高效挖掘。在此基础上提出一种针对动态XML文档的聚类新方法,实验结果证明,该方法能够对动态XML文档进行有效的聚类。
关键词:
时序XML文档,
频繁变化结构,
文档聚类
Abstract: In order to solve the problems that XML document can not reflect the dynamic characteristics well, this paper adopts the method of modeling XML as temporal XML and proposes two mining algorithms based on temporal XML document named FCSBF and FCSDF to discover Frequently Changing Structure(FCS), which realizes efficiently mining frequently changing structures in dynamic XML documents. A novel clustering method for dynamic XML documents is presented. Experimental results show that the method is effective on clustering dynamic XML documents.
Key words:
temporal XML document,
Frequently Changing Structure(FCS),
document clustering
中图分类号:
罗梓恒, 李巍, 孙涛, 李雄飞. 基于频繁变化结构的时序XML文档聚类方法[J]. 计算机工程, 2010, 36(21): 28-30.
LUO Zi-Heng, LI Wei, SUN Chao, LI Xiong-Fei. Temporal XML Document Clustering Method Based on Frequently Changing Structure[J]. Computer Engineering, 2010, 36(21): 28-30.