作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2006, Vol. 32 ›› Issue (15): 52-54. doi: 10.3969/j.issn.1000-3428.2006.15.019

• 软件技术与数据库 • 上一篇    下一篇

一种基于XML文档聚类的XML近似查询算法

郭俊文;衡星辰;邵利平;覃 征;田 磊;王 妮   

  1. 西安交通大学电子与信息工程学院,西安710049
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2006-08-05 发布日期:2006-08-05

An Approximate XML Query Algorithm Based on Clustering of XML Documents

GUO Junwen;HENG Xingchen;SHAO Liping;QIN Zheng;TIAN Lei;WANG Ni   

  1. School of Electronics and Information Engineering, Xi’an Jiaotong University, Xi’an 710049
  • Received:1900-01-01 Revised:1900-01-01 Online:2006-08-05 Published:2006-08-05

摘要: 提出了一种基于XML文档聚类的XML近似查询算法。给出了基于语义的XML文档间距离的计算方法,结合该语义距离,提出了基于网格的八邻域聚类算法对XML数据库进行聚类划分,进而利用在聚类过程中得到的聚类中心对静态有序选择算法的近似查询评估阶段进行优化,使得不用对XML数据库进行完全遍历就能及时返回满足用户需要的查询结果。最后,在汽车外形智能化设计的实验中表明该算法有效地提高了静态有序选择算法的查询效率。

关键词: XML文档, XML近似查询, 编辑距离, 聚类中心

Abstract: An approximate XML query algorithm based on clustering of XML documents is proposed. A method of computing the semantic distance between XML documents is given. XML database is clustered through the gridding-based eight-neighbours clustering algorithm with the semantic distance between XML documents. The approximate query evaluation of the SSO algorithm is optimized with the clustering centers gotten during the process of clustering. Finally, the experiments of intelligent design of automobile shape show the algorithm promotes the efficiency of the SSO algorithm.

Key words: XML document, Approximate XML query, Editing distance, Clustering center

中图分类号: