Computer Engineering (计算机工程) ›› 2010, Vol. 36 ›› Issue (1): 85-86,9. doi: 10.3969/j.issn.1000-3428.2010.01.030

• Software Technology and Database •

基于XML的检索结果聚类方法

余 宏1,万常选2   

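  (1. Center of Information, Nanchang Teachers College, Nanchang 330029; 2. School of Information Management, Jiangxi University of Finance and Economics, Nanchang 330013)
  • Received: 1900-01-01 Revised: 1900-01-01 Online: 2010-01-05 Published: 2010-01-05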

Retrieval Result Clustering Method Based on XML

YU Hong1, WAN Chang-xuan2   

  (1. Center of Information, Nanchang Teachers College, Nanchang 330029; 2. School of Information Management, Jiangxi University of Finance and Economics, Nanchang 330013)
  • Received: 1900-01-01 Revised: 1900-01-01 Online: 2010-01-05 Published: 2010-01-05

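Abstract: To address the semi-structured nature of XML documents, this paper proposes a new approach to modeling XML retrieval result fragments, designs a method that measures the similarity of the corresponding documents by combining content and structural semantic information, and presents a dynamic-means soft clustering algorithm suited to the needs of retrieval result clustering. Experiments show that the XML-oriented retrieval result clustering method achieves better clustering results than traditional methods.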

Keywords: XML retrieval result clustering, structural semantic similarity, content similarity, clustering algorithm

Abstract: Exploiting the semi-structured nature of XML documents, this paper proposes a new method for modeling XML retrieval result fragments as documents, and designs a method for computing keyword relevance and measuring the structural semantic similarity between documents. A new algorithm, Dynamic k-means Soft Clustering (DKMSC), is proposed to meet the requirements of clustering retrieval results. Experiments indicate that this method of clustering XML retrieval results performs clearly better than the traditional approach.

Key words: XML retrieval result clustering, structural semantic similarity, content similarity, clustering algorithm
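
The abstract describes a document similarity that draws on both content and structure, and a soft clustering step, but gives no formulas. The following is only an illustrative sketch under stated assumptions, not the paper's DKMSC algorithm: it assumes cosine similarity over term counts for content, Jaccard overlap of root-to-node tag paths for structure, a linear weight `alpha` to combine them, and a similarity threshold for soft (multi-cluster) membership. All function names, the fragment layout (`terms`, `paths`), and the parameter values are hypothetical.

```python
import math
from collections import Counter


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two term-count vectors (assumed content measure)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def jaccard(a: set, b: set) -> float:
    """Jaccard overlap between two sets of tag paths (assumed structure measure)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def combined_similarity(frag_a: dict, frag_b: dict, alpha: float = 0.5) -> float:
    """Weighted mix of content and structure similarity; alpha is a hypothetical weight."""
    content = cosine(Counter(frag_a["terms"]), Counter(frag_b["terms"]))
    structure = jaccard(set(frag_a["paths"]), set(frag_b["paths"]))
    return alpha * content + (1 - alpha) * structure


def soft_assign(fragment: dict, centers: list, threshold: float = 0.5,
                alpha: float = 0.5) -> list:
    """Return the indices of every center that is similar enough to the fragment.

    A fragment may belong to several clusters at once, which is what makes
    the assignment "soft" in this sketch.
    """
    return [i for i, center in enumerate(centers)
            if combined_similarity(fragment, center, alpha) >= threshold]


if __name__ == "__main__":
    fragment = {"terms": ["xml", "retrieval"], "paths": ["/article/sec"]}
    centers = [
        {"terms": ["xml"], "paths": ["/article/sec"]},
        {"terms": ["retrieval"], "paths": ["/article/sec"]},
        {"terms": ["java"], "paths": ["/book/chapter"]},
    ]
    print(soft_assign(fragment, centers))
```

In this toy input the fragment shares terms and a tag path with the first two centers and nothing with the third, so a threshold-based rule can place it in more than one cluster. A hard k-means assignment would instead pick only the single nearest center.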

CLC number: