作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2012, Vol. 38 ›› Issue (06): 56-59. doi: 10.3969/j.issn.1000-3428.2012.06.018

• 软件技术与数据库 • 上一篇    下一篇

基于网格结构的CLARANS改进算法

张书春 1,孙秀英 2   

  1. (1. 郑州大学信息工程学院,郑州 450002;2. 黄河科技学院,郑州 450063)
  • 收稿日期:2011-08-12 出版日期:2012-03-20 发布日期:2012-03-20
  • 作者简介:张书春(1981-),男,硕士研究生,主研方向:数据挖掘,企业信息管理系统;孙秀英,讲师、硕士研究生
  • 基金资助:
    上海市自然科学基金资助项目(042R14077);河南省科技攻关计划基金资助项目(2011C520016)

Improved CLARANS Algorithm Based on Grid Structure

ZHANG Shu-chun 1, SUN Xiu-ying 2   

  1. (1. School of Information Engineering, Zhengzhou University, Zhengzhou 450002, China; 2. Huanghe Science and Technology College, Zhengzhou 450063, China)
  • Received:2011-08-12 Online:2012-03-20 Published:2012-03-20

摘要: 为提高CLARANS算法的准确性和执行效率,利用网格聚类算法对数据空间进行划分的思想,结合统计信息网格算法,对算法初始节点和邻居节点的选择及替换总代价的计算进行改进。实验结果表明,与CLARANS算法相比,改进算法聚类结果的准确性和稳定性更高,执行时间明显降低。

关键词: CLARANS算法, 统计信息网格算法, 聚类, 相异度, 数据空间

Abstract: In order to improve the accuracy and efficiency of Clustering Large Applications based on Randomized Search(CLARANS) algorithm, this paper combines the idea of data space division which comes from grid-based algorithm Statistical Information Grid(STING), improves the CLARANS algorithm by optimizing the selection of initial node and neighbor node, optimizing the calculation of total node replaces cost. Experimental results show that, compared with the CLARANS algorithm, the improved algorithm has better accuracy and stability for the clustering results, and significantly reduce the execution time.

Key words: Clustering Large Applications based on Randomized Search(CLARANS) algorithm, Statistical Information Grid(STING) algorithm, clustering, dissimilarity degree, data space

中图分类号: