Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering ›› 2009, Vol. 35 ›› Issue (17): 66-68. doi: 10.3969/j.issn.1000-3428.2009.17.022

• Software Technology and Database • Previous Articles     Next Articles

Density-reachable Based Clustering Algorithm for Multi-density

XUE Li-xiang, QIU Bao-zhi   

  1. (School of Information Engineering, Zhengzhou University, Zhengzhou 450052)
  • Received:1900-01-01 Revised:1900-01-01 Online:2009-09-05 Published:2009-09-05

基于密度可达的多密度聚类算法

薛丽香,邱保志   

  1. (郑州大学信息工程学院,郑州 450052)

Abstract: In order to cluster multi-density dataset, a clustering algorithm based on density-reachable for multi-density is proposed. Grid partition method is used to improve efficiency when computing each point’s density. A clustering starts with the highest density point and uses expansion to form a cluster based on density-reachable and breadth-first strategy. Experimental results show that this algorithm can effectively discover clusters of arbitrary shapes for multi-density and uniformity density data sets with noises. It can get good cluster quality and is more efficient than SNN algorithm.

Key words: clustering algorithm, neighborhood grid, density-reachable, breadth-first, multi-density

摘要: 为对多密度数据集聚类,提出一种基于密度可达的多密度聚类算法。使用网格划分技术来提高计算每个点密度值的效率,每次聚类都是从最高密度点开始,根据密度可达的概念和广度优先的策略逐步向外扩展进行聚类。实验表明,该算法能够有效地对任意形状、大小的均匀数据集和多密度数据集进行聚类,并能较好地识别出孤立点和噪声,其精度和效率优于SNN算法。

关键词: 聚类算法, 邻域网格, 密度可达, 广度优先, 多密度

CLC Number: