Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering ›› 2008, Vol. 34 ›› Issue (10): 101-102. doi: 10.3969/j.issn.1000-3428.2008.10.036

• Software Technology and Database • Previous Articles     Next Articles

Clustering Algorithm of High-dimensional Data Based on Unit Region

XIE Kun-wu, HU Jun-peng   

  1. (School of Information Engineering, Hubei Institute for Nationalities, Enshi 445000)
  • Received:1900-01-01 Revised:1900-01-01 Online:2008-05-20 Published:2008-05-20

基于单元区域的高维数据聚类算法

谢坤武,胡俊鹏   

  1. (湖北民族学院信息工程学院,恩施 445000)

Abstract: This paper proposes a Clustering Algorithm of High-dimensional Data(CAHD). Unit regions with intensive data points are found by employing the two-way search strategy in the designated n-dimensional space or its subspaces, and these intensive modules are clustered by a case-by-phase approach. Two-way search strategy can effectively reduce the search space, improve the efficiency of algorithms, and cluster intensive regional unit only uses one by one with two machines and displacement direction. Experimental results show that the running time CAHD algorithm spent is 30% less than other algorithms with the same number of categories found.

Key words: clustering algorithm, high-dimensional data, unit

摘要: 提出一种高维数据集合聚类算法(CAHD)。采用双向搜索策略在指定的n维空间或其子空间上发现数据点密集的单元区域,采用逐位相与的方法为这些密集单元区域聚类。双向搜索策略能够有效地减少搜索空间,提高算法效率,聚类密集单元区域只用到逐位与和位移2种机器指令。实验结果表明,在发现的类数量相同的情况下,CAHD算法的运行时间比其他算法减少30%。

关键词: 聚类算法, 高维数据, 单元

CLC Number: