作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2010, Vol. 36 ›› Issue (11): 8-10. doi: 10.3969/j.issn.1000-3428.2010.11.003

• 博士论文 • 上一篇    下一篇

基于密度的并行聚类算法

陈敏, 高学东, 栾绍峻, 郗玉平   

  1. (1. 北京科技大学经济管理学院,北京 100083;2. 北京信息职业技术学院汽车工程系,北京 100015)
  • 出版日期:2010-06-05 发布日期:2010-06-05
  • 作者简介:陈 敏(1976-),女,博士研究生,主研方向:数据挖掘;高学东,教授;栾绍峻,博士研究生;郗玉平,硕士
  • 基金资助:

    国家自然科学基金资助项目“高维稀疏数据聚类研究”(70771007)

Parallel Clustering Algorithm Based on Density

Chen-Min, GAO Hua-Dong, LUAN Chao-Jun, XI Yu-Beng   

  1. (1. School of Economics and Management, University of Science and Technology Beijing, Beijing 100083; 2. Department of Automotive Engineering, Beijing Information Technology College, Beijing 100015)
  • Online:2010-06-05 Published:2010-06-05

摘要:

为满足大规模空间数据库的聚类需求,面向计算机集群,提出一种基于密度的并行聚类算法。该算法根据数据库分布特征进行数据分区,在每一个节点上对数据块并行聚类,在主节点上合并聚类结果。实验结果表明,该算法的计算速度随着节点数的增多呈线性增加,具有较好的延展性。

关键词: 并行聚类, 计算机集群, 数据库, 延展性

Abstract:

In order to meet the demands for large scale databases clustering, this paper proposes a parallel clustering algorithm based on density for computer colony. This algorithm goes on data partition according to database distribution feature, processes data block parallel clustering on every node, merges clustering result on main node. Experimental result shows that computing speed of this algorithm is linear increment with number of node increasing, and it has better extensibility.

Key words: parallel clustering, computer colony, database, extensibility parallel clustering, computer colony, database, extensibility parallel clustering, computer colony, database, extensibility

中图分类号: