Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering ›› 2010, Vol. 36 ›› Issue (22): 72-74. doi: 10.3969/j.issn.1000-3428.2010.22.025

• Networks and Communications • Previous Articles     Next Articles

Completeness Proof and Implementation of Parallel k-means Clustering Algorithm

Completeness Proof and Implementation of Parallel k-means Clustering Algorithm   

  1. (School of Information, Yunnan University of Finance and Economics, Kunming 650221, China)
  • Online:2010-11-20 Published:2010-11-18

并行k均值聚类算法的完备性证明与实现

TAO Ye, ZENG Zhi-yong, YU Jian-kun, FENG Tao   

  1. (云南财经大学信息学院,昆明 650221)
  • 作者简介:陶 冶(1969-),男,讲师、硕士,主研方向:数据挖掘,并行计算;曾志勇,副教授、博士;余建坤,教授、硕士;冯 涛,讲师、硕士
  • 基金资助:
    云南省自然科学基金资助项目(2007G079M);云南省教育厅科学研究基金资助项目(6Y0159D)

Abstract: This paper analyzes classical k-means clustering algorithm, proves the way that generated complete global clustering information from the local clustering information in the case of minimizing the cost of communication between computing nodes. The clustering quality of the way is equivalent to the corresponding serial algorithm, and has higher efficiency in the implementation. On this basis, this paper gives a credible parallel k-means algorithm based on Message Passing Interface(MPI). Experimental result shows that the algorithm is efficient and feasible.

Key words: clustering, k-means algorithm, parallel, Message Passing Interface(MPI)

摘要: 对经典k均值算法进行分析,证明如何在减少节点间通信代价的情况下,从局部聚类信息生成完备的全局聚类信息,使聚类质量等价于相应串行算法,并具有较高的执行效率,在此基础上给出可信的基于消息传递接口的并行k均值算法。实验结果表明,该算法是高效的和可行的。

关键词: 聚类, k均值算法, 并行, 消息传递接口

CLC Number: