作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2011, Vol. 37 ›› Issue (15): 40-42,45. doi: 10.3969/j.issn.1000-3428.2011.15.011

• 软件技术与数据库 • 上一篇    下一篇

基于竞争学习的K质心组合聚类算法

张 宇1a,邵良杉1b,邱云飞2,刘 威1a   

  1. (1. 辽宁工程技术大学 a. 理学院;b. 系统工程研究所,辽宁 阜新 123000; 2. 辽宁工程技术大学软件学院,辽宁 葫芦岛 125105)
  • 收稿日期:2010-12-15 出版日期:2011-08-05 发布日期:2011-08-05
  • 作者简介:张 宇(1981-),男,讲师、硕士,主研方向:数据挖掘,模式识别;邵良杉,教授、博士、博士生导师;邱云飞,副教授、博士;刘 威,讲师、硕士
  • 基金资助:
    国家自然科学基金资助项目(70971059)

Combination Clustering Algorithm of K-Centroid Based on Competitive Learning

ZHANG Yu  1a, SHAO Liang-shan  1b, QIU Yun-fei  2, LIU Wei  1a   

  1. (1a. School of Science; 1b. System Engineering Institute, Liaoning Technical University, Fuxin 123000, China; 2. School of Software, Liaoning Technical University, Huludao 125105, China)
  • Received:2010-12-15 Online:2011-08-05 Published:2011-08-05

摘要: K-Means算法的聚类结果对初始簇的选择非常敏感,通常获得的是局部最优解而非全局最优解。为此,在K-Means聚类算法基础上,引入组合聚类和竞争学习概念,提出一种基于竞争学习的K质心组合聚类算法CLK-Centroid。该算法采用竞争学习策略计算簇的质心,以适应噪声数据和分布异常数据的要求,使用组合聚类策略提高聚类的精度。在数据集上构建多个CLK-Centroid聚类器进行聚类,构建子簇相似矩阵,并根据子簇之间的相似性合并相似簇。理论分析和实验结果表明该算法能够提高聚类质量。

关键词: CLK-Centroid算法, K-Means算法, 竞争学习, 组合, 聚类

Abstract: For the choice of initial cluster K-Means clustering algorithm is very sensitive, its results are frequently a local optimal solution rather than a global optimal solution. On the premise of studying K-Means clustering algorithm, this paper introduces a concept of clustering and competitive learning, and proposes a combination clustering algorithm of K-Centroid based on competitive learning(CLK-Centroid algorithm). CLK-Centroid algorithm adapts the noise data and outliers by using a strategy of competitive learning to compute a cluster Centroid, and improves the precision of cluster by using a strategy of combination clustering. Building a multiple of CLK-Centroid clustering models to cluster on data set, the different sub-cluster that comes from the different clustering results must contain an intersection. The sub-cluster similarity matrix is built to merge similar cluster according to the similarity between the sub-clusters. Theoretical analysis and experimental results show that the algorithm can improve the clustering quality.

Key words: CLK-Centroid algorithm, K-Means algorithm, competitive learning, combination, clustering

中图分类号: