作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2008, Vol. 34 ›› Issue (11): 15-17. doi: 10.3969/j.issn.1000-3428.2008.11.006

• 博士论文 • 上一篇    下一篇

基于局部搜索机制的K-Means聚类算法

孙越恒,李志圣,何丕廉   

  1. (天津大学计算机学院,天津 300072)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2008-06-05 发布日期:2008-06-05

K-Means Clustering Algorithm Based on Local Search Mechanism

SUN Yue-heng, LI Zhi-sheng, HE Pi-lian   

  1. (School of Computer Science and Technology, Tianjin University, Tianjin 300072)
  • Received:1900-01-01 Revised:1900-01-01 Online:2008-06-05 Published:2008-06-05

摘要: K-Means聚类算法的结果质量依赖于初始聚类中心的选择。该文将局部搜索的思想引入K-Means算法,提出一种改进的KMLS算法。该算法对K-Means收敛后的结果使用局部搜索来使其跳出局部极值点,进而再次迭代求优。同时对局部搜索的结果使用K-Means算法使其尽快到达一个局部极值点。理论分析证明了算法的可行性和有效性,而在标准文本集上的文本聚类实验表明,相对于传统的K-Means算法,该算法改进了聚类结果的质量。

关键词: K-Means聚类算法, 局部搜索机制, KMLS算法, 文本聚类

Abstract: The quality of K-Means clustering algorithm depends on the choice of cluster center. This paper introduces the idea of local search mechanism into K-Means and presents a KMLS algorithm. This algorithm uses the local search mechanism to jump out one local critical point obtained by K-Means, and uses K-Means to quickly find another local critical point. Experiments of text clustering in standard document sets show that this algorithm achieves a better clustering result than the traditional K-Means algorithm does.

Key words: K-Means clustering algorithm, local search mechanism, KMLS algorithm, text clustering

中图分类号: