K-means Clustering Algorithm with Meliorated Initial Center (初始聚类中心优化的k-means算法)

doi:10.3969/j.issn.1000-3428.2007.03.024

Computer Engineering (计算机工程), 2007, Vol. 33, Issue 03: 65-66.

YUAN Fang (袁方), ZHOU Zhiyong (周志勇), SONG Xin (宋鑫)

(College of Mathematics and Computer, Hebei University, Baoding 071002)

Online: 2007-02-05 Published: 2007-02-05

Abstract

The traditional k-means algorithm is sensitive to its initial cluster centers: the clustering result fluctuates with different initial inputs. To eliminate this sensitivity, a method for optimizing the initial cluster centers is proposed. The method computes the density of the region around each data object, then selects as initial cluster centers the k points that lie in high-density regions and are farthest from one another. Experiments on standard UCI data sets show that the improved k-means algorithm produces high-purity, higher-quality clustering results and eliminates the sensitivity to the initial input.

Key words: data mining, clustering, k-means algorithm, cluster center
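The selection step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the density measure, the threshold for "high density", or the exact rule for picking mutually distant points, so everything below is an assumption made for illustration: density is approximated by the distance to the m-th nearest neighbor, a point counts as high-density when that distance is no larger than the mean, and the k centers are chosen greedily with a maximin rule starting from the densest point. The names `kth_neighbor_distance`, `initial_centers`, and the parameter `m` are hypothetical and do not come from the paper.

```python
import math
from typing import List, Sequence

Point = Sequence[float]


def kth_neighbor_distance(points: List[Point], m: int) -> List[float]:
    """Distance from each point to its m-th nearest other point.

    A smaller value means a denser neighborhood. Requires m <= len(points) - 1.
    """
    radii = []
    for i, p in enumerate(points):
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        radii.append(dists[m - 1])
    return radii


def initial_centers(points: List[Point], k: int, m: int = 3) -> List[Point]:
    """Pick k mutually distant points from the high-density part of the data."""
    radii = kth_neighbor_distance(points, m)
    mean_radius = sum(radii) / len(radii)
    # Assumption: "high density" = m-th neighbor distance no larger than the mean.
    dense = [i for i, r in enumerate(radii) if r <= mean_radius]
    if len(dense) < k:
        raise ValueError("fewer than k high-density points; adjust m or k")
    # Assumption: seed with the densest point, then grow the set greedily.
    chosen = [min(dense, key=lambda i: radii[i])]
    while len(chosen) < k:
        # Maximin step: take the dense point whose nearest chosen center
        # is as far away as possible.
        nxt = max(
            (i for i in dense if i not in chosen),
            key=lambda i: min(math.dist(points[i], points[c]) for c in chosen),
        )
        chosen.append(nxt)
    return [points[i] for i in chosen]


# Two tight groups plus one isolated point; the isolated point has a large
# m-th neighbor distance and is therefore excluded from the candidates.
data = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1),
        (5.0, 5.0), (5.1, 5.0), (5.0, 5.1), (5.1, 5.1),
        (2.5, 9.0)]
print(initial_centers(data, k=2, m=3))
```

Because no random choice is involved, the same data always yields the same starting centers, which is the property the abstract attributes to the proposed method. The returned points would then replace the random seeds in an ordinary k-means run.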

YUAN Fang, ZHOU Zhiyong, SONG Xin. K-means Clustering Algorithm with Meliorated Initial Center[J]. Computer Engineering, 2007, 33(03): 65-66.

http://www.ecice06.com/CN/Y2007/V33/I03/65

