改进的k-平均聚类算法研究

doi:10.3969/j.issn.1000-3428.2007.13.068

计算机工程 ›› 2007, Vol. 33 ›› Issue (13): 200-201,. doi: 10.3969/j.issn.1000-3428.2007.13.068

改进的k-平均聚类算法研究

孙士保1,2，秦克云1

(1. 西南交通大学智能控制开发中心，成都 610031；2. 河南科技大学电子信息工程学院，洛阳 471003)

收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2007-07-05 发布日期:2007-07-05

Research on Modified k-means Data Cluster Algorithm

SUN Shibao1,2, QIN Keyun1

(1. Intelligent Control Development Center, Southwest Jiaotong University, Chengdu 610031; 2. Electronic Information Engineering College, Henan University of Science and Technology, Luoyang 471003)

Received:1900-01-01 Revised:1900-01-01 Online:2007-07-05 Published:2007-07-05

摘要/Abstract

摘要： 聚类算法的好坏直接影响聚类的效果。该文讨论了经典的k-平均聚类算法，说明了它存在不能很好地处理符号数据和对噪声与孤立点数据敏感等不足，提出了一种基于加权改进的k-平均聚类算法，克服了k-平均聚类算法的缺点，并从理论上分析了该算法的复杂度。实验证明，用该方法实现的数据聚类与传统的基于平均值的方法相比较，能有效提高数据聚类效果。

关键词: 聚类算法, k-平均, 权, 聚类数据挖掘

Abstract: The method of data clustering will influence the effect of clustering directly. The algorithm of k-means is discussed, the shortages of this algorithm such as it can not deal with symbolic data and it is sensitive for data of isolation point and noise are demonstrated. A modified k-means clustering algorithm based on weights is put forward, it changes the shortcomings of k-means. Its complexity is analyzed from theoretical. The experiments show that, compared with traditional method based on means, the modified data clustering algorithm can improve the efficiency of data clustering.

Key words: cluster algorithm, k-means, weights, cluster data mining

中图分类号:

TP183

孙士保;秦克云. 改进的k-平均聚类算法研究[J]. 计算机工程, 2007, 33(13): 200-201,.

SUN Shibao; QIN Keyun. Research on Modified k-means Data Cluster Algorithm[J]. Computer Engineering, 2007, 33(13): 200-201,.

http://www.ecice06.com/CN/Y2007/V33/I13/200

[1]	马建红, 龚天, 姚爽. 基于证据句与图卷积网络的文档级关系抽取[J]. 计算机工程, 2023, 49(8): 104-110.
[2]	徐正梅, 刘华明, 毕学慧, 王亚. 基于特征优化的无参考光场图像质量评价[J]. 计算机工程, 2023, 49(7): 242-250.
[3]	汤卫芬, 高翠芳. 极值点自适应加权的动态时间规整算法[J]. 计算机工程, 2023, 49(7): 150-160.
[4]	潘大志, 蒋妍, 刘雅文. 求解多维背包问题的双决策交互差异算法[J]. 计算机工程, 2023, 49(7): 21-33.
[5]	王新迪, 杨夙, 张思源, 罗午阳, 李杰, 刘辉. 基于时空大数据与卫星图像的城市火灾风险预测[J]. 计算机工程, 2023, 49(6): 242-249.
[6]	安志国, 彭政, 易满成, 刘健欣, 俞思帆. 神经网络滤波器竞争训练[J]. 计算机工程, 2023, 49(4): 120-124.
[7]	杨立伟, 贾博宇, 王芳, 彭祥原. 可见光通信与WiFi异构网络资源管理算法[J]. 计算机工程, 2023, 49(3): 203-210,220.
[8]	温静, 杨洁. 基于场景对象注意与深度图融合的深度估计[J]. 计算机工程, 2023, 49(2): 222-230.
[9]	王朕, 李豪, 严冬梅, 竺永荣. 基于改进YOLOv5的路面病害检测模型[J]. 计算机工程, 2023, 49(2): 15-23.
[10]	江雨燕, 邵金, 李平. 融合自动权重学习的深度子空间聚类[J]. 计算机工程, 2022, 48(8): 77-84,97.
[11]	贺娜, 马盈仓. 融合KL信息的多视图模糊聚类算法[J]. 计算机工程, 2022, 48(7): 114-121,150.
[12]	邢彤彤, 孙仁诚, 邵峰晶, 隋毅. 深度学习中的权重初始化方法研究[J]. 计算机工程, 2022, 48(7): 104-113.
[13]	艾成豪, 高建华, 黄子杰. 混合特征选择和集成学习驱动的代码异味检测[J]. 计算机工程, 2022, 48(7): 168-176,198.
[14]	曹瑞阳, 郭佑民, 牛满宇. 基于最大最小距离的多中心数据综合增强方法[J]. 计算机工程, 2022, 48(6): 174-181.
[15]	王兵, 李辉灵, 牛新征. 基于综合选举的DPoS共识算法[J]. 计算机工程, 2022, 48(6): 50-56.

选择文件类型/文献管理软件名称

选择包含的内容

改进的k-平均聚类算法研究

Research on Modified k-means Data Cluster Algorithm

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价

模态框（Modal）标题

选择文件类型/文献管理软件名称

选择包含的内容

改进的k-平均聚类算法研究

Research on Modified k-means Data Cluster Algorithm

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价