作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2009, Vol. 35 ›› Issue (9): 57-59. doi: 10.3969/j.issn.1000-3428.2009.09.020

• 软件技术与数据库 • 上一篇    下一篇

进化数据流中基于密度的聚类算法

蔡春丽,王惠玲,孙延明   

  1. (中国电子科技集团公司第四十九研究所,哈尔滨 150001)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2009-05-05 发布日期:2009-05-05

Density-based Clustering Algorithm in Evolving Data Stream

CAI Chun-li, WANG Hui-ling, SUN Yan-ming   

  1. (China Electronics Technology Group Corporation No.49 Research Institute, Harbin 150001)
  • Received:1900-01-01 Revised:1900-01-01 Online:2009-05-05 Published:2009-05-05

摘要: 分析当前数据流聚类算法的优点及不足,提出一种新的进化数据流中基于密度的聚类算法——Sdstream算法,该算法能够分析并处理大规模进化数据流,利用真实数据集和仿真数据集对其进行性能测试,实验结果表明,该算法具有良好的适用性、有效性和可扩展性,能够取得较高的聚类效果。

关键词: 数据挖掘, 进化数据流, 聚类, 滑动窗口

Abstract: On basis of analyzing the advantages and weaknesses of the current clustering algorithm of data streams, this paper introduces a new density-based clustering algorithm in evolving data streams——SDStream, which can analyze and deal with large-scale evolving data stream. Its performance is tested by using both real datasets and synthetic datasets. Experimental results show this algorithm has better perpformance of applicability, effectiveness and extension and achieves a higher quality of clustering.

Key words: data mining, evolving data stream, clustering, sliding window

中图分类号: