作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2009, Vol. 35 ›› Issue (14): 49-51. doi: 10.3969/j.issn.1000-3428.2009.14.017

• 软件技术与数据库 • 上一篇    下一篇

多数据流的增量聚类实现与应用

张锡琴   

  1. (浙江工业大学经贸管理学院,杭州 310032)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2009-07-20 发布日期:2009-07-20

Realization and Application of Incrementally Clustering of Multi-data Streams

ZHANG Xi-qin   

  1. (College of Business Administration, Zhejiang University of Technology, Hangzhou 310032)
  • Received:1900-01-01 Revised:1900-01-01 Online:2009-07-20 Published:2009-07-20

摘要: 针对时间序列数据流的增量聚类研究较少的现状,采用多维时态子空间聚类对数据流的增量聚类进行探究。多维时态子空间聚类是指在连续一段时间内,数据流中的值的距离小于2α,它的另一个要求是最后的聚类结果必须包含一定数量的数据流。聚类结果随时间的演变能持续增量地更新,这个更新机制采用滑动窗口的形式,把最早时刻的数据删除后,添加入新到达的数据。采用股票数据对算法进行测试与验证,实验证明,该算法效果较好。

关键词: 数据流, 增量聚类, 多维时态子空间聚类

Abstract: There is only a little research on incrementally cluster temporal data streams. This paper focuses on the problem of clustering temporal data streams based on the sliding window. Temporal Multiple-dimension Subspace α-Cluster(TMSC) is adopted. It consists of data streams, whose distance is less than 2α, and must contain a number of streams which are predefined. The result of the clustering is updated through time evolving. Sliding window mechanism is adopted to realize it. The earliest data is removed, and the new coming data is added. Stock data is used to test the algorithm, and the result is quite well.

Key words: data stream, incrementally clustering, Temporal Multiple-dimensional Subspace α-Cluster(TMSC

中图分类号: