Computer Engineering, 2011, Vol. 37, Issue 13: 46-48, 51. doi: 10.3969/j.issn.1000-3428.2011.13.013

• Software Technology and Database •

Research of Data Stream Clustering Method Based on Shape Feature

WU Xue-yan 1,2, HUANG Dao-ping 1

  (1. College of Automation Science and Engineering, South China University of Technology, Guangzhou 510640, China; 2. School of Management, Guangdong University of Technology, Guangzhou 510520, China)
  • Received: 2011-02-07 Online: 2011-07-05 Published: 2011-07-05
  • About the authors: WU Xue-yan (born 1980), female, lecturer and Ph.D. candidate, main research interests: data mining, intelligent detection and control; HUANG Dao-ping, professor, Ph.D.
  • Funding:
    Natural Science Foundation of Guangdong Province (6300278); Youth Foundation of Guangdong University of Technology (092036)

Abstract: To retain the important shape and trend features of the data during clustering, this paper proposes a data stream clustering method based on shape features. In the initialization stage, important feature points are extracted to represent the segments of each sequence. In the online update stage, the Partial Dynamic Time Warping (PDTW) method is used to compute the distances between subsequences, and a dynamic sliding window keeps the data in multiple streams synchronized. In the user-triggered clustering stage, a data stream clustering method is proposed. Analysis of simulated data and real stock data shows that, with reasonable parameter settings, the method achieves a clustering evolution accuracy close to 0.95.

Key words: data stream, clustering evolution, data mining, shape feature
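
The abstract names two building blocks: representing a sequence by its important points, and comparing subsequences with a time-warping distance. This page does not give the paper's feature-point criteria or its Partial DTW variant, so the sketch below substitutes a simple local-extremum rule and classic dynamic time warping. The function names and the `min_change` threshold are illustrative assumptions, not the authors' definitions.

```python
from math import inf


def important_points(series, min_change=0.0):
    """Return indices of the two endpoints plus the local extrema of `series`.

    `min_change` is a hypothetical threshold: an extremum is kept only if it
    differs from the previously kept point by more than this amount.
    """
    if len(series) < 3:
        return list(range(len(series)))
    kept = [0]
    for i in range(1, len(series) - 1):
        left, mid, right = series[i - 1], series[i], series[i + 1]
        is_peak = mid > left and mid >= right
        is_valley = mid < left and mid <= right
        if (is_peak or is_valley) and abs(mid - series[kept[-1]]) > min_change:
            kept.append(i)
    kept.append(len(series) - 1)
    return kept


def dtw_distance(a, b):
    """Classic DTW distance with absolute-difference cost (not the paper's PDTW)."""
    n, m = len(a), len(b)
    if n == 0 or m == 0:
        raise ValueError("both sequences must be non-empty")
    # cost[i][j] is the minimal cumulative cost of aligning a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = abs(a[i - 1] - b[j - 1])
            cost[i][j] = step + min(
                cost[i - 1][j],      # a[i-1] also matches the previous b element
                cost[i][j - 1],      # b[j-1] also matches the previous a element
                cost[i - 1][j - 1],  # both sequences advance together
            )
    return cost[n][m]
```

Under these assumptions, two streams would first be reduced with `important_points` and the resulting value sequences compared with `dtw_distance`. Because warping lets one point align with several, sequences that share a shape but differ in length or timing can still have a small distance.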
