作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2006, Vol. 32 ›› Issue (1): 48-50.

• 软件技术与数据库 • 上一篇    下一篇

基于 WSTB 的时间序列相似搜索算法

梁建海,杜 军,孙秀霞   

  1. 空军工程大学工程学院,西安 710038
  • 出版日期:2006-01-05 发布日期:2006-01-05

A WSTB-based Algorithm for Similarity Search in Time Series Database

LIANG Jianhai, DU Jun, SUN Xiuxia   

  1. Engineering College, Air Force University of Engineering, Xi’an 710038
  • Online:2006-01-05 Published:2006-01-05

摘要: 提出了基于WSTB(Weighted Shape To Bit-vector)的相似搜索方法,该方法在线性分段的基础上建立时间序列曲线箱,而且创立具有相似形状的时序子序列箱后建立相应的索引,对给定序列和相似序列距离的快速计算,并根据查询的时间序列的特征确定相应的权重,不需要逐个检查子序列箱内容就可以进行快速索引。WSTB 方法避免了进行逐个距离比较而造成的巨大的计算量,从而明显地提高搜索效率。最后验证了方法的通用性和有效性。

关键词: 数据挖掘;时间序列;线性分段;相似性

Abstract: A WSTB-based algorithm for similarity search is proposed which is based on the piecewise linear representation. The subsequence bin for time series is built at first and the index of the bin is built. After that, the distance of the given sequence and similar sequence is calculated. The weighted coefficient for every sequence is decided on the character. So the inquiry can be implemented without checking the content of the bin. The quantity of the WSTB calculation which is got from comparing one by one is avoided. The searching efficiency can be improved obviously. At last, the currency and efficiency of the algorithm are proved.

Key words: Data mining; Time series; Piecewise linear representation; Similarity