Computer Engineering, 2006, Vol. 32, Issue 22: 4-6. doi: 10.3969/j.issn.1000-3428.2006.22.002

• Doctoral Dissertation •

Time Series of Intervals Index Based on Clustering

WENG Xiaoqing1,2, SHEN Junyi1

  1. Institute of Computer Software, Xi’an Jiaotong University, Xi’an 710049
  2. Computer Center, Hebei University of Economics and Trade, Shijiazhuang 050061
  • Online: 2006-10-20 Published: 2006-10-20

Abstract: Most existing approaches to similarity search in time series databases focus on improving algorithmic efficiency, and few address similarity search over time series composed of imprecise data. Imprecise data are commonly represented as intervals. By identifying the important intervals in a time series of intervals, the dimensionality of the series can be greatly reduced. This paper proposes an indexing method for time series of intervals that is based on clustering the series at low resolution. Experiments show that the proposed method speeds up queries over time series of intervals while guaranteeing no false dismissals.

Key words: Time series of intervals, Similarity search, Clustering, Index
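
The abstract does not state the distance measure or the reduction procedure, so the following is only a minimal sketch of the general filter-and-refine idea behind a "no false dismissals" guarantee, not the authors' method. It assumes a Euclidean-style distance on interval endpoints and swaps in piecewise aggregate approximation (PAA) as the low-resolution representation in place of the paper's important-interval selection and clustering. All function names are hypothetical.

```python
import math

def interval_dist(a, b):
    """Assumed measure: Euclidean distance over lower and upper endpoints."""
    return math.sqrt(sum((al - bl) ** 2 + (ah - bh) ** 2
                         for (al, ah), (bl, bh) in zip(a, b)))

def paa(series, segments):
    """Low-resolution form: mean lower and upper endpoint per equal-length segment."""
    w = len(series) // segments
    assert w * segments == len(series), "length must be divisible by segments"
    return [(sum(lo for lo, _ in series[i:i + w]) / w,
             sum(hi for _, hi in series[i:i + w]) / w)
            for i in range(0, len(series), w)]

def lower_bound(pa, pb, w):
    """Distance in the reduced space, scaled by sqrt(w).

    Mathematically this never exceeds interval_dist on the full series,
    which is what makes pruning safe.
    """
    return math.sqrt(w) * interval_dist(pa, pb)

def range_query(database, query, radius, segments):
    """Return indices of series within `radius` of `query`."""
    w = len(query) // segments
    q_low = paa(query, segments)
    hits = []
    for idx, series in enumerate(database):
        # Filter: a lower bound above the radius rules the series out.
        if lower_bound(paa(series, segments), q_low, w) > radius:
            continue
        # Refine: confirm survivors with the full-resolution distance.
        if interval_dist(series, query) <= radius:
            hits.append(idx)
    return hits
```

Because the reduced-space distance is a lower bound on the full distance, any series pruned in the filter step could not have been a true match. A real index would precompute the reduced forms and group them (for example by clustering) instead of scanning every series.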

CLC Number: