Computer Engineering ›› 2007, Vol. 33 ›› Issue (11): 1-3. doi: 10.3969/j.issn.1000-3428.2007.11.001

• Doctoral Thesis •

Maintaining Generator Representation Based on Recent Data in Stream

WANG Bingzheng1, HUANG Yalou2   

  (1. College of Information Technical Science, Nankai University, Tianjin 300071; 2. College of Software, Nankai University, Tianjin 300071)
  • Received:1900-01-01 Revised:1900-01-01 Online:2007-06-05 Published:2007-06-05

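Abstract: Frequent data updates in a data stream make it difficult to re-mine the entire data set. This paper proposes a method for dynamically maintaining the Generator representation based on recent data in a data stream. By delimiting the border itemsets, any itemset change that a data update may cause can be detected within the border set, without storing all frequent itemsets. Processing is thus confined to the itemsets related to the update, and the method achieves good results.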

Abstract: Frequent data updates in a data stream make re-mining the whole data set difficult. This paper proposes maintaining the generator representation, together with its border sets, based on recent data in the stream. Representation changes caused by data updates can be detected in those border sets. Updating work is therefore confined to the itemsets related to the updated transactions, and the approach achieves good results.

Key words: Generator, Data stream, Data mining
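
The idea in the abstract can be illustrated with a minimal sketch. This page does not give the paper's algorithm, so the code below is not the authors' method: it simply recomputes, by brute force, the frequent generators and a border set over a window of recent transactions, so that the effect of one update on the border is visible. It assumes the common definitions that a generator is an itemset whose support is strictly smaller than that of every proper subset, and that the border holds the minimal itemsets that are not frequent generators. The names `support`, `generators_and_border`, `min_sup`, and the sample stream are all hypothetical.

```python
from collections import deque
from itertools import combinations


def support(itemset, window):
    """Count the transactions in the window that contain every item of itemset."""
    return sum(1 for t in window if itemset <= t)


def generators_and_border(window, min_sup):
    """Brute-force level-wise scan (illustrative only, not incremental).

    Returns (gens, border): gens maps each frequent generator to its support;
    border holds the minimal itemsets that are infrequent or not generators.
    Assumes the window holds at least min_sup transactions.
    """
    items = sorted(set().union(*window))
    gens = {frozenset(): len(window)}  # the empty itemset is treated as a generator
    border = set()
    for k in range(1, len(items) + 1):
        found = False
        for combo in combinations(items, k):
            x = frozenset(combo)
            subsets = [x - {i} for i in x]
            # Only itemsets whose immediate subsets are all frequent
            # generators are candidates at this level.
            if any(s not in gens for s in subsets):
                continue
            sup = support(x, window)
            if sup >= min_sup and all(sup < gens[s] for s in subsets):
                gens[x] = sup
                found = True
            else:
                border.add(x)
        if not found:
            break  # no generators at this level, so no larger candidates exist
    return gens, border


# Hypothetical stream; the window keeps only the 4 most recent transactions.
stream = [{"a", "b"}, {"a", "b", "c"}, {"a", "c"}, {"b", "c"}, {"a", "b", "c"}]
window = deque(maxlen=4)
for t in stream[:4]:
    window.append(frozenset(t))
gens_before, border_before = generators_and_border(window, min_sup=2)

window.append(frozenset(stream[4]))  # one update: the oldest transaction drops out
gens_after, border_after = generators_and_border(window, min_sup=2)

print(sorted(map(sorted, border_before)))
print(sorted(map(sorted, border_after)))
```

In this toy run the single update moves the border from `{a, b, c}` down to `{c}`, because `c` now occurs in every transaction of the window and so stops being a generator. An incremental method of the kind the abstract describes would aim to detect such a change by examining only the border and the itemsets touched by the update, rather than rescanning the window as this sketch does.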
