作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2010, Vol. 36 ›› Issue (21): 73-75. doi: 10.3969/j.issn.1000-3428.2010.21.026

• 软件技术与数据库 • 上一篇    下一篇

最小频繁闭树的增量式更新算法

郭 鑫1,黄 云1,刘介丹2,周清平1   

  1. (1. 吉首大学信息管理与工程学院,湖南 张家界 427000;2. 北京市海淀区用友软件园,北京 100094)
  • 出版日期:2010-11-05 发布日期:2010-11-03
  • 作者简介:郭 鑫(1984-),男,助教、硕士,主研方向:数据挖掘;黄 云,讲师;刘介丹,学士;周清平,教授
  • 基金资助:
    湖南省大学生研究性学习和创新性实验计划基金资助项目(JSU-CX-2009-26);湖南省教育厅基金资助项目(06C658)

Incremental Updating Algorithm for Least Frequent Closed Tree

GUO Xin1, HUANG Yun1, LIU Jie-dan2, ZHOU Qing-ping1   

  1. (1. School of Information Management and Engineering, Jishou University, Zhangjiajie 427000, China; 2. Ufida Software Park, Haidian District, Beijing 100094, China)
  • Online:2010-11-05 Published:2010-11-03

摘要: 针对树挖掘算法产生大量频繁子树和树数据库随时间变化的问题,提出最小频繁闭树增量式更新算法以及增量式更新策略,能充分利用已有挖掘知识,无须重新运行树挖掘算法,并且只需进行一次数据库扫描操作。给出一种候选子树剪枝方法,能减少树同构判别次数,有效提高算法的运行效率。通过大量实验结果表明,该算法有效可行且效率较高。

关键词: 数据挖掘, 有序树, 频繁子树, 频繁闭树, 增量更新

Abstract: Tree mining algorithm always produces a lot of problems such as frequent subtrees and tree database changing over time. This paper proposes least frequent closed tree mining algorithm and incremental updating algorithm for frequent subtrees. It proposes incremental strategy, makes full use of exists data, without re-running tree mining algorithm during update mining, and needs scaning database only once. It proposes tree pruning method, which can lessen the time of distinguishing isomorphism, and improve the efficiency of algorithm working. The final adoption of a large number of experiments shows that incremental updating algorithm proposed in this paper is effective and feasible.

Key words: data mining, ordered tree, frequent subtree, frequent closed tree, incremental updating

中图分类号: