作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2012, Vol. 38 ›› Issue (3): 31-33. doi: 10.3969/j.issn.1000-3428.2012.03.011

• 软件技术与数据库 • 上一篇    下一篇

基于动态项集计数的加权频繁项集算法

秦丽君1,2,罗雄飞1   

  1. (1. 中国科学院软件研究所,北京 100190;2. 中国科学院研究生院,北京100190)
  • 收稿日期:2011-06-29 出版日期:2012-02-05 发布日期:2012-02-05
  • 作者简介:秦丽君(1983-),女,硕士研究生,主研方向:数据挖掘,数据可视化;罗雄飞,博士
  • 基金资助:
    国家“863”计划基金资助项目(2007AA040702)

Weighted Frequent Itemset Algorithm Based on Dynamic Itemset Counting

QIN Li-jun 1,2, LUO Xiong-fei 1   

  1. (1. Institute of Software, Chinese Academy of Sciences, Beijing 100190, China; 2. Graduate University of Chinese Academy of Sciences, Beijing 100190, China)
  • Received:2011-06-29 Online:2012-02-05 Published:2012-02-05

摘要: 基于Apriori的加权频繁项集挖掘算法存在扫描数据集次数多的问题。为此,提出一种基于动态项集计数的加权频繁项集算法。该算法采用权值键树的数据结构和动态项集计数的方法,满足向下闭合特性,并且动态生成候选频繁项集,从而减少扫描数据集的次数。实验结果证明,该算法生成的加权频繁项集具有较高的效率和时间性能。

关键词: 数据挖掘, 加权频繁项集挖掘, 动态项集计数, 加权支持度, 权值键树, 向下闭合特性, 最大权值

Abstract: The existing weighted frequent itemset mining algorithms which are based on Apriori require multiple dataset scans. This paper proposes a weighted frequent itemset algorithm weighted frequent itemset mining based on dynamic itemset counting which uses the structure of weighted trie tree and the method of dynamic itemset counting. This algorithm satisfies the downward closure property and dynamically generates candidate frequent itemsets, thereby reduces the number of scanning datasets and improves the performance. Experimental results show that the proposed algorithm not only generates the weighted frequent itemsets, but also has high efficiency and time performance.

Key words: data mining, weighted frequent itemset mining, dynamic itemset counting, weighted support degree, weighted trie tree, downward closure property, maximum weight

中图分类号: