作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2011, Vol. 37 ›› Issue (9): 78-80. doi: 10.3969/j.issn.1000-3428.2011.09.026

• 软件技术与数据库 • 上一篇    下一篇

基于频繁模式树的约束最大频繁项集挖掘算法

花红娟a,张 健b,陈少华a   

  1. (上海海洋大学 a. 信息学院;b. 图书馆,上海 201306)
  • 出版日期:2011-05-05 发布日期:2011-05-12
  • 作者简介:花红娟(1982-),女,硕士研究生,主研方向:数据挖掘;张 健(通讯作者),教授;陈少华,硕士研究生
  • 基金资助:
    国家“863”计划基金资助重点项目“人工鱼礁生态增殖及海域生态调控技术”(2006AA100303)

Mining Algorithm for Constrained Maximum Frequent Itemsets Based on Frequent Pattern Tree

HUA Hong-juan a, ZHANG Jian b, CHEN Shao-hua a   

  1. (a. School of Information; b. Library, Shanghai Ocean University, Shanghai 201306, China)
  • Online:2011-05-05 Published:2011-05-12

摘要: 多数最大频繁项集挖掘算法产生候选项目集的代价很高,而实际应用中用户只关心部分关联规则。针对该问题,提出一种基于频繁模式树的约束最大频繁项集快速挖掘算法。该算法能随时删除不满足约束条件的项集,无需生成候选项目集,由此提高挖掘效率。实验结果证明,该算法的效率优于同类算法。

关键词: 数据挖掘, 最大频繁项集, 约束最大频繁项集, 频繁模式树, 项约束

Abstract: The cost of producing candidate itemsets is very high in most maximum frequent itemset mining algorithms, but users are often interested in a subset of association rules in practical application, so this paper proposes a mining algorithm for constrained maximum frequent itemsets based on Frequent Pattern tree(FP-tree). It can delete the itemsets which do not meet the constraints at any time and does not produce candidate itemsets, so that the efficiency of mining is improved. Experimental results show that the algorithm is better than other algorithm.

Key words: data mining, maximum frequent itemsets, constrained maximum frequent itemsets, Frequent Pattern tree(FP-tree), item constraint

中图分类号: