Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering ›› 2013, Vol. 39 ›› Issue (4): 219-221,225. doi: 10.3969/j.issn.1000-3428.2013.04.050

• Networks and Communications • Previous Articles     Next Articles

Maximal Frequent Itemset Mining Based on Parent-child Equivalency Pruning Strategy

ZHANG Zhi-gang 1,2, HUANG Liu-sheng 2, JIN Zong-an 1, XIANG Li-ping 1   

  1. (1. Department of Information Engineering, Lu An Vocation Technology College, Lu’an 237158, China; 2. School of Computer Science and Technology, University of Science and Technology of China, Hefei 230027, China)
  • Received:2012-08-03 Online:2013-04-15 Published:2013-04-12

基于父子等价剪枝策略的最大频繁项集挖掘

张志刚1,2,黄刘生2,金宗安1,项莉萍1   

  1. (1. 六安职业技术学院信息工程系,安徽 六安 237158;2. 中国科学技术大学计算机科学与技术学院,合肥 230027)
  • 作者简介:张志刚(1973-),男,副教授,主研方向:数据挖掘,数据分析;黄刘生,教授、博士生导师;金宗安,讲师、硕士;项莉萍,副教授
  • 基金资助:
    安徽省高等学校自然科学研究基金资助项目(KJ2012B181)

Abstract: Incomplete pruning leads to inefficiencies for the search space with the same transaction set in parent-child node according to the traditional pruning strategy. This paper presents a parent-child equivalency pruning strategy which can prune the search space with the same transaction set in parent-child node. It effectively minimizes the search space and reduces the number of frequent items. The new maximal frequent itemset mining algorithm is completed based on new pruning strategy. Experimental results show that the new pruning strategy can shorten the time of mining maximal frequent itemset with the same support.

Key words: data mining, maximum frequent itemset, pruning strategy, minimum support, depth-first, association rule

摘要: 在传统剪枝策略中,具有相同事务集的父子结点搜索空间没有充分剪枝,效率较低。为此,提出父子等价的剪枝策略。采用深度优先搜索集合枚举树,对于父子结点中具有相同事务集的搜索空间进行剪枝,有效地缩小搜索空间,减少频繁项计算的次数,给出基于该剪枝策略的最大频繁项集挖掘算法。实验结果表明,该算法可缩短同一支持度下的最大频繁项集挖掘时间。

关键词: 数据挖掘, 最大频繁项集, 剪枝策略, 最小支持度, 深度优先, 关联规则

CLC Number: