作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2006, Vol. 32 ›› Issue (6): 82-85.

• 软件技术与数据库 • 上一篇    下一篇

一种基于 FP-tree 的最小预测集获取新算法

谢翠华,沈洁,李 云,程 伟,林 颖   

  1. 扬州大学信息工程学院计算机科学与工程系,扬州 225009
  • 出版日期:2006-03-20 发布日期:2006-03-20

A New Algorithm for Mining the Minimal Prediction Rule Set Based on FP-tree

XIE Cuihua, SHEN Jie, LI Yun, CHENG Wei, LIN Ying   

  1. Department of Computer Science & Engineering, School of Information Engineering, Yangzhou University, Yangzhou 225009
  • Online:2006-03-20 Published:2006-03-20

摘要: 生成用于预测的关联规则,现有算法生成的关联规则中许多是不必要的。利用分治策略和基于频繁闭子集的FP-TREE 生成一种特殊的关联规则集(最小预测集),它比现有规则集小,但是具有同样的预测功能,并且具有发现关键属性的能力。给出了它的算法,并从理论上证明了该算法的正确有效性。

关键词: 数据挖掘;关联规则;频繁闭项集;最小预测集;FP-tree

Abstract: Mining transaction database for association rules usually generates a large number of rules ,most of which are unnecessary when used for subsequent prediction and discovery of key data. In this paper, a particular set called the minimal prediction rule set, is mined using the divide-and-conquer strategy and the FP-tree of frequent closet item sets. It is smaller than the association rule set, but has the same ability of prediction and discovery of key data. In this paper, the algorithm of the minimal prediction rule set is given, and its effectiveness is proved in theory.

Key words: Data mining; Association rules; Frequent closet item sets; Minimal prediction rule set; FP-tree