
Computer Engineering ›› 2011, Vol. 37 ›› Issue (9): 81-83. doi: 10.3969/j.issn.1000-3428.2011.09.027

• Software Technology and Database •

Q-Apriori Algorithm of Multivalue Attribute Association Rules Mining

JIANG Li-li, MENG Fan-rong, ZHOU Yong

  1. (School of Computer Science & Technology, China University of Mining & Technology, Xuzhou 221116, China)
  • Online:2011-05-05 Published:2011-05-12
  • About the authors: JIANG Li-li (b. 1985), female, master's student, research interests: data mining and knowledge discovery; MENG Fan-rong, professor, Ph.D.; ZHOU Yong, associate professor, Ph.D.
  • Supported by:
    National Natural Science Foundation of China (50674086); Jiangsu Province Postdoctoral Science Foundation (0701045B); China University of Mining and Technology Science and Technology Foundation (2007 B017)

Abstract: The Apriori algorithm is inefficient when mining association rules over multivalue (quantitative) attributes. To address this problem, this paper proposes a Quantitative Apriori (Q-Apriori) algorithm. Exploiting the characteristics of multivalue attribute data, it changes the storage format of itemsets to a matrix-like data structure, which speeds up support counting while traversing the database. It also improves the join step with an operation similar to matrix addition, which reduces the number of invalid candidate itemsets generated. Experimental results show that the algorithm is considerably more efficient than the Apriori algorithm.

Key words: association rules, multivalue attribute, data mining, Quantitative Apriori (Q-Apriori) algorithm

CLC Number:
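
The abstract describes a matrix-like itemset format and an addition-style join but gives no implementation details. The Python sketch below is one possible reading, not the authors' code. It assumes every record is a fixed-length row with one integer-coded value per attribute, that an itemset is a row of the same length in which 0 means "attribute not constrained", and that two k-itemsets are joined by element-wise merging only when they never assign different values to the same attribute. All names (`supports`, `count_support`, `join`, `keep_frequent`, `frequent_itemsets`, `RECORDS`) are hypothetical, and the usual Apriori subset-pruning step is omitted for brevity.

```python
from itertools import combinations


def supports(record, itemset):
    """True if the record matches every attribute the itemset constrains (non-zero slots)."""
    return all(v == 0 or r == v for r, v in zip(record, itemset))


def count_support(records, itemset):
    """Number of records that contain the itemset."""
    return sum(supports(r, itemset) for r in records)


def join(a, b):
    """Merge two k-itemset rows into a (k+1)-itemset row, or return None.

    Assumed rule: the rows must agree wherever both are non-zero, and each
    must constrain exactly one attribute that the other leaves at 0.
    """
    merged, only_a, only_b = [], 0, 0
    for x, y in zip(a, b):
        if x and y:
            if x != y:
                return None  # one attribute cannot take two values at once
            merged.append(x)
        elif x:
            only_a += 1
            merged.append(x)
        elif y:
            only_b += 1
            merged.append(y)
        else:
            merged.append(0)
    return tuple(merged) if only_a == 1 and only_b == 1 else None


def keep_frequent(records, candidates, min_count):
    """Count each candidate once and keep those meeting the support threshold."""
    counts = {c: count_support(records, c) for c in candidates}
    return {c: n for c, n in counts.items() if n >= min_count}


def frequent_itemsets(records, min_count):
    """Level-wise search; returns {itemset_row: support_count}."""
    n_attrs = len(records[0])
    # 1-itemsets: one non-zero slot per (attribute, value) pair seen in the data.
    singles = {
        tuple(v if j == i else 0 for j in range(n_attrs))
        for r in records
        for i, v in enumerate(r)
        if v
    }
    level = keep_frequent(records, singles, min_count)
    result = {}
    while level:
        result.update(level)
        candidates = set()
        for a, b in combinations(sorted(level), 2):
            merged = join(a, b)
            if merged is not None:
                candidates.add(merged)
        level = keep_frequent(records, candidates, min_count)
    return result


# Hypothetical toy data: three attributes, values coded as 1 or 2.
RECORDS = [(1, 2, 1), (1, 2, 2), (1, 1, 1), (2, 2, 1)]
```

Under this encoding, a pair such as `(1, 0, 0)` and `(2, 0, 0)` is rejected at join time because both rows assign a value to the first attribute, so a candidate that no record could ever support is never generated or counted. This is one way to understand the abstract's claim about fewer invalid candidates.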