作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2008, Vol. 34 ›› Issue (10): 163-165. doi: 10.3969/j.issn.1000-3428.2008.10.059

• 人工智能及识别技术 • 上一篇    下一篇

基于小生境遗传算法的关联规则挖掘方法

赵连朋1,2,金喜子1,3,孙 亮2,姜文哲4   

  1. (1. 东北师范大学计算机学院,长春 130117;2. 渤海大学公共计算机教研部,锦州 121000;3. 吉林大学计算机科学与技术学院,长春 130117;4. 东北电力设计院,长春 130117)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2008-05-20 发布日期:2008-05-20

Association Rule Mining Method Based on Niching Genetic Algorithm

ZHAO Lian-peng1,2, JIN Xi-zi1,3, SUN Liang2, JIANG Wen-zhe4   

  1. (1. College of Computer, Northeast Normal University, Changchun 130117; 2. Department of Public Computer-education, Bohai University, Jinzhou 121000; 3. School of Computer Science and Technology, Jilin University, Changchun 130117;4. Northeast Electric Power Design Institute, Changchun 130117)
  • Received:1900-01-01 Revised:1900-01-01 Online:2008-05-20 Published:2008-05-20

摘要: 通过采用惩罚函数设置支持度阈值,有效地解决了规则冗余问题。在小生境遗传算法方面,采用了一种新颖的素因子染色体编码方法并引入了最大频繁项分布表。该编码方法把原来用字符表示的事务转化为用一个整数进行表示,把字符串运算转化为数值运算,将事务数据库属性项压缩为一个数值型的项。通过引入最大频繁项分布表,使算法总是能在最大频繁项密集区进行挖掘,对组合搜索空间进行了有效修剪。实验结果表明,该方法对事务数据库压缩比超过25%,效率至少能提高47%。

关键词: 关联规则, 小生境遗传算法, 染色体, 杂交操作

Abstract: The paper effectively solves the problem of regulation redundancy through adopting penalty function to set up the threshold of supporting degree. In respect of niching genetic algorithm, it adopts a new prime factor chromosome encoding method and introduces the maximal frequency item distribution list. This encoding method changes the transaction originally expressed in character into the one expressed in integer and changes the character string operation into the numerical value operation and compresses the property item of the transaction database to a numerical value item. Through introducing the maximal frequency item distribution list, it can always mine in the maximal frequency item compact district and thus, it effectively prunes the assembling searchspace. Experimental results show that the adopting method makes the compressing ratio of the transaction database in excess of 25%, and the effect can increase by at least 47%.

Key words: association rule, niching genetic algorithm, chromosome, hybrid operation

中图分类号: