Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering ›› 2007, Vol. 33 ›› Issue (09): 90-91,1.

• Software Technology and Database • Previous Articles     Next Articles

A GEP-based Classification Rules Mining Algorithm

PENG Jinguo1, CAI Zhihua1, KANG Lishan1,2   

  1. ( 1. School of Computer Science, China University of Geosciences, Wuhan 430074; 2. State Key Laboratory of Software Engineering, Wuhan University, Wuhan 430072 )
  • Received:1900-01-01 Revised:1900-01-01 Online:2007-05-05 Published:2007-05-05

一种基于GEP的分类规则挖掘算法

彭锦国1,蔡之华1,康立山1,2   

  1. (1. 中国地质大学计算机学院,武汉 430074;2. 武汉大学软件工程国家重点实验室,武汉 430072)

Abstract: Based on a new automatic programming method——gene expression programming(GEP), this paper presents an improved GEP mining algorithm by designing new fitness function, optimizing the processing of initialization, introducing new genetic operators and adopting(λ+μ) selection strategy which is used in evolution strategy. Experiments are done to test the new algorithm and the data sets are from UCI machine learning repository. The capability of the new algorithm to mine accurate classification rules is compared with C4.5 algorithm and traditional GEP algorithm.

Key words: GEP, Classification rules, Data mining

摘要: 基于一种新的自动程序设计方法基因表达式程序设计(GEP),通过设计适应函数、初始化群体的优化、增加新的遗传算子以及采用演化策略中的(λ+μ)淘汰策略等对原始GEP算法进行有效的改进,设计出一种新的数据挖掘算法。采用UCI机器学习知识库中的数据集对该算法进行了实验,并通过与C4.5及文献[3]的比较,检验了该算法的准确性。

关键词: GEP, 分类规则, 数据挖掘

CLC Number: