Computer Engineering ›› 2010, Vol. 36 ›› Issue (24): 167-168. doi: 10.3969/j.issn.1000-3428.2010.24.060

• Artificial Intelligence and Recognition Technology •

Attributes Selection Based on Correlation Analysis and Genetic Algorithm

KAN Jun-ling, LI Feng-gang

  1. (School of Medical Information Engineering, Anhui University of Traditional Medicine, Hefei 230031, China)
  • Online:2010-12-20 Published:2010-12-14
  • About the authors: KAN Jun-ling (born 1973), male, lecturer, M.S.; main research interests: genetic algorithms and database technology. LI Feng-gang, associate professor, Ph.D.
  • Supported by:

    Natural Science Foundation of Anhui Province (090416246)

Abstract:

Selecting and evaluating attributes is an important task in the design of knowledge-based systems and a key factor in their performance. This paper proposes a new attribute selection strategy that uses the genetic operators of a Genetic Algorithm (GA) as the search mechanism and a correlation-analysis heuristic as the evaluation mechanism, in order to select the optimal attribute subset for a given case library. Experimental results show that the method identifies the attribute subset most relevant to classification and prediction, and greatly reduces the attribute representation space with almost no loss of classification accuracy.

Key words: correlation analysis, Genetic Algorithm (GA), attribute selection
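
The strategy summarized in the abstract, a genetic search over attribute subsets scored by a correlation heuristic, can be illustrated with a minimal sketch. The abstract does not give the paper's exact fitness function or operators, so everything below is an assumption: the fitness is the well-known correlation-based merit k·r_cf / sqrt(k + k(k-1)·r_ff) (mean attribute-class correlation rewarded, mean attribute-attribute correlation penalized), and the function names `pearson`, `make_merit`, and `ga_select`, the tournament selection, one-point crossover, bit-flip mutation, and all parameter defaults are hypothetical choices made for illustration only.

```python
import math
import random


def pearson(x, y):
    """Absolute Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0  # a constant column carries no correlation information
    return abs(cov / math.sqrt(vx * vy))


def make_merit(columns, target):
    """Return a fitness function over bit masks (assumed correlation-based merit)."""
    m = len(columns)
    r_cf = [pearson(c, target) for c in columns]
    r_ff = [[pearson(columns[i], columns[j]) for j in range(m)] for i in range(m)]

    def merit(mask):
        idx = [i for i, bit in enumerate(mask) if bit]
        k = len(idx)
        if k == 0:
            return 0.0  # the empty subset is worthless
        mean_cf = sum(r_cf[i] for i in idx) / k
        pairs = [(i, j) for a, i in enumerate(idx) for j in idx[a + 1:]]
        mean_ff = sum(r_ff[i][j] for i, j in pairs) / len(pairs) if pairs else 0.0
        return k * mean_cf / math.sqrt(k + k * (k - 1) * mean_ff)

    return merit


def ga_select(columns, target, pop_size=30, generations=40,
              p_cross=0.8, p_mut=0.05, seed=0):
    """Search attribute subsets with a simple GA; returns (indices, merit)."""
    rng = random.Random(seed)
    m = len(columns)
    merit = make_merit(columns, target)
    pop = [[rng.randint(0, 1) for _ in range(m)] for _ in range(pop_size)]
    best = max(pop, key=merit)

    for _ in range(generations):
        def pick():
            # Binary tournament: the fitter of two random individuals wins.
            a, b = rng.sample(pop, 2)
            return a if merit(a) >= merit(b) else b

        nxt = [best[:]]  # elitism: carry the best mask forward unchanged
        while len(nxt) < pop_size:
            p1, p2 = pick(), pick()
            if m > 1 and rng.random() < p_cross:
                cut = rng.randint(1, m - 1)  # one-point crossover
                child = p1[:cut] + p2[cut:]
            else:
                child = p1[:]
            # Bit-flip mutation, applied independently to each position.
            child = [bit ^ 1 if rng.random() < p_mut else bit for bit in child]
            nxt.append(child)
        pop = nxt
        best = max(pop, key=merit)

    return [i for i, bit in enumerate(best) if bit], merit(best)
```

Calling `ga_select(columns, target)` with `columns` as a list of numeric attribute columns and `target` as the numeric class labels returns the indices of the selected attributes together with their merit. Because the fitness needs only correlations, no classifier is trained during the search; a classifier would be needed afterwards to check that accuracy is preserved on the reduced attribute set.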

CLC Number: