作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2007, Vol. 33 ›› Issue (10): 79-80,8. doi: 10.3969/j.issn.1000-3428.2007.10.028

• 软件技术与数据库 • 上一篇    下一篇

基于边界点的可分离性度量及特征选择

任江涛,孙婧昊,黄焕宇,印 鉴   

  1. (中山大学计算机科学系,广州 510275)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2007-05-20 发布日期:2007-05-20

Separability Measure Based on Boundary Points and Feature Selection

REN Jiangtao, SUN Jinghao, HUANG Huanyu, YIN Jian   

  1. (Department of Computer Science, Zhongshan University, Guangzhou 510275)
  • Received:1900-01-01 Revised:1900-01-01 Online:2007-05-20 Published:2007-05-20

摘要: 提出了一种新的面向高维数据的特征选择方法,在特征子集搜索上采用遗传算法进行随机搜索,在特征子集评价上采用基于边界点的可分性度量作为评价指标及适应度。实验结果表明,该算法可有效地找出具有较好的可分离性的特征子集,从而实现降维并提高分类 精度。

关键词: 特征选择, 边界点, 可分离性, 遗传算法

Abstract: This paper proposes a new feature selection method for the high-dimensional data, which realizes the feature subset search by genetic algorithm, and the feature subset fitness is evaluated by the separability measure based on boundary points. The experiments show that the proposed algorithm can find out the feature subsets with good separability, which results in the low-dimensional data and the good classification accuracy.

Key words: Feature selection, Boundary points, Separability, GA

中图分类号: