Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering ›› 2006, Vol. 32 ›› Issue (17): 52-54,7. doi: 10.3969/j.issn.1000-3428.2006.17.018

• Special Paper • Previous Articles     Next Articles

Feature Subset Selection of Information System Based on
Similar Extension Matrix

LI Guohe   

  1. (Dept. of Computer Science and Technology, China Petroleum University, Beijing 102249)
  • Received:1900-01-01 Revised:1900-01-01 Online:2006-09-05 Published:2006-09-05

基于类扩张矩阵的信息系统特征选取

李国和   

  1. (中国石油大学(北京)计算机科学与技术系,北京 102249)

Abstract: Feature selection is NP-Hard problem. In order to get a minimal feature subset of an information system, so-called Similar Extension Matrix(SEM) is defined to discriminate all objects by its elements, and then condensed to Condensed SEM(CSEM) by the included relation in logic. At Last, by means of the statistical values as heuristic information, a minimal feature subset is efficiently obtained in CSEM. The heuristic algorithm of minimal feature subset selection is proved very efficient by theoretical analysis and experiment.

Key words: Information system, Feature selection, Heuristic information, Extension matrix

摘要: 特征选取是一个NP-Hard问题。为了快速完成信息系统的一个最小特征选取,引入了类扩张矩阵的定义。通过类扩张矩阵的元素表示对象的差异,并利用逻辑上包含关系,有效浓缩类扩张矩阵。最后,以类扩张矩阵的统计信息为启发式信息,在浓缩类扩张矩阵中实现一个最小特征子集的快速求解。通过理论分析和实验,证明了该特征选取方法的高效性。

关键词: 信息系统, 特征选取, 启发式信息, 扩张矩阵

CLC Number: