作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2013, Vol. 39 ›› Issue (4): 230-233. doi: 10.3969/j.issn.1000-3428.2013.04.053

• 人工智能及识别技术 • 上一篇    下一篇

基于联盟博弈的Filter特征选择算法

李智广1,付 枫2,孙 鑫2,李彩虹1   

  1. (1. 山东理工大学计算机科学与技术学院,山东 淄博 255049;2. 吉林大学计算机科学与技术学院,长春 130012)
  • 收稿日期:2012-05-14 出版日期:2013-04-15 发布日期:2013-04-12
  • 作者简介:李智广(1957-),男,高级实验师,主研方向:人工智能;付 枫,工程师、硕士;孙 鑫,博士研究生;李彩虹,副教授、博士
  • 基金资助:
    国家自然科学基金资助项目(60973136);山东省优秀中青年科学家科研奖励基金资助项目(BS2009DX034)

Filter Feature Selection Algorithm Based on Coalitional Game

LI Zhi-guang 1, FU Feng 2, SUN Xin 2, LI Cai-hong 1   

  1. (1. College of Computer Science and Technology, Shandong University of Technology, Zibo 255049, China; 2. College of Computer Science and Technology, Jilin University, Changchun 130012, China)
  • Received:2012-05-14 Online:2013-04-15 Published:2013-04-12

摘要: 在机器学习中,信息冗余和无关特征会导致较高的计算复杂度以及过拟合问题。为此,提出一种基于联盟博弈的Filter特征选择算法。采用联合互信息度量联盟与目标类的依赖程度,使用Shapley权利指数评估每个特征在整个特征空间中的重要性,选择具有最高优先权的特征子集用于分类学习。实验结果表明,在C4.5和支持向量机2种分类器下,该算法特征子集分类准确率的均值分别为88.72%、93.39%,高于mRMR算法和ReliefF算法。

关键词: 机器学习, 维数灾难, 特征选择, 联盟博弈, 信息论, 联合互信息

Abstract: Information redundancy and independent feature can lead to higher computational complexity and over fitting problem in machine learning. A filter feature selection algorithm based on coalitional game is proposed in this paper. The joint mutual information is utilized to measure the relevance between the coalition and target class, and Shapley value is used to evaluate the importance of each feature among the feature space. Experimental results show that under two kinds of classifier such as C4.5 and Support Vector Machine(SVM), the subset mean classification accuracy of this algorithm are 88.72% and 93.39%, and is higher than mRMR algorithm and ReliefF algorithm.

Key words: machine learning, curse of dimensionality, feature selection, coalitional game, information theory, joint mutual information

中图分类号: