
Computer Engineering ›› 2010, Vol. 36 ›› Issue (13): 257-259. doi: 10.3969/j.issn.1000-3428.2010.13.090

• Development Research and Design Technology •

Feature Selection Method Based on Mutual Information and Attribute Union Theory

HE Shao-rong, LIANG Jin-ming, HE Zhi-yong

  1. (School of Computer Science, Sichuan University of Science and Engineering, Zigong 643000)
  • Online: 2010-07-05 Published: 2010-07-05
  • About the authors: HE Shao-rong (born 1970), male, lecturer; main research interests: computer networks and multimedia technology. LIANG Jin-ming and HE Zhi-yong, associate professors
  • Supported by:
    Scientific Research Fund of the Sichuan Provincial Education Department (No. 2006A108)

Abstract:

This paper examines mutual information (MI) theory. To address the shortcomings of MI, rough sets are introduced and an attribute reduction algorithm based on attribute union theory is proposed. Building on this algorithm, a feature selection method suitable for massive text data sets is presented. The method uses MI for a preliminary selection of features and then applies the proposed attribute reduction algorithm to eliminate redundancy, yielding a more representative feature subset. Experimental results show that the method is effective.

Key words: feature selection, mutual information (MI), rough set, attribute union theory, attribute reduction
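
The two-stage idea in the abstract (MI for preliminary selection, then attribute reduction to remove redundancy) can be illustrated with a minimal sketch. The abstract does not specify the authors' reduction algorithm or which MI variant they use, so everything below is an assumption: MI is computed as the expected mutual information between a term's binary presence and the class label, and the reduction step is a generic rough-set positive-region check used as a stand-in, not the paper's attribute union algorithm. The function names and the toy corpus are hypothetical.

```python
import math
from collections import Counter, defaultdict


def mutual_information(docs, labels, term):
    """Expected MI (in bits) between binary presence of `term` and the label."""
    n = len(docs)
    joint = Counter((term in d, y) for d, y in zip(docs, labels))
    px = Counter(term in d for d in docs)
    py = Counter(labels)
    # Sum p(x, y) * log2(p(x, y) / (p(x) * p(y))) over observed (x, y) pairs.
    return sum((c / n) * math.log2(c * n / (px[x] * py[y]))
               for (x, y), c in joint.items())


def positive_region_size(docs, labels, terms):
    """Count documents whose equivalence class under `terms` has a single label."""
    seen_labels = defaultdict(set)
    counts = Counter()
    for d, y in zip(docs, labels):
        key = tuple(t in d for t in terms)
        seen_labels[key].add(y)
        counts[key] += 1
    return sum(counts[k] for k, ys in seen_labels.items() if len(ys) == 1)


def select_features(docs, labels, k):
    """Stage 1: keep the k terms with highest MI. Stage 2: drop redundant ones."""
    vocab = sorted(set().union(*docs))
    ranked = sorted(vocab,
                    key=lambda t: mutual_information(docs, labels, t),
                    reverse=True)
    candidates = ranked[:k]
    target = positive_region_size(docs, labels, candidates)
    kept = list(candidates)
    # Assumed stand-in for attribute reduction: try removing the weakest
    # terms first and keep a removal only if the positive region is unchanged.
    for t in reversed(candidates):
        trial = [u for u in kept if u != t]
        if positive_region_size(docs, labels, trial) == target:
            kept = trial
    return kept


# Hypothetical toy corpus: each document is a set of tokens.
docs = [{"goal", "match"}, {"goal", "team"}, {"vote", "party"}, {"vote", "match"}]
labels = ["sport", "sport", "politics", "politics"]
selected = select_features(docs, labels, 3)
```

On this toy corpus, "goal" and "vote" each separate the two classes perfectly, so the reduction step treats one of them as redundant once the other is kept. A real text collection would need a more careful tie-breaking and scaling strategy than this sketch provides.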
