作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2010, Vol. 36 ›› Issue (2): 181-183. doi: 10.3969/j.issn.1000-3428.2010.02.064

• 人工智能及识别技术 • 上一篇    下一篇

处理非平衡数据的粒度SVM学习算法

郭虎升1,亓 慧1,2,王文剑1   

  1. (1. 山西大学计算机与信息技术学院计算智能与中文信息处理教育部重点实验室,太原 030006;2. 太原师范学院计算机系,太原 030012)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2010-01-20 发布日期:2010-01-20

Granular SVM Learning Algorithm for Processing Imbalanced Data

GUO Hu-sheng1, QI Hui1,2, WANG Wen-jian1   

  1. (1. Key Laboratory of Computational Intelligence and Chinese Information Processing, Ministry of Education, School of Computer and Information Technology, Shanxi University, Taiyuan 030006; 2. Department of Computer, Taiyuan Normal College, Taiyuan 030012)
  • Received:1900-01-01 Revised:1900-01-01 Online:2010-01-20 Published:2010-01-20

摘要: 针对支持向量机对于非平衡数据不能进行有效分类的问题,提出一种粒度支持向量机学习算法。根据粒度计算思想对多数类样本进行粒划分并从中获取信息粒,以使数据趋于平衡。通过这些信息粒来寻找局部支持向量,并在这些局部支持向量和少数类样本上进行有效学习,使SVM在非平衡数据集上获得令人满意的泛化能力。

关键词: 粒度支持向量机, 非平衡数据, 信息粒, 局部支持向量

Abstract: This paper presents a Granular Support Vector Machine(GSVM) learning algorithm in order to improve the performance of SVM on imbalanced datasets. The GSVM divides some granules for majority data based on granular computing theory and extracts information granules. So the data becomes balanced, then GSVM finds local support vectors from those granules. SVM learns on these LSVs together with minority data. The satisfactory generalization performance can be obtained on imbalanced data.

Key words: Guanular Support Vector Machine(GSVM), imbalanced data, information granule, local support vectors

中图分类号: