作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2013, Vol. 39 ›› Issue (1): 294-297. doi: 10.3969/j.issn.1000-3428.2013.01.065

• 开发研究与设计技术 • 上一篇    下一篇

基于球结构SVM的多标签分类

蒋 华,戚玉顺   

  1. (桂林电子科技大学计算机科学与工程学院,广西 桂林 541004)
  • 收稿日期:2011-10-09 修回日期:2011-11-23 出版日期:2013-01-15 发布日期:2013-01-13
  • 作者简介:蒋 华(1963-),男,教授、博士,主研方向:数据库系统,信息安全技术;戚玉顺,硕士

Multi-label Classification Based on Sphere Structured SVM

JIANG Hua, QI Yu-shun   

  1. (School of Computer Science and Engineering, Guilin University of Electronic Technology, Guilin 541004, China)
  • Received:2011-10-09 Revised:2011-11-23 Online:2013-01-15 Published:2013-01-13

摘要: 现有多标签分类问题普遍被转换成多类分类问题,计算量较大,运行时间较长,且面对新类别加入时,拓展性较差。为此,提出一种基于球结构支持向量机的多标签分类方法。每一类别标签对应一个球域结构,提取球重叠区域的样本,依据距离差值度量样本类别相似度,确定样本所属类别。实验结果表明,该方法可以节省210 ms的训练时间,使平均查全率提高3.2%,适合大量样本分类。

关键词: 支持向量机, 距离差, 多标签分类, 多类分类, 主动学习, K折交叉验证

Abstract: Multi-label classification problem is generally converted into multi-class classification problem, and it can increase the amount of computation and extend running time. When a new class is added, the method has a poor expandability too. To solve this problem, this paper proposes a multi-label classification method based on sphere structured Support Vector Machine(SVM). Each category labels correspond a ball domain structure, this paper extracts the samples of ball overlap region, according to the distance difference measures the sample category similarity, and is to determine the category of samples. Experimental results show that the method can save 210 ms training time, improves the average recall rate of 3.2% and is suitable for a large number of sample classification.

Key words: Support Vector Machine(SVM), distance difference, multi-label classification, multi-class classification, active learning, K-fold cross validation

中图分类号: