基于间隔准则的优化排序多标记学习算法

doi:10.19678/j.issn.1000-3428.0054652

计算机工程 ›› 2020, Vol. 46 ›› Issue (7): 104-109. doi: 10.19678/j.issn.1000-3428.0054652

基于间隔准则的优化排序多标记学习算法

金亚洲, 张正军, 颜子寒, 王雅萍

南京理工大学理学院, 南京 210094

收稿日期:2019-04-19 修回日期:2019-07-18 发布日期:2019-07-26
作者简介:金亚洲(1993-),男,硕士,主研方向为机器学习、数据挖掘;张正军(通信作者),副教授、博士;颜子寒、王雅萍,硕士。
基金资助:
全国统计科学研究项目"海量数据下半参数测量误差模型的统计建模和应用"（2018LD01）。

Optimized Ranking Algorithm Based on Margin Criterion for Multi-Label Learning

JIN Yazhou, ZHANG Zhengjun, YAN Zihan, WANG Yaping

School of Science, Nanjing University of Science and Technology, Nanjing 210094, China

Received:2019-04-19 Revised:2019-07-18 Published:2019-07-26

摘要/Abstract

摘要： 针对多标记学习分类问题，算法适应方法将其转化为排序问题，并将输出标记按照其与示例的相关性进行排序，该类方法取得了较好的分类效果。基于间隔准则提出一种多标记学习算法，通过优化模型在示例的相关标记集合中最小输出与不相关标记集合中最大输出的间隔损失来进行标记排序。在此基础上，为充分利用全部标记信息，提出一种改进的优化排序多标记学习算法，分别优化模型在示例的相关标记集合中平均输出与不相关标记集合中最大输出的间隔损失，以及优化模型在相关标记集合中最小输出与不相关标记集合中平均输出的间隔损失，从而实现标记排序。在模型的参数学习过程中，使用改进的次梯度Pegasos算法进行优化。将所提2种算法与ML-RBF、BP-MLL、ML-KNN多标记学习算法在4个多标记数据集上进行对比实验，结果表明，在HL、RL等5种不同的评价准则下，2种算法均能与对比算法取得相近的分类性能。

关键词: 多标记学习, 算法适应, 标记排序, 平均输出, 间隔准则, Pegasos算法

Abstract: For classification problems in multi-label learning,the algorithm adaptation methods that transform them into a ranking problem and rank the output labels according to their relevance to the examples have made great success.This paper proposes a multi-label learning algorithm based on the margin criterion,which optimizes the margin loss between the minimum output in the relevant label set of examples and the maximum output in the irrelevant label set of examples,so as to sort the labels.On this basis,in order to utilize all the label information,an improved optimized ranking algorithm for multi-label learning is proposed to respectively optimize the margin loss between the average output in the relevant label set and the maximum output in the irrelevant label set of examples,and the margin loss between the minimum output in the relevant label set and the average output in the irrelevant label set,so as to sort the labels.Then an improved sub-gradient Pegasos algorithm is used to learn the model parameters.Experimental results on four multi-label datasets show that the two improved algorithms achieves similar classification performance compared with ML-RBF,BP-MLL,and ML-KNN under HL,RL and other three different evaluation criteria.

Key words: multi-label learning, algorithm adaptation, label ranking, average output, margin criterion, Pegasos algorithm

中图分类号:

TP391

金亚洲, 张正军, 颜子寒, 王雅萍. 基于间隔准则的优化排序多标记学习算法[J]. 计算机工程, 2020, 46(7): 104-109.

JIN Yazhou, ZHANG Zhengjun, YAN Zihan, WANG Yaping. Optimized Ranking Algorithm Based on Margin Criterion for Multi-Label Learning[J]. Computer Engineering, 2020, 46(7): 104-109.

https://www.ecice06.com/CN/Y2020/V46/I7/104

参考文献

[1] SCHAPIRE R E,SINGER Y.BoosTexter:a Boosting-based system for text categorization[J].Machine Learning,2000,39(2/3):135-168.
[2] DE COMITÉ F,GILLERON R,TOMMASI M.Learning multi-label alternating decision trees from texts and data[M]//PETRA P.Machine learning and data mining in pattern recognition.Berlin,Germany:Springer,2003:35-49.
[3] BOUTELL M R,LUO J B,SHEN X P,et al.Learning multi-label scene classification[J].Pattern Recognition,2004,37(9):1757-1771.
[4] BARUTCUOGLU Z,SCHAPIRE R E,TROYANSKAYA O G.Hierarchical multi-label prediction of gene function[J].Bioinformatics,2006,22(7):830-836.
[5] TSOUMAKAS G,KATAKIS I.Multi-label classification[J].International Journal of Data Warehousing and Mining,2007,3(3):1-13.
[6] ZHANG Minling,ZHOU Zhihua.A review on multi-label learning algorithms[J].IEEE Transactions on Knowledge and Data Engineering,2014,26(8):1819-1837.
[7] READ J,PFAHRINGER B,HOLMES G,et al.Classifier chains for multi-label classification[J].Machine Learning,2011,85(3):333-359.
[8] CHEN Linlin,CHEN Degang.A classifier chain method for multi-label learning based on kernel alignment[J].Journal of Nanjing University(Natural Sciences),2018,54(4):67-74.(in Chinese)陈琳琳,陈德刚.一种基于核对齐的分类器链的多标记学习算法[J].南京大学学报(自然科学版),2018,54(4):67-74.
[9] TSOUMAKAS G,KATAKIS I,VLAHAVAS I.Random k-labelsets for multilabel classification[J].IEEE Transactions on Knowledge and Data Engineering,2011,23(7):1079-1089.
[10] ELISSEEFF A,WESTON J.A kernel method for multi-labelled classification[C]//Proceedings of the 14th International Conference on Neural Information Processing Systems.New York,USA:ACM Press,2001:681-687.
[11] JIANG A W,WANG C H,ZHU Y P.Calibrated Rank-SVM for multi-label image categorization[C]//Proceedings of 2008 IEEE International Joint Conference on Neural Networks.Washington D.C.,USA:IEEE Press,2008:15-26.
[12] VAPNIK V.The nature of statistical learning theory[M].Berlin,Germany:Springer,1995.
[13] ZHANG M L,ZHOU Z H.Multilabel neural networks with applications to functional genomics and text categorization[J].IEEE Transactions on Knowledge and Data Engineering,2006,18(10):1338-1351.
[14] GRODZICKI R,MANDZIUK J,WANG L P.Improved multilabel classification with neural networks[M]//GÜNTERRUDOLP H,THOMASJANSE N,SIMONLUCA S,et al.Parallel problem solving from nature-PPSN X.Berlin,Germany:Springer,2008:409-416.
[15] ZHANG M L,ZHOU Z H.ML-KNN:a lazy learning approach to multi-label learning[J].Pattern Recognition,2007,40(7):2038-2048.
[16] ZHANG Minling.An improved multi-label lazy learning approach[J].Journal of Computer Research and Development,2012,49(11):2271-2282.(in Chinese)张敏灵.一种新型多标记懒惰学习算法[J].计算机研究与发展,2012,49(11):2271-2282.
[17] SHALEV-SHWARTZ S,SINGER Y,SREBRO N,et al.Pegasos:primal estimated sub-gradient solver for SVM[J].Mathematical Programming,2011,127(1):3-30.
[18] ZHANG Shijiang,CHAI Jing.Partial label learning algorithm based on maximum margin[J].Science Technology and Engineering,2018,18(28):114-120.(in Chinese)张仕将,柴晶.一种基于最大间隔的偏标记学习算法[J].科学技术与工程,2018,18(28):114-120.
[19] CRAMMER K,SINGER Y.On the algorithmic implementation of multiclass kernel-based vector machines[J].Journal of Machine Learning Research,2002,2(2):265-292.
[20] TANG L,XUAN Q,XIONG R,et al.A multi-class large margin classifier[J].Journal of Zhejiang University-Science A,2009,10(2):253-262.
[21] LI Yukun,ZHANG Minling,GENG Xin.Leveraging implicit relative labeling-importance information for effective multi-label learning[C]//Proceedings of 2015 IEEE International Conference on Data Mining.Washington D.C.,USA:IEEE Press,2015:123-156.
[22] TSOUMAKAS G,VILCEK J,XIOUFITS E S.Mulan:a Java library for multi-label learning[EB/OL].[2019-03-10].http://mulan.sourceforge.net/datasets.html.
[23] ZHANG Minling.ML-RBF:RBF neural networks for multi-label learning[J].Neural Processing Letters,2009,29(2):61-74.

选择文件类型/文献管理软件名称

选择包含的内容

基于间隔准则的优化排序多标记学习算法

Optimized Ranking Algorithm Based on Margin Criterion for Multi-Label Learning

RichHTML

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 4

编辑推荐

Metrics

本文评价

[1]	袁志祥, 王雅卿, 黄俊. 基于深度互学习的多标记零样本分类[J]. 计算机工程, 2023, 49(10): 64-71.
[2]	王晓莹, 谢钧, 陶性留, 邵东生, 王忠. 基于嵌入式特征提取的多标记分类算法[J]. 计算机工程, 2019, 45(11): 172-176.
[3]	秦锋, 黄俊, 程泽凯. 用于多标记学习的阈值确定算法[J]. 计算机工程, 2010, 36(21): 214-216.
[4]	李昕, 钱旭, 王自强. 一种高效的高维异常数据挖掘算法[J]. 计算机工程, 2010, 36(21): 34-36.

模态框（Modal）标题

选择文件类型/文献管理软件名称

选择包含的内容

基于间隔准则的优化排序多标记学习算法

Optimized Ranking Algorithm Based on Margin Criterion for Multi-Label Learning

RichHTML

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 4

编辑推荐

Metrics

本文评价