Computer Engineering

• Advanced Computing and Data Processing

Robust Low-rank Self-representation Feature Selection Algorithm

HU Rongyao 1, LIU Xingyi 2, CHENG Debo 1, HE Wei 1, LUO Yan 1

  1. (1. Guangxi Key Lab of Multi-source Information Mining and Security, Guangxi Normal University, Guilin, Guangxi 541004, China; 2. Qinzhou University, Qinzhou, Guangxi 535000, China)
  • Received: 2016-06-28  Online: 2017-09-15  Published: 2017-09-15
  • About the authors: HU Rongyao (b. 1992), male, M.S. candidate; research interests include data mining and machine learning. LIU Xingyi, associate professor, M.S.; CHENG Debo, HE Wei and LUO Yan, M.S. candidates.
  • Funding:
    National Natural Science Foundation of China (61263035, 61573270); China Postdoctoral Science Foundation (2015M570837); Natural Science Foundation of Guangxi (2015GXNSFCB139011); Innovation Project of Guangxi Graduate Education (YCSZ2016046).



Abstract: Since unsupervised feature selection algorithms lack label information and ignore the low-rank characteristics of the data, this paper proposes a new low-rank feature selection algorithm based on the self-representation method. In the loss function, low-rank and self-representation terms describe the correlation structure among features, and K-means clustering is used to obtain pseudo labels for all samples to guide feature selection. The parameter p of the l2,p-norm from sparse learning is adopted to control the sparsity of the feature selection result, and a subspace learning method drives the result toward the global optimum. Experimental results on six public datasets demonstrate that the proposed algorithm achieves higher classification accuracy and better stability than existing unsupervised feature selection algorithms.
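As a rough illustration of the role the l2,p-norm plays in the abstract above, the sketch below (a minimal example of the general technique, not the authors' implementation; the function names and the toy weight matrix are hypothetical) ranks features by the l2,p row norm of a feature-weight matrix and keeps the top k. Features whose weight rows are near zero across all pseudo-classes score low and are discarded, which is the sparsity effect the parameter p controls.

```python
import numpy as np

def l2p_row_scores(W, p):
    """Row-wise l2 norms raised to the power p; summing them gives the
    l2,p-norm of W raised to p. Monotone in the row norm for any p > 0."""
    return np.linalg.norm(W, axis=1) ** p

def select_features(W, k, p=1.0):
    """Rank features (rows of W) by their l2,p score and keep the top k."""
    scores = l2p_row_scores(W, p)
    return np.argsort(scores)[::-1][:k]

# Toy weight matrix: 5 features x 3 pseudo-classes (values made up).
W = np.array([[0.90, 0.80, 0.70],   # strong feature
              [0.00, 0.10, 0.00],   # near-zero row -> pruned
              [0.50, 0.40, 0.60],
              [0.05, 0.00, 0.02],   # near-zero row -> pruned
              [0.30, 0.20, 0.10]])

print(select_features(W, k=2, p=0.5))  # -> [0 2]
```

Since x**p is monotone for p > 0, the ranking itself does not change with p; in the full optimization problem, however, a smaller p penalizes small rows more aggressively and so produces sparser weight matrices.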

Key words: feature selection, subspace learning, K-means clustering, low-rank constraint, sparse learning

CLC Number: