基于最优近邻的局部保持投影方法

doi:10.19678/j.issn.1000-3428.0067987

摘要/Abstract

摘要：

局部保持投影(LPP)方法是机器学习领域中一种经典的降维方法。然而LPP方法以及部分改进方法在构建数据的局部结构时简单地使用k最近邻(k-NN)分类算法寻找样本的近邻点, 容易受到参数k、噪声和异常值的影响。为了解决上述问题, 提出一种基于最优近邻的LPP方法。该方法使用寻找最优近邻算法, 在找到样本近邻点后, 进一步选择与样本有一定数量的共同近邻点的近邻样本作为最优近邻, 通过共同近邻点的限定来选择与样本最相似的近邻, 增强近邻样本间的相关性, 避免了传统LPP方法受参数k影响大等问题。在选择出足够的样本最优近邻后, 构建数据局部结构, 以便准确地反映数据的本质结构特征, 使降维后的数据能最大程度保留样本的有效信息, 提升后续机器学习模型的性能。公共图像数据集上的对比实验结果表明, 该方法具有较好的数据降维效果, 有效地提高了图像识别准确率。

关键词: 局部保持投影方法, 最优近邻, 近邻样本, 降维, 特征提取

Abstract:

Locality Preserving Projection(LPP) is a classical dimensionality reduction method used in machine learning. However, the LPP method and some improved methods simply use the k-Nearest Neighbor(k-NN) classification algorithm to find the nearest neighbors of the samples when constructing the local structure of the data, which is easily affected by the parameter k, noise, and outliers. To solve the above problems, a LPP projection method based on the optimal nearest neighbor algorithm is proposed. The proposed method employs the optimal nearest neighbor algorithm to find the sample nearest neighbor points. Then, the algorithm further selects the nearest neighbor samples with a certain number of common points as the optimal nearest neighbors. Then, the algorithm selects the nearest neighbors that are most similar to the samples by limiting the common nearest neighbor points, thereby enhancing the correlation between the nearest neighbor samples. This selection circumvents the problem of the traditional LPP method being greatly influenced by the parameter k. After selecting sufficient sample optimal nearest neighbors, the local structure of the data is constructed to accurately reflect the essential structural features of the data such that dimensionality reduction can retain the effective information of the samples to the maximum extent and improve the performance of the subsequent machine learning models. Comparative experimental results obtained using a public image dataset show that the proposed method has a good data dimensionality reduction effect and effectively improves image recognition accuracy.

Key words: Local Preserving Projection(LPP) method, optimal nearest neighbor, nearest neighbor sample, dimensionality reduction, feature extraction

赵俊涛, 李陶深, 卢志翔. 基于最优近邻的局部保持投影方法[J]. 计算机工程, 2024, 50(9): 161-168.

ZHAO Juntao, LI Taoshen, LU Zhixiang. Locality Preserving Projection Method Based on Optimal Nearest Neighbor[J]. Computer Engineering, 2024, 50(9): 161-168.

https://www.ecice06.com/CN/Y2024/V50/I9/161

图/表 9

图1 Yale数据集上算法识别率与维度的关系

Fig.1 Algorithm recognition rate versus dimensionality on the Yale dataset

图2 ORL数据集上算法识别率与维度的关系

Fig.2 Algorithm recognition rate versus dimensionality on the ORL dataset

图3 Coil-20数据集上算法识别率与维度的关系

Fig.3 Algorithm recognition rate versus dimensionality on the Coil-20 dataset

图4 FERET数据集上算法识别率与维度的关系

Fig.4 Algorithm recognition rate versus dimensionality on the FERET dataset

参考文献 27

1	ZHOU W, GONG Z X, GUO W, et al. Robust graph structure learning for multimedia data analysis. Wireless Communications and Mobile Computing, 2021, 2021, 9458188. doi: 10.1155/2021/9458188
2	LOHRMANN C, LUUKKA P, JABLONSKA-SABUKA M, et al. A combination of fuzzy similarity measures and fuzzy entropy measures for supervised feature selection. Expert Systems with Applications, 2018, 110, 216- 236. doi: 10.1016/j.eswa.2018.06.002
3	RAY P, REDDY S S, BANERJEE T. Various dimension reduction techniques for high dimensional data analysis: a review. Artificial Intelligence Review, 2021, 54(5): 3473- 3515. doi: 10.1007/s10462-020-09928-0
4	AYESHA S, HANIF M K, TALIB R. Overview and comparative study of dimensionality reduction techniques for high dimensional data. Information Fusion, 2020, 59, 44- 58. doi: 10.1016/j.inffus.2020.01.005
5	JIA W K, SUN M L, LIAN J, et al. Feature dimensionality reduction: a review. Complex & Intelligent Systems, 2022, 8(3): 2663- 2693.
6	ZEBARI R, ABDULAZEEZ A, ZEEBAREE D, et al. A comprehensive review of dimensionality reduction techniques for feature selection and feature extraction. Journal of Applied Science and Technology Trends, 2020, 1(1): 56- 70. doi: 10.38094/jastt1224
7	SEUNG H S, LEE D D. The manifold ways of perception. Science, 2000, 290(5500): 2268- 2269. doi: 10.1126/science.290.5500.2268
8	HAN H, LI W T, WANG J C, et al. Enhance explainability of manifold learning. Neurocomputing, 2022, 500, 877- 895. doi: 10.1016/j.neucom.2022.05.119
9	HE X, NIYOGI P. Locality preserving projections[C]//Proceedings of the 16th International Conference on Neural Information Processing Systems. New York, USA: ACM Press, 2003: 1-8.
10	HE X F, CAI D, YAN S C, et al. Neighborhood preserving embedding[C]//Proceedings of the 10th IEEE International Conference on Computer Vision. Washington D. C., USA: IEEE Press, 2005: 1208-1213.
11	QIAO L S, CHEN S C, TAN X Y. Sparsity preserving projections with applications to face recognition. Pattern Recognition, 2010, 43(1): 331- 341. doi: 10.1016/j.patcog.2009.05.005
12	CHEN F X, WANG Y C, WANG B, et al. Graph representation learning: a survey. APSIPA Transactions on Signal and Information Processing, 2020, 9(1): e15.
13	LU J L, WANG H L, ZHOU J, et al. Low-rank adaptive graph embedding for unsupervised feature extraction. Pattern Recognition, 2021, 113, 107758. doi: 10.1016/j.patcog.2020.107758
14	NIE F, WANG Z, WANG R, et al. Adaptive local embedding learning for semi-supervised dimensionality reduction. IEEE Transactions on Knowledge and Data Engineering, 2021, 34(10): 4609- 4621.
15	NIE F P, DONG X, LI X L. Unsupervised and semisupervised projection with graph optimization. IEEE Transactions on Neural Networks and Learning Systems, 2020, 32(4): 1547- 1559.
16	范君, 业巧林, 业宁. 基于改进的有监督无参局部保持投影算法的人脸识别. 山东大学学报(工学版), 2019, 49(1): 10- 16. URL
	FAN J, YE Q L, YE N. Face recognition based on improved prameter-free supervised locality preserving projections. Journal of Shandong University (Engineering Science), 2019, 49(1): 10- 16. URL
17	梁兴柱, 林玉娥, 许光宇. 无参数无相关最大化判别边界算法. 图学学报, 2019, 40(1): 105- 110. URL
	LIANG X Z, LIN Y E, XU G Y. Parameter-free uncorrelated maximum discriminant margin algorithm. Journal of Graphics, 2019, 40(1): 105- 110. URL
18	LU X H, LONG J, WEN J, et al. Locality preserving projection with symmetric graph embedding for unsupervised dimensionality reduction. Pattern Recognition, 2022, 131, 108844. doi: 10.1016/j.patcog.2022.108844
19	CHEN H, NIE F P, WANG R, et al. Adaptive flexible optimal graph for unsupervised dimensionality reduction. IEEE Signal Processing Letters, 2021, 28, 2162- 2166. doi: 10.1109/LSP.2021.3116521
20	WAN M H, CHEN X Y, ZHAN T M, et al. Low-rank 2D local discriminant graph embedding for robust image feature extraction. Pattern Recognition, 2023, 133, 109034. doi: 10.1016/j.patcog.2022.109034
21	LONG T H, GAO J B, YANG M Y, et al. Locality preserving projection via deep neural network[C]//Proceedings of the International Joint Conference on Neural Networks. Washington D. C., USA: IEEE Press, 2019: 1-8.
22	WANG A G, ZHAO S H, LIU J J, et al. Locality adaptive preserving projections for linear dimensionality reduction. Expert Systems with Applications, 2020, 151, 113352. URL
23	RAN R S, QIN H, ZHANG S G, et al. Simple and robust locality preserving projections based on maximum difference criterion. Neural Processing Letters, 2022, 54(3): 1783- 1804. doi: 10.1007/s11063-021-10706-4
24	RAN R S, FENG J, ZHANG S G, et al. A general matrix function dimensionality reduction framework and extension for manifold learning. IEEE Transactions on Cybernetics, 2022, 52(4): 2137- 2148.
25	LONG T H, SUN Y F, GAO J B, et al. Locality preserving projection based on Euler representation. Journal of Visual Communication and Image Representation, 2020, 70, 102796.
26	RAN R S, REN Y S, ZHANG S G, et al. A novel discriminant locality preserving projections method. Journal of Mathematical Imaging and Vision, 2021, 63(5): 541- 554.
27	WAN M H, YAO Y, ZHAN T M, et al. Supervised Low-Rank Embedded Regression(SLRER) for robust subspace learning. IEEE Transactions on Circuits and Systems for Video Technology, 2021, 32(4): 1917- 1927.

[1]	钱清, 龙永, 蒋忠远, 段春红, 王宏. 基于深度强化学习的自适应图像隐写算法[J]. 计算机工程, 2024, 50(8): 319-327.
[2]	胡庆. 多尺度融合与双输出U-Net网络的行人重识别[J]. 计算机工程, 2024, 50(6): 102-109.
[3]	梁松林, 林伟, 王珏, 杨庆. 面向后渗透攻击行为的网络恶意流量检测研究[J]. 计算机工程, 2024, 50(5): 128-138.
[4]	李振鲁, 黄威, 孙锴. 复杂环境下的轻量化道路目标识别算法研究[J]. 计算机工程, 2024, 50(4): 219-227.
[5]	袁文涛, 卫文韬, 高德民. 融合注意力机制的多视图卷积手势识别研究[J]. 计算机工程, 2024, 50(3): 208-215.
[6]	任义, 苏博, 袁帅. 教育领域下多维度特征命名实体识别方法[J]. 计算机工程, 2024, 50(10): 110-118.
[7]	陈君航, 杨祖元, 刘名扬, 李陵江. 基于正交约束的广义可分离非负矩阵分解算法[J]. 计算机工程, 2023, 49(8): 46-53.
[8]	马娜, 温廷新, 贾旭, 李晓会. 复杂光照条件下自适应的车脸重识别模型[J]. 计算机工程, 2023, 49(8): 275-282, 290.
[9]	戴浩磊, 黄永慧, 周郭许. 基于超图正则化非负张量链分解的聚类分析[J]. 计算机工程, 2023, 49(6): 81-89.
[10]	宋羽凯, 谢江. 基于多任务学习的轻量级语音情感识别模型[J]. 计算机工程, 2023, 49(5): 122-128.
[11]	霍跃华, 赵法起. 基于Stacking与多特征融合的加密恶意流量检测[J]. 计算机工程, 2023, 49(5): 165-172,180.
[12]	关日鹏, 况立群, 焦世超, 熊风光, 韩燮. 多模态特征融合与词嵌入驱动的三维检索方法[J]. 计算机工程, 2023, 49(4): 101-107,113.
[13]	耿磊, 傅洪亮, 陶华伟, 卢远, 郭歆莹, 赵力. 基于动态卷积递归神经网络的语音情感识别[J]. 计算机工程, 2023, 49(4): 125-130,137.
[14]	李培育, 张雅丽. 基于改进SRGAN模型的人脸图像超分辨率重建[J]. 计算机工程, 2023, 49(4): 199-205.
[15]	何悦, 陈广胜, 景维鹏, 徐泽堃. 基于深度多相似性哈希方法的遥感图像检索[J]. 计算机工程, 2023, 49(2): 206-212.

选择文件类型/文献管理软件名称

选择包含的内容