基于半监督深度自编码网络的分类算法及应用

doi:10.19678/j.issn.1000-3428.0068598

摘要/Abstract

摘要：

在工业分类预测中, 有标签数据稀缺且标记成本高, 导致模型预测不准确, 同时大多数无标签数据中的特征未得到合理利用, 模型的泛化能力不足。为了解决这个问题, 提出半监督深度自编码网络(SSup-DDSAE-Link), 将有标签数据和无标签数据通过有监督学习和无监督学习进行结合, 提升模型预测准确率。该模型首先在深度自编码通道上, 分别添加高斯噪声和稀疏性约束, 提取与分类相关且更具代表性的特征表示; 其次在编码器与解码器之间引入横向连接, 过滤与分类任务不相关的信息, 使得网络能够更好地学习关键变量的特征表示, 并在网络顶层添加有监督学习路径来实现分类识别; 然后添加原始编码器, 与解码器中对应隐含层的输出一起训练, 从而构造无监督学习路径, 有效利用无标签数据中的信息; 最后通过有监督损失函数与无监督损失函数构造总损失函数, 实现对工业生产中关键变量的分类预测。实验结果表明, 与常用的有监督学习模型和传统的半监督学习模型相比, SSup-DDSAE-Link的分类预测准确率得到了有效提高, 并且精确率、召回率和F1值均得到提升。

关键词: 半监督学习, 降噪自编码器, 稀疏自编码器, 特征提取, 分类预测

Abstract:

In industrial classification prediction, labeled data are scarce, and labeling is expensive, leading to inaccurate model predictions. Simultaneously, features in most unlabeled data are not effectively used, resulting in insufficient generalization of the model. To solve this problem, this study proposes a Semi-Supervised Deep Auto-Encoder network (SSup-DDSAE-Link) that combines labeled and unlabeled data through supervised and unsupervised learning to improve the model's prediction accuracy. First, Gaussian noise and sparsity constraints are added to the deep Auto-Encoder (AE) channel to extract more representative features related to the classification. Second, a lateral connection is introduced between the encoder and decoder to filter information irrelevant to the classification task so that the network can better learn the feature representations of key variables. A supervised learning path is then added to the top layer of the network to realize classification and recognition. Subsequently, the original encoder is added and trained together with the output of the corresponding hidden layer in the decoder to realize an unsupervised learning path and effectively uses the information in the unlabeled data. Finally, the total loss function is constructed using supervised and unsupervised loss functions to classify and predict key variables in industrial production. The experimental results show that, compared with the commonly used supervised learning models and the traditional semi-supervised learning models, the proposed algorithm has better classification prediction accuracy along with effectively improved precision, recall, and F1 value.

Key words: semi-supervised learning, Denoising Auto-Encoder(DAE), Sparse Auto-Encoder(SAE), feature extraction, classification prediction

张新波, 张雪英, 黄丽霞, 陈桂军. 基于半监督深度自编码网络的分类算法及应用[J]. 计算机工程, 2025, 51(1): 71-80.

ZHANG Xinbo, ZHANG Xueying, HUANG Lixia, CHEN Guijun. Classification Algorithm and Application Based on Semi-Supervised Deep Auto-Encoder Network[J]. Computer Engineering, 2025, 51(1): 71-80.

https://www.ecice06.com/CN/Y2025/V51/I1/71

图/表 9

图1 DDSAE结构

Fig.1 Structure of DDSAE

图2 SSup-DDSAE-Link网络结构

Fig.2 Network structure of SSup-DDSAE-Link

图3 SSup-DDSAE-Link训练和测试过程

Fig.3 Training and testing process of SSup-DDSAE-Link

参考文献 29

1	连鹏隆. 基于深度学习的复杂工业过程软测量方法研究[D]. 西安: 西安理工大学, 2020.
	LIAN P L. Research on soft sensing method ofcomplex industrial process based on deep learning[D]. Xi'an: Xi'an University of Technology, 2020. (in Chinese)
2	罗顺桦, 王振雷, 王昕. 基于二子空间协同训练算法的半监督软测量建模. 化工学报, 2022, 73 (3): 1270- 1279.
	LUO S H , WANG Z L , WANG X . Semi-supervised soft sensor modeling based on two-subspace co-training algorithm. Journal of Chemical Industry and Technology, 2022, 73 (3): 1270- 1279.
3	陈亚瑞, 张芝慧, 杨剑宁, 等. 基于多模态生成模型的半监督学习. 天津科技大学学报, 2022, 37 (2): 43- 50.
	CHEN Y R , ZHANG Z H , YANG J N , et al. Semi-supervised learning based on multimodal generative model. Journal of Tianjin University of Science [WT《Times New Roman》]& Technology, 2022, 37 (2): 43- 50.
4	CANG W T , YANG H Z . Adaptive soft sensor method based on online selective ensemble of partial least squares for quality prediction of chemical process. Asia-Pacific Journal of Chemical Engineering, 2019, 14 (5): 1- 12.
5	王毅红, 张建雄, 兰官奇, 等. 压制生土砖强度的人工神经网络预测模型. 华南理工大学学报(自然科学版), 2020, 48 (7): 115- 121.
	WANG Y H , ZHANG J X , LAN G Q , et al. Artificial neural network prediction model forcompressive strength ofcompacted earth blocks. Journal of South China University of Technology (Natural Science Edition), 2020, 48 (7): 115- 121.
6	李元, 张昊展, 唐晓初. 基于多模态数据全信息的概率主成分分析故障检测研究. 仪器仪表学报, 2021, 42 (2): 75- 85.
	LI Y , ZHANG H Z , TANG X C . Study on probabilistic principalcomponent analysis fault detection based on full information of multimodal data. Chinese Journal of Scientific Instrument, 2021, 42 (2): 75- 85.
7	KENNEDY N , WIN T L , BANDYOPADHYAY A , et al. Insights from linking police domestic abuse data and health data in South Wales, UK: a linked routine data analysis using decision tree classification. The Lancet Public Health, 2023, 8 (8): e629- e638. doi: 10.1016/S2468-2667(23)00126-3
8	VAPNIK V N . Statistical learning theory. New York, USA: Wiley, 1998.
9	VOMMI A M , BATTULA T K . A hybrid filter-wrapper feature selection using fuzzy KNN based on Bonferroni mean for medical datasets classification: a COVID-19 case study. Expert Systems with Applications, 2023, 218, 119612. doi: 10.1016/j.eswa.2023.119612
10	RIZWAN-ul-HASSAN , LI C G , LIU Y T . Online dynamic security assessment of wind integrated power system using SDAE with SVM ensemble boosting learner. International Journal of Electrical Power [WT《Times New Roman》]& Energy Systems, 2021, 125, 106429.
11	RAO L B , PANG T , JI R S , et al. Combined with stack autoencoder-extreme learning machine method. Progress in Laser and Optoelectronics, 2019, 56 (11): 247- 253.
12	KHALIL R A , JONES E , BABAR M I , et al. Speech emotion recognition using deep learning techniques: a review. IEEE Access, 2019, 7, 117327- 117345. doi: 10.1109/ACCESS.2019.2936124
13	HOU L , LUO X Y , WANG Z Y , et al. Representation learning via a semi-supervised stacked distance autoencoder for image classification. Frontiers of Information Technology [WT《Times New Roman》]& Electronic Engineering, 2020, 21 (7): 1005- 1018.
14	HINTON G E , OSINDERO S , TEH Y W . A fast learning algorithm for deep belief nets. Neural Computation, 2006, 18 (7): 1527- 1554.
15	HAO X , ZHANG G G , MA S . Deep learning. International Journal of Semantic Computing, 2016, 10 (3): 417- 439.
16	VINCENT P , LAROCHELLE H , LAJOIE I , et al. Stacked denoising autoencoders: learning useful representations in a deep network with a local denoising criterion. Journal of Machine Learning Research, 2010, 11, 3371- 3408.
17	张国令, 王晓丹, 李睿, 等. 基于栈式降噪稀疏自编码器的极限学习机. 计算机工程, 2020, 46 (9): 61- 67.
	ZHANG G L , WANG X D , LI R , et al. Extreme learning machine based on stacked denoising sparse auto-encoder. Computer Engineering, 2020, 46 (9): 61- 67.
18	LEI Y X , KARIMI H R , CEN L H , et al. Processes soft modeling based on stacked autoencoders and wavelet extreme learning machine for aluminum plant-wide application. Control Engineering Practice, 2021, 108, 104706.
19	SCHÖLKOPF B, PLATT J, HOFMANN T. Efficient learning of sparse representations with an energy-based model[C]//SCHÖLKOPF B, PLATT J, HOFMANN T. Advances in neural information processing systems 19: proceedings of the 2006 conference. Cambridge, USA: MIT Press, 2006: 1137-1144.
20	DINESH P S , MANIKANDAN M . Fully convolutional deep stacked denoising sparse auto encoder network for partial face reconstruction. Pattern Recognition, 2022, 130, 108783.
21	XING C , MA L , YANG X Q . Stacked denoise autoencoder based feature extraction and classification for hyperspectral images. Journal of Sensors, 2016, 2016, 3632943.
22	LONG J , LIANG W , LI K C , et al. A regularized cross-layer ladder network for intrusion detection in industrial Internet of Things. IEEE Transactions on Industrial Informatics, 2023, 19 (2): 1747- 1755.
23	NAIR V, HINTON G E. Rectified linear units improve restricted Boltzmann machines[C]//Proceedings of the 27th International Conference on Machine Learning. New York, USA: ACM Press, 2010: 807-814.
24	VINCENT P, LAROCHELLE H, BENGIO Y, et al. Extracting andcomposing robust features with denoising autoencoders[C]//Proceedings of the 25th International Conference on Machine Learning. New York, USA: ACM Press, 2008: 1096-1103.
25	PEZESHKI M, FAN L X, BRAKEL P, et al. Deconstructing the ladder network architecture[EB/OL]. [2023-05-20]. https://arxiv.org/pdf/1511.06430 .
26	DU Y P , YAO C Q , HUO S H , et al. A new item-based deep network structure using a restricted Boltzmann machine for collaborative filtering. Frontiers of Information Technology [WT《Times New Roman》]& Electronic Engineering, 2017, 18 (5): 658- 666.
27	YANG P H , CHEN J R , ZHANG H R , et al. A fault identification method for electric submersible pumps based on DAE-SVM. Shock and Vibration, 2022, 2022, 5868630.
28	YU X , YANG J , ZHANG J P . A transductive support vector machine algorithm based on spectral clustering. AASRI Procedia, 2012, 1, 384- 388.
29	唐兵. 以天然气为碳源的CVI法制备C/C复合材料的研究[D]. 长沙: 中南大学, 2009.
	TANG B. Study on preparation of C/Ccomposites by CVI method with natural gas as carbon source[D]. Changsha: Central South University, 2009. (in Chinese)

[1]	赵俊涛, 李陶深, 卢志翔. 基于最优近邻的局部保持投影方法[J]. 计算机工程, 2024, 50(9): 161-168.
[2]	郭敏, 张熙涵, 李阳. 融合注意力的教师互一致性半监督医学图像分割[J]. 计算机工程, 2024, 50(9): 313-323.
[3]	钱清, 龙永, 蒋忠远, 段春红, 王宏. 基于深度强化学习的自适应图像隐写算法[J]. 计算机工程, 2024, 50(8): 319-327.
[4]	胡庆. 多尺度融合与双输出U-Net网络的行人重识别[J]. 计算机工程, 2024, 50(6): 102-109.
[5]	余长宏, 许孔豪, 张泽, 高明. 基于分割点改进孤立森林的网络入侵检测方法[J]. 计算机工程, 2024, 50(6): 148-156.
[6]	顾永跟, 高凌轩, 吴小红, 陶杰. 非独立同分布下联邦半监督学习的数据分享研究[J]. 计算机工程, 2024, 50(6): 188-196.
[7]	梁松林, 林伟, 王珏, 杨庆. 面向后渗透攻击行为的网络恶意流量检测研究[J]. 计算机工程, 2024, 50(5): 128-138.
[8]	李振鲁, 黄威, 孙锴. 复杂环境下的轻量化道路目标识别算法研究[J]. 计算机工程, 2024, 50(4): 219-227.
[9]	袁文涛, 卫文韬, 高德民. 融合注意力机制的多视图卷积手势识别研究[J]. 计算机工程, 2024, 50(3): 208-215.
[10]	郭尚伟, 刘树峰, 李子铭, 欧阳德强, 王宁, 向涛. 基于融合模型的网络安全态势感知方法[J]. 计算机工程, 2024, 50(11): 1-9.
[11]	王文静, 范涛, 王国中, 赵海武. 基于SA-BPNN多模态融合的教学质量评价方法[J]. 计算机工程, 2024, 50(11): 390-398.
[12]	任义, 苏博, 袁帅. 教育领域下多维度特征命名实体识别方法[J]. 计算机工程, 2024, 50(10): 110-118.
[13]	马娜, 温廷新, 贾旭, 李晓会. 复杂光照条件下自适应的车脸重识别模型[J]. 计算机工程, 2023, 49(8): 275-282, 290.
[14]	戴浩磊, 黄永慧, 周郭许. 基于超图正则化非负张量链分解的聚类分析[J]. 计算机工程, 2023, 49(6): 81-89.
[15]	陈仲磊, 伊鹏, 陈祥, 胡涛. 基于集成学习的系统调用实时异常检测框架[J]. 计算机工程, 2023, 49(6): 162-169,179.

选择文件类型/文献管理软件名称

选择包含的内容