
Computer Engineering ›› 2023, Vol. 49 ›› Issue (8): 96-103, 110. doi: 10.19678/j.issn.1000-3428.0065405

• Artificial Intelligence and Pattern Recognition •

Deep Subspace Clustering Algorithm with Data Augmentation and Adaptive Self-Paced Learning

Yuyan JIANG1, Chengfeng TAO1, Ping LI2

  1. School of Management Science and Engineering, Anhui University of Technology, Maanshan 243032, Anhui, China
    2. School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
  • Received: 2022-08-01  Online: 2023-08-15  Published: 2022-10-24
  • About the authors:

    JIANG Yuyan (b. 1966), female, professor; main research interests: machine learning and intelligent computing

    TAO Chengfeng, M.S. candidate

    LI Ping, Ph.D.

  • Funding:
    National Natural Science Foundation of China (62006126)



Abstract:

Deep subspace clustering achieves better performance than traditional clustering by jointly performing self-expressive feature learning and cluster assignment. Despite the emergence of numerous deep subspace clustering algorithms across various applications, most fail to learn accurate clustering-oriented features. In this study, an improved deep subspace clustering algorithm is proposed to address the insufficient accuracy of the learned clustering-oriented feature representations, which degrades the final clustering performance. Random shifting and rotation are applied to the original samples for data augmentation; the autoencoder is then trained and optimized alternately on the augmented samples while the cluster assignments of the samples are updated, yielding more robust feature representations. In the fine-tuning phase, the target of each augmented sample in the loss function is to assign its original sample to a cluster center; this target computation may be wrong, and samples with wrong targets mislead the training of the autoencoder network. Therefore, an adaptive self-paced learning algorithm that requires no additional hyperparameters is used to select the most convincing samples in each iteration, improving generalization ability. Experiments on the MNIST, USPS, and COIL100 datasets show that the proposed algorithm reaches accuracies of 0.9318, 0.8934, and 0.7236, respectively. Ablation experiments and sensitivity analysis further verify the effectiveness of the algorithm.
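The augmentation step described in the abstract (random shifts and rotations of the original samples) can be sketched as follows. The shift range and rotation angle below are illustrative assumptions; the abstract does not state the exact values the authors used.

```python
import numpy as np
from scipy.ndimage import rotate, shift

def augment(images, max_shift=2, max_angle=10.0, rng=None):
    """Randomly shift and rotate a batch of 2-D images.

    Ranges are illustrative assumptions, not the paper's settings:
    shifts are drawn from [-max_shift, max_shift] pixels per axis,
    angles from [-max_angle, max_angle] degrees.
    """
    rng = np.random.default_rng() if rng is None else rng
    out = np.empty_like(images)
    for i, img in enumerate(images):
        dy, dx = rng.integers(-max_shift, max_shift + 1, size=2)
        angle = rng.uniform(-max_angle, max_angle)
        # Shift first, then rotate about the image center; pad with zeros
        # and keep the output the same shape as the input.
        shifted = shift(img, (dy, dx), mode="constant", cval=0.0)
        out[i] = rotate(shifted, angle, reshape=False,
                        mode="constant", cval=0.0)
    return out
```

In the alternating scheme the abstract describes, each training iteration would draw a fresh `augment(batch)` to train the autoencoder while the cluster assignments of the original samples are updated.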

Key words: deep learning, subspace clustering, data augmentation, adaptive self-paced learning, encoder
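The adaptive self-paced selection without extra hyperparameters might be realized by thresholding per-sample losses at a statistic of the current loss distribution, so the threshold adapts each iteration instead of being tuned. This is a heuristic sketch using the mean loss as the cutoff; the paper's exact selection rule is not given in the abstract.

```python
import numpy as np

def select_easy_samples(losses):
    """Self-paced selection sketch: keep samples whose loss is at most
    the current mean loss. The threshold is recomputed from the loss
    distribution every iteration, so no extra hyperparameter is needed.
    (Illustrative rule; the paper's exact criterion is not specified
    in the abstract.)"""
    losses = np.asarray(losses, dtype=float)
    threshold = losses.mean()          # adaptive, data-driven cutoff
    mask = losses <= threshold         # "most convincing" samples
    return np.flatnonzero(mask), mask

# Example: mean loss is 0.6, so samples 0, 2, and 4 are selected.
idx, mask = select_easy_samples([0.1, 0.9, 0.2, 1.5, 0.3])
```

Only the selected subset would then contribute to the fine-tuning loss in that iteration, keeping samples with likely-wrong cluster targets from misleading the autoencoder.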