Privacy-Preserving Decentralized Federated Multi-View Clustering

doi:10.19678/j.issn.1000-3428.0069305

Abstract

In the era of big data, multi-view data are abundant, and most existing multi-view clustering methods pool the data of all views for learning. In practice, however, data from different views are often stored on different devices, and some of these data are private and cannot be shared. If the data of each view are regarded as a node in a distributed network, federated learning can address both the data-sharing and the privacy problems; federated multi-view clustering is the class of methods obtained by introducing federated learning into multi-view clustering. Federated learning, however, relies on a central server for coordination and fails when that server is missing or faulty. To address this issue, this paper proposes a Decentralized Federated Multi-view Clustering (DFMC) method. First, the low-dimensional representation of each view is learned using Non-negative Matrix Factorization (NMF). Next, based on the consistency of information across views, a consistency constraint is imposed on the low-dimensional representations of different views. This constraint enables communication between neighboring views and establishes a decentralized federated learning environment, yielding a unified low-dimensional representation that is then used for clustering. On this basis, an Alternating Minimization (AM) algorithm solves the problem for each view separately, which preserves privacy. Experimental results on real datasets verify the effectiveness and convergence of DFMC.

Key words: multi-view clustering, Non-negative Matrix Factorization (NMF), federated learning, decentralization, privacy protection
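The pipeline outlined in the abstract (per-view NMF, a consistency constraint between neighboring views, and alternating updates carried out view by view) can be illustrated with a minimal NumPy sketch. It is not the paper's algorithm: it assumes a quadratic penalty with a hypothetical weight `rho` in place of the exact consistency constraint, uses standard multiplicative NMF updates as the alternating step, and takes the neighbor graph as a plain dictionary. The function names `objective` and `decentralized_nmf` are illustrative only. Each view keeps its data `X` and basis `W` local and reads only the low-dimensional representations `H` of its neighbors.

```python
import numpy as np


def objective(views, W, H, neighbors, rho):
    """Reconstruction error plus a penalty on disagreement between neighbors."""
    fit = sum(np.linalg.norm(X - W[v] @ H[v]) ** 2 for v, X in enumerate(views))
    gap = sum(np.linalg.norm(H[v] - H[j]) ** 2
              for v in neighbors for j in neighbors[v])
    # Every undirected edge appears twice in `gap`, hence the factor 0.5.
    return fit + 0.5 * rho * gap


def decentralized_nmf(views, neighbors, k, rho=1.0, iters=100, seed=0):
    """Penalty-based sketch: views[v] is a (d_v x n) nonnegative matrix,
    neighbors[v] lists the views that v may exchange H with (assumed graph)."""
    rng = np.random.default_rng(seed)
    n = views[0].shape[1]
    eps = 1e-10  # keeps factors strictly positive and avoids division by zero
    W = [rng.random((X.shape[0], k)) + eps for X in views]
    H = [rng.random((k, n)) + eps for _ in views]
    for _ in range(iters):
        for v, X in enumerate(views):
            # Local step: update the basis of view v from its own data only.
            W[v] *= (X @ H[v].T) / (W[v] @ (H[v] @ H[v].T) + eps)
            # Communication step: only neighbors' H matrices are read.
            nbr_sum = sum(H[j] for j in neighbors[v])
            num = W[v].T @ X + rho * nbr_sum
            den = (W[v].T @ W[v]) @ H[v] + rho * len(neighbors[v]) * H[v] + eps
            H[v] *= num / den
    return W, H
```

As a usage example, four views connected in a ring (an assumed topology, `{0: [1, 3], 1: [0, 2], 2: [1, 3], 3: [2, 0]}`) can be passed as `neighbors`. A unified representation can then be formed, for instance, by averaging the returned `H` matrices before running a clustering step such as k-means. A larger `rho` pulls the per-view representations closer together at the cost of reconstruction accuracy.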

LEI Yifan, CHEN Xiaohong. Privacy-Preserving Decentralized Federated Multi-View Clustering[J]. Computer Engineering, 2025, 51(7): 180-189.

https://www.ecice06.com/CN/Y2025/V51/I7/180

Figures/Tables 9

Fig.1 Undirected connected graph with four views

Fig.2 Matrix visualization and constraint violation curves

Fig.3 Comparison of convergence curves


