
Computer Engineering ›› 2024, Vol. 50 ›› Issue (8): 153-164. doi: 10.19678/j.issn.1000-3428.0068223

• Cyberspace Security •

  • Supported by: National Natural Science Foundation of China (61702321)

Dual-Client Selection Algorithm Based on Model Similarity and Local Loss

Hongjiao LI, Baojin WANG*(), Zhaohui WANG, Renhao HU   

  1. College of Computer Science and Technology, Shanghai University of Electric Power, Shanghai 201306, China
  • Received:2023-08-11 Online:2024-08-15 Published:2023-12-29
  • Contact: Baojin WANG


Abstract:

Federated learning is a distributed machine learning technique that builds a global model collaboratively by aggregating local model parameters from clients. Existing client selection algorithms for federated learning operate either before or after local training. With statistically heterogeneous client data, pre-training selection algorithms may admit poorly performing clients into aggregation, reducing model accuracy, whereas post-training selection algorithms require all clients to participate in training, incurring significant communication overhead. To address these issues, this study proposes a Dual-Client Selection (DCS) algorithm, which selects a subset of clients before local training to reduce global-model downloads and, after the subset has trained, selects some of those clients for aggregation to reduce local-model uploads. Before local training, the server performs hierarchical clustering based on the cosine similarity between the local and global models, yielding different selection probability distributions from which an unbiased training subset is drawn, thereby better adapting to the statistical heterogeneity of the client data. After subset training, the server considers not only the local loss but also the cosine similarity between the local and global models to filter the aggregation subset, improving the accuracy of the global model. Experimental results on the Fashion-MNIST and CIFAR-10 datasets demonstrate that the DCS algorithm improves test accuracy by up to 8.55 percentage points compared with the baseline algorithms, with uplink and downlink communication overheads of O(mn+2d) and O(dn+m), respectively.
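The abstract outlines two server-side selection stages: hierarchical clustering on the cosine similarity between each client's local model and the global model before training, and a ranking that combines local loss with the same similarity after training. The sketch below illustrates these two stages under stated assumptions only: the flattened weight vectors, the Ward linkage, the proportional per-cluster quota, and the weighted score with parameter `alpha` are illustrative choices, not the authors' exact formulation.

```python
# Illustrative sketch of a two-stage (dual) client selection, loosely
# following the DCS abstract. All concrete choices below (Ward linkage,
# proportional quotas, the alpha-weighted score) are assumptions.
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster


def cosine_similarity(a, b):
    """Cosine similarity between two flattened weight vectors."""
    return float(a @ b) / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-12)


def select_training_subset(local_ws, global_w, m, k, rng):
    """Stage 1 (before training): cluster clients by their similarity to
    the global model, then sample across clusters in proportion to
    cluster size so the draw stays (approximately) unbiased."""
    sims = np.array([cosine_similarity(w, global_w) for w in local_ws])
    labels = fcluster(linkage(sims[:, None], method="ward"),
                      t=k, criterion="maxclust")
    chosen = []
    for c in np.unique(labels):
        members = np.flatnonzero(labels == c)
        # proportional allocation: each cluster contributes by its share
        quota = max(1, round(m * len(members) / len(local_ws)))
        chosen.extend(rng.choice(members, size=min(quota, len(members)),
                                 replace=False))
    return sorted(int(i) for i in chosen)[:m]


def select_aggregation_subset(subset, losses, sims, m_agg, alpha=0.5):
    """Stage 2 (after training): rank the trained clients by a weighted
    score of normalized local loss and similarity to the global model
    (the exact combination in the paper is not given in the abstract)."""
    losses = np.asarray(losses, dtype=float)
    span = losses.max() - losses.min()
    loss_n = (losses - losses.min()) / (span + 1e-12)
    score = alpha * loss_n + (1 - alpha) * np.asarray(sims, dtype=float)
    order = np.argsort(-score)  # highest score first
    return [subset[i] for i in order[:m_agg]]
```

A round would then download the global model only to the stage-1 subset and upload models only from the stage-2 subset, which is where the reduced uplink/downlink overhead in the abstract comes from.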

Key words: federated learning, client selection, model similarity, clustering, local loss