
计算机工程 (Computer Engineering), 2026, Vol. 52, Issue 5: 129-138. doi: 10.19678/j.issn.1000-3428.0070288

• Computational Intelligence and Pattern Recognition •

Federated Learning on Long-Tail Data Combining Rotational Self-Supervision and CLIP Guidance

LIU Haijun1, FU Xiaodong1,2,*

  1. Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, Yunnan, China
    2. Yunnan Key Laboratory of Computer Technology Application, Kunming University of Science and Technology, Kunming 650500, Yunnan, China
  • Received: 2024-08-26; Revised: 2024-11-21; Online: 2026-05-15; Published: 2025-01-03
  • Corresponding author: FU Xiaodong
  • About the authors:

    LIU Haijun (CCF student member), male, M.S. candidate; his research interests include federated learning and long-tail data.

    FU Xiaodong (corresponding author), Ph.D., professor, and doctoral supervisor.

  • Funding:
    National Natural Science Foundation of China (62362043); Yunnan "Xingdian Talent Support Program" (KKXY202203008); Yunnan Provincial Science and Technology Program (202205AF150003, 202204BQ040010, 202102AD080002)



Abstract:

Real-world data often follow a long-tail distribution, and federated learning methods that assume a balanced global data distribution struggle to classify the tail classes of such data accurately. Existing studies mitigate the impact of long-tail data by retraining a balanced classifier for the global model. However, this approach considers neither balancing the model's feature extractor nor how to enable the feature extractor to learn high-quality image features, leading to poor global-model performance. To let the model learn high-quality image features without bias during the feature-learning stage, this study proposes a federated learning method combining rotational self-supervision and Contrastive Language-Image Pre-training (CLIP) guidance. The method uses rotational self-supervision to guide the training of local client models, reducing the impact of long-tail data on the client models and enabling them to learn high-quality image features. At the same time, CLIP is used to guide both the model's standard training and its training on the rotated images, transferring CLIP's rich knowledge to the client models and further improving the feature extractor. In experiments on the CIFAR-10 and CIFAR-100 datasets under different long-tail distributions, the proposed method improves the global model's classification accuracy by 2.35 to 4.72 percentage points compared with existing federated learning methods.
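
As a rough illustration of the two training signals described in the abstract, the following PyTorch-style sketch combines a four-way rotation pretext loss with CLIP-guided distillation on both the original and rotated images. All names (rotate_batch, rot_head, clip_text_features) and the loss weights are illustrative assumptions, not the paper's actual code or hyperparameters.

    import torch
    import torch.nn.functional as F

    def rotate_batch(images):
        # Four-way rotation pretext task: rotate each image by 0/90/180/270
        # degrees and label it with the index of the rotation applied.
        rotations = [torch.rot90(images, k, dims=(2, 3)) for k in range(4)]
        rot_images = torch.cat(rotations, dim=0)
        rot_labels = torch.arange(4, device=images.device).repeat_interleave(images.size(0))
        return rot_images, rot_labels

    def client_loss(backbone, cls_head, rot_head, clip_image_encoder,
                    clip_text_features, images, labels,
                    tau=0.07, lam_rot=0.5, lam_kd=0.5):
        # (1) Supervised loss on the original images.
        feats = backbone(images)
        ce_loss = F.cross_entropy(cls_head(feats), labels)

        # (2) Rotation self-supervision: predict which rotation was applied.
        rot_images, rot_labels = rotate_batch(images)
        rot_feats = backbone(rot_images)
        rot_loss = F.cross_entropy(rot_head(rot_feats), rot_labels)

        # (3) CLIP guidance on both original and rotated images: distill
        # CLIP's zero-shot logits into the client classifier via KL divergence.
        # Assumes images are already sized/normalized for the CLIP encoder.
        with torch.no_grad():
            clip_feats = clip_image_encoder(torch.cat([images, rot_images], dim=0))
            clip_feats = F.normalize(clip_feats, dim=-1)
            teacher_logits = clip_feats @ clip_text_features.t() / tau
        student_logits = torch.cat([cls_head(feats), cls_head(rot_feats)], dim=0)
        kd_loss = F.kl_div(F.log_softmax(student_logits / tau, dim=-1),
                           F.softmax(teacher_logits, dim=-1),
                           reduction="batchmean")

        return ce_loss + lam_rot * rot_loss + lam_kd * kd_loss

The distillation term assumes clip_text_features are pre-computed, L2-normalized CLIP text embeddings of the class-name prompts, so CLIP's zero-shot logits act as a soft teacher for both the original and rotated views.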

Key words: federated learning, long-tailed distribution, heterogeneous data, self-supervised learning, Contrastive Language-Image Pre-training (CLIP)