
Computer Engineering

   

Data Augmentation for Federated Image Classification Using Diffusion Models

  

  • Published: 2026-04-28


Abstract: Federated learning is a distributed machine learning paradigm that leverages decentralized data resources while preserving data privacy. In real-world scenarios, however, data across clients are often non-IID (not independent and identically distributed), leading to label shift and class imbalance, which hinder the convergence of the global model and degrade its generalization performance. To mitigate the impact of such data heterogeneity on model performance, we propose a cross-client data augmentation and classification framework based on diffusion models. In this framework, each client trains an initial diffusion model on its local data and uploads the model parameters to the server. The server aggregates these parameters into a global diffusion model, which is then sent back to all clients. Each client uses the global diffusion model to generate supplementary samples and uploads them to the server, where they augment the training data and balance the local class distributions, thereby improving classifier performance. Finally, the classification model is trained via federated learning on both the local data and the generated samples, and is deployed to clients for image classification and recognition. To generate high-quality images, a denoising diffusion probabilistic model (DDPM) serves as the generation backbone, while a ResNet-18 architecture is used for the federated classification model. Experimental results show that the fine-tuned global diffusion model generates images more consistent with the real data distribution. Augmenting the data with generated samples makes the local data distribution on each client more balanced, significantly improving global classification accuracy.
Under the non-IID condition with a Dirichlet coefficient α=0.1, accuracy on CIFAR-10 and CIFAR-100 increased from 46.76% and 21.31% to 54.64% and 25.57%, respectively, demonstrating the effectiveness of the proposed data augmentation strategy in mitigating class imbalance.
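The server-side aggregation step described in the abstract — combining client diffusion-model parameters into a global model — is typically realized with sample-count-weighted FedAvg. The sketch below is a minimal illustration of that weighting (the abstract does not specify the aggregation rule, so FedAvg is an assumption; parameters are represented as plain NumPy state dicts rather than a real diffusion model):

```python
import numpy as np

def fedavg(client_states, client_sizes):
    """FedAvg: weighted average of client parameter dicts.

    client_states: list of {name: np.ndarray} state dicts, one per client.
    client_sizes:  number of local training samples per client (the weights).
    """
    total = float(sum(client_sizes))
    global_state = {}
    for name in client_states[0]:
        # Each parameter tensor is averaged with weight n_k / n_total.
        global_state[name] = sum(
            (n / total) * state[name]
            for state, n in zip(client_states, client_sizes)
        )
    return global_state

# Toy example: two clients sharing one "layer"; client b holds 3x the data.
a = {"w": np.array([1.0, 3.0])}
b = {"w": np.array([3.0, 5.0])}
g = fedavg([a, b], client_sizes=[1, 3])
print(g["w"])  # pulled toward client b: [2.5 4.5]
```

In the proposed framework this aggregation would run twice: once for the diffusion model and once for the ResNet-18 classifier.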
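The experiments simulate label skew with a Dirichlet coefficient α=0.1, a standard protocol in which each class's samples are split across clients according to Dirichlet-drawn proportions; smaller α yields more imbalanced local distributions. A minimal sketch of that partitioning (the paper does not give its exact splitting code, so the helper below is illustrative):

```python
import numpy as np

def dirichlet_partition(labels, num_clients, alpha, seed=0):
    """Assign sample indices to clients with a per-class Dirichlet prior.

    Smaller alpha -> more skewed (non-IID) per-client label distributions.
    """
    rng = np.random.default_rng(seed)
    clients = [[] for _ in range(num_clients)]
    for c in np.unique(labels):
        idx = np.where(labels == c)[0]
        rng.shuffle(idx)
        # Fraction of class-c samples that each client receives.
        props = rng.dirichlet(alpha * np.ones(num_clients))
        cuts = (np.cumsum(props)[:-1] * len(idx)).astype(int)
        for client, part in zip(clients, np.split(idx, cuts)):
            client.extend(part.tolist())
    return clients

# Toy labels: 10 classes x 100 samples, alpha=0.1 as in the experiments.
labels = np.repeat(np.arange(10), 100)
parts = dirichlet_partition(labels, num_clients=5, alpha=0.1)
assert sum(len(p) for p in parts) == len(labels)  # every sample assigned once
```

After such a split, some clients see only a few classes, which is the imbalance the generated samples are meant to correct.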
