作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2022, Vol. 48 ›› Issue (6): 33-41. doi: 10.19678/j.issn.1000-3428.0064095

• 区块链理论与技术 • 上一篇    下一篇

基于区块链的公平性联邦学习模型

陈乃月, 金一, 李浥东, 蔡露鑫, 魏圆梦   

  1. 北京交通大学 计算机与信息技术学院, 北京 100044
  • 收稿日期:2022-03-04 修回日期:2022-04-14 发布日期:2022-05-03
  • 作者简介:陈乃月(1989—),女,讲师、博士,主研方向为联邦学习、数据挖掘;金一、李浥东,教授、博士;蔡露鑫、魏圆梦,硕士研究生。
  • 基金资助:
    科技创新2030—“新一代人工智能”重大项目(2021ZD0113002)。

Federated Learning Model with Fairness Based on Blockchain

CHEN Naiyue, JIN Yi, LI Yidong, CAI Luxin, WEI Yuanmeng   

  1. School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China
  • Received:2022-03-04 Revised:2022-04-14 Published:2022-05-03

摘要: 为解决典型联邦学习框架在训练样本数据分布不均衡情况下产生的聚合模型对各个客户端模型不公平的问题,结合区块链的去中心化、不可篡改性以及智能合约的特点,提出基于本地数据特征的公平性联邦学习模型,以实现数据分布差异的客户模型可信安全共享。多个客户端通过区块链上传本地参数以及信用值,利用区块链的共识机制选择信用值最高的区块进行模型聚合,在模型聚合过程中按照节点信用依次进行融合,并根据区块链记录工作节点的本地模型参数作为证据,完成整体模型参数的聚合任务,在此基础上通过广播下传当前聚合模型参数,模型利用区块链的共识机制可降低参数在传输过程中所面临的安全风险。在开源数据集上的实验结果表明,该模型相较FedAvg模型训练精度提高40%,不仅能够优化非独立同分布下的模型训练精度,同时可以防止中间参数传输信息泄露,保证了多个客户端的利益与安全隐私,从而实现具有隐私保护的公平性模型。

关键词: 联邦学习, 区块链, 非独立同分布, 公平性, 隐私保护

Abstract: This paper focuses on the unfair problem that the aggregation model generates by the non-independent and homogeneous distribution of training data in a typical Federated Learning (FL) framework.Accordingly, the characteristics of decentralization, immutability, and smart contracts of blockchain are combined to propose a blockchain-based fairness FL model to achieve trusted and secure sharing of customer models with differences in data distribution.First, each client uploads local parameters and trust values through the blockchain.Subsequently, the consensus mechanism of the blockchain was utilized to select the block with the highest trust value for model aggregation.The chain records the local model parameters of the working nodes as evidence to complete the aggregation task of the overall model parameters.Ultimately, the proposed algorithm downloads the current model parameters by broadcasting with low-security risks in the parameter transmission process.Experiments under the training of the open-source dataset reveal that the training accuracy of this model is improved by 40% compared with the FedAvg model.Accordingly, besides improving the training accuracy with the non-independent data distribution of FL, the FL algorithm also prevents information leakage in intermediate parameter transmission.Thus, it is the fairness model of privacy protection which ensures the interests, security, and privacy of each client are upheld.

Key words: Federated Learning(FL), blockchain, non-independent identically distribution, fairness, privacy protection

中图分类号: