
计算机工程 (Computer Engineering) ›› 2024, Vol. 50 ›› Issue (2): 180-187. doi: 10.19678/j.issn.1000-3428.0067077

• Cyberspace Security •

Adversarial Example Generation Algorithm Based on Transformer and GAN

Shuaiwei LIU (刘帅威)*, Zhi LI (李智), Guomei WANG (王国美), Li ZHANG (张丽)

  1. State Key Laboratory of Public Big Data, College of Computer Science and Technology, Guizhou University, Guiyang 550025, Guizhou, China
  • Received: 2023-03-03  Online: 2024-02-15  Published: 2024-02-19
  • Corresponding author: Shuaiwei LIU
  • Supported by:
    National Natural Science Foundation of China (62062023)

Adversarial Example Generation Algorithm Based on Transformer and GAN

Shuaiwei LIU*, Zhi LI, Guomei WANG, Li ZHANG

  1. State Key Laboratory of Public Big Data, College of Computer Science and Technology, Guizhou University, Guiyang 550025, Guizhou, China
  • Received: 2023-03-03  Online: 2024-02-15  Published: 2024-02-19
  • Contact: Shuaiwei LIU

Abstract:

Adversarial attack and defense is an active research direction in computer security. To address the poor visual quality of existing gradient-based adversarial example generation methods and the low generation efficiency of optimization-based methods, an adversarial example generation algorithm based on the Transformer and the Generative Adversarial Network (GAN), named Trans-GAN, is proposed. First, exploiting the strong visual representation capability of the Transformer, it is used as a reconstruction network that receives a clean image and generates attack noise. Second, the Transformer reconstruction network serves as the generator and is combined with a discriminator based on a deep convolutional network to form a GAN architecture, which improves the realism of the generated images and ensures training stability; meanwhile, an improved attention mechanism, Targeted Self-Attention, is proposed, which introduces the target label as prior knowledge during training and guides the model to learn to generate adversarial perturbations with a specific attack target. Finally, a skip connection is used to apply the adversarial noise to the clean example, forming an adversarial example that attacks the target classification network. Experimental results show that Trans-GAN achieves attack success rates above 99.9% against both models on the MNIST dataset and 96.36% and 98.47% against the two models on the CIFAR10 dataset, outperforming current state-of-the-art generative adversarial example generation methods. Compared with the Fast Gradient Sign Method (FGSM) and Projected Gradient Descent (PGD), the adversarial noise generated by Trans-GAN has a smaller perturbation magnitude, and the resulting adversarial examples are more natural and hard for human vision to distinguish.
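The abstract names a Targeted Self-Attention mechanism that injects the target label as prior knowledge, but this page does not give its exact formulation. The PyTorch-style sketch below shows one plausible way such conditioning could work: the target class is embedded as an extra token that attends jointly with the patch tokens. All names (TargetedSelfAttention, label_embed, num_classes) are illustrative assumptions, not the paper's implementation.

```python
# Minimal sketch, assuming the target label is injected as an extra token
# into an otherwise standard self-attention block (an assumption, not the paper's code).
import torch
import torch.nn as nn

class TargetedSelfAttention(nn.Module):
    """Self-attention over image patch tokens, conditioned on an attack-target label."""
    def __init__(self, dim, num_classes, num_heads=4):
        super().__init__()
        self.label_embed = nn.Embedding(num_classes, dim)   # target label as prior knowledge
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.norm = nn.LayerNorm(dim)

    def forward(self, tokens, target_label):
        # tokens: (B, N, dim) patch embeddings; target_label: (B,) class indices
        label_token = self.label_embed(target_label).unsqueeze(1)   # (B, 1, dim)
        x = torch.cat([label_token, tokens], dim=1)                 # prepend the label token
        attn_out, _ = self.attn(x, x, x)                            # standard QKV self-attention
        x = self.norm(x + attn_out)                                 # residual connection + norm
        return x[:, 1:, :]                                          # drop label token, keep patches

if __name__ == "__main__":
    block = TargetedSelfAttention(dim=64, num_classes=10)
    out = block(torch.randn(2, 49, 64), torch.tensor([3, 7]))
    print(out.shape)  # torch.Size([2, 49, 64])
```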

Key words: deep neural network, adversarial example, adversarial attack, Transformer model, Generative Adversarial Network (GAN), attention mechanism

Abstract:

Adversarial attack and defense is a popular research area in computer security. Trans-GAN, an adversarial example generation algorithm based on the combination of the Transformer and the Generative Adversarial Network (GAN), is proposed to address the poor visual quality of existing gradient-based adversarial example generation methods and the low generation efficiency of optimization-based methods. First, the algorithm exploits the powerful visual representation capability of the Transformer, using it as a reconstruction network that receives clean images and generates adversarial noise. Second, the Transformer reconstruction network serves as the generator and is combined with a discriminator based on a deep convolutional network to form a GAN architecture, which improves the authenticity of the generated images and ensures the stability of training. Meanwhile, an improved attention mechanism, Targeted Self-Attention, is proposed that introduces target labels as prior knowledge during training and guides the network model to learn to generate adversarial perturbations with specific attack targets. Finally, the adversarial noise is added to the clean examples through skip connections to form adversarial examples that attack the target classification network. Experimental results demonstrate that the proposed algorithm achieves attack success rates of more than 99.9% on both models used for the MNIST dataset and of 96.36% and 98.47% on the two models used for the CIFAR10 dataset, outperforming current state-of-the-art generative adversarial attack methods. Qualitative results show that, compared with the Fast Gradient Sign Method (FGSM) and Projected Gradient Descent (PGD) algorithms, Trans-GAN generates adversarial noise with a smaller perturbation magnitude, and the resulting adversarial examples are more natural and not easily distinguished by human vision.
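To make the pipeline in the abstract concrete, the sketch below outlines one possible generator-update step: the Transformer reconstruction network produces attack noise, a skip connection adds that noise to the clean image, and a GAN realism term is combined with a targeted-attack classification term. The function and argument names (trans_gan_step, generator, discriminator, target_model, eps) and the tanh bounding of the noise are assumptions for illustration, not the authors' released code.

```python
# Hedged sketch of one Trans-GAN-style generator step, under assumed component names.
import torch
import torch.nn.functional as F

def trans_gan_step(generator, discriminator, target_model, x_clean, target_label, eps=0.1):
    # 1) Transformer reconstruction network maps the clean image (and target label) to attack noise.
    noise = generator(x_clean, target_label)
    noise = eps * torch.tanh(noise)              # bound the perturbation (assumed design choice)

    # 2) Skip connection: adversarial example = clean image + generated noise.
    x_adv = torch.clamp(x_clean + noise, 0.0, 1.0)

    # 3) GAN realism term: the generator wants the discriminator to score x_adv as real.
    d_fake = discriminator(x_adv)
    loss_gan = F.binary_cross_entropy_with_logits(d_fake, torch.ones_like(d_fake))

    # 4) Targeted-attack term: push the victim classifier toward the chosen target class.
    logits = target_model(x_adv)
    loss_attack = F.cross_entropy(logits, target_label)

    # d_real would be used in a separate discriminator update (not shown here).
    d_real = discriminator(x_clean)
    return x_adv, loss_gan + loss_attack, (d_real, d_fake)
```

In this reading, the discriminator is trained in alternation with the generator (as in a standard GAN), while the attack term supplies the "specific attack target" guidance described above.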

Key words: deep neural network, adversarial example, adversarial attack, Transformer model, Generative Adversarial Network (GAN), attention mechanism