
Computer Engineering ›› 2022, Vol. 48 ›› Issue (5): 67-73. doi: 10.19678/j.issn.1000-3428.0061461

• Artificial Intelligence and Pattern Recognition •

  • About the authors: FANG Zhiyuan (b. 1995), male, M.S. candidate; his main research interest is model compression and acceleration. SHI Shoudong, associate professor, Ph.D.; ZHENG Jiaqing and HU Jiadian, M.S. candidates.
  • Funding:
    Ningbo Public Welfare Project "Research and Implementation of a Children's Learning Posture Recognition System Based on Deep Learning" (2019C50020).

Pruning Method of Convolutional Neural Network Model with Weak Layer Penalty

FANG Zhiyuan, SHI Shoudong, ZHENG Jiaqing, HU Jiadian   

  1. Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, Zhejiang 315211, China
  • Received:2021-04-26 Revised:2021-06-19 Published:2021-05-25


Abstract: The heavy memory and computation demands of deep convolutional neural networks make them difficult to deploy on resource-constrained embedded devices. To minimize the resource consumption of a deep convolutional neural network model during inference, this study introduces a criterion for judging the importance of convolution kernels based on the geometric median, and further proposes a structured, non-uniform pruning method for convolutional neural network models with a weak layer penalty. First, the algorithm uses the Euclidean distance to compute the information distance between the convolution kernels of each layer. The data distribution characteristics of each convolutional layer's information distances are then used to identify weak layers, and a contribution-based normalization function is applied to penalize the weak layers and eliminate the differences between layers. Second, kernel importance is evaluated at the global level, and a global mask technique is used to prune all convolution kernels dynamically. Experimental results on the CIFAR-10, CIFAR-100, and SVHN datasets demonstrate that, compared with the SFP, PFEC, FPGM, and MIL pruning methods, the proposed method effectively reduces the number of parameters and Floating-Point Operations (FLOPs) of the pruned VGG16 single-branch, ResNet multi-branch, and MobileNet-v1 lightweight network models while keeping the loss of accuracy small.
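The pipeline the abstract describes (geometric-median-style kernel scoring via pairwise Euclidean distances, per-layer normalization, then a global mask over all kernels) can be sketched as follows. This is an illustrative reconstruction, not the authors' code: filters are represented as flattened vectors, and the exact form of the contribution-based normalization and the pruning threshold are assumptions.

```python
import math

def filter_scores(layer):
    """Score each filter by its summed Euclidean distance to all other
    filters in the same layer (geometric-median-style criterion: a filter
    close to all the others is redundant and receives a low score)."""
    return [sum(math.dist(f, g) for g in layer) for f in layer]

def normalize(scores):
    """Contribution-based normalization (assumed form: each score divided
    by the layer's total), so scores are comparable across layers and
    weak layers cannot dominate the global ranking."""
    total = sum(scores) or 1.0
    return [s / total for s in scores]

def global_mask(layers, prune_ratio):
    """Rank all filters of all layers globally by normalized score and
    mask out the lowest-scoring fraction. Masked filters are only zeroed,
    not removed, so they can recover during retraining (dynamic pruning)."""
    scored = []
    for li, layer in enumerate(layers):
        for fi, s in enumerate(normalize(filter_scores(layer))):
            scored.append((s, li, fi))
    scored.sort()
    n_prune = int(len(scored) * prune_ratio)
    mask = {(li, fi): True for _, li, fi in scored}
    for _, li, fi in scored[:n_prune]:
        mask[(li, fi)] = False  # pruned: zero this filter's output
    return mask
```

For example, a layer containing two identical filters gets zero pairwise distances, so both score 0 after normalization and are pruned first in the global ranking; distinctive filters in other layers survive.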

Key words: model pruning, weak layer penalty, global mask, Euclidean distance, kernel importance evaluation
