作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2021, Vol. 47 ›› Issue (2): 111-117. doi: 10.19678/j.issn.1000-3428.0056932

• 人工智能与模式识别 • 上一篇    下一篇

基于参数子空间和缩放因子的YOLO剪枝算法

杨民杰, 梁亚玲, 杜明辉   

  1. 华南理工大学 电子与信息学院, 广州 510641
  • 收稿日期:2019-12-17 修回日期:2020-01-19 出版日期:2021-02-15 发布日期:2020-02-17
  • 作者简介:杨民杰(1996-),男,硕士研究生,主研方向为目标检测及模型优化;梁亚玲(通信作者),副教授、博士;杜明辉,教授、博士。
  • 基金资助:
    国家自然科学基金(61701181);广东省自然科学基金(2017A030325430);广州市科技计划项目(201707010070)。

YOLO Pruning Algorithm Based on Parameter Subspace and Scaling Factor

YANG Minjie, LIANG Yaling, DU Minghui   

  1. School of Electronic and Information Engineering, South China University of Technology, Guangzhou 510641, China
  • Received:2019-12-17 Revised:2020-01-19 Online:2021-02-15 Published:2020-02-17

摘要: 为保证YOLO网络在嵌入式设备上正常运行,需采用剪枝算法精简滤波器以减小网络存储空间和计算量,而现有剪枝算法耗时较长且剪枝精度较低。提出一种基于参数子空间和批量归一化(BN)层缩放因子的双准则剪枝算法。将卷积层滤波器通过k均值聚类得到不同参数子空间,在子空间内使滤波器按权重排序并去除权重较低的滤波器,同时采用BN层缩放因子剪枝算法避免剪枝精度下降。实验结果表明,采用该算法剪枝后的YOLOv3网络在精度不变的情况下,占用的内存减少5/6且计算时间缩短1/3,与PF、CP等剪枝算法相比,该算法在保持较高网络精度的情况下计算量更少。

关键词: 模型压缩, 剪枝, 目标检测, 均值聚类, 缩放因子

Abstract: In order to ensure the normal operation of YOLO network on embedded devices,it is necessary to use pruning algorithm to simplify the filter to reduce the network storage space and the amount of calculation. However,the existing pruning algorithms are time-consuming and have low pruning precision.This paper proposes a bi-criteria pruning algorithm based on parameter subspace and Batch Normalization(BN) layer scaling factor.The convolution layer filter is clustered into different parameter subspaces by k-means clustering.In the subspace,the filters are sorted by weight,and the filters with lower weight are removed.The BN layer scaling factor pruning algorithm is used to avoid the degradation of pruning precision.Experimental results show that the memory occupied by the network which is pruned by the proposed algorithm is reduced by 5/6 while the precision remains unchanged and the computing time is reduced by 1/3.Compared with PF,CP and other pruning algorithms,the proposed algorithm requires less computation while maintaining high network precision.

Key words: model compression, pruning, object detection, mean clustering, scaling factor

中图分类号: