Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering ›› 2021, Vol. 47 ›› Issue (9): 59-68. doi: 10.19678/j.issn.1000-3428.0058784

• Artificial Intelligence and Pattern Recognition • Previous Articles     Next Articles

Sensitivity-based Integrated Pruning Algorithm for YOLO Network

ZHANG Jiangyong1,2, XU Zhiyong1, ZHANG Jianlin1, XU Tao3   

  1. 1. Institute of Optics and Electronics, Chinese Academy of Sciences, Chengdu 610209, China;
    2. School of Electronic, Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing 100049, China;
    3. China State Shipbuilding Corporation Systems Engineering Research Institute, Beijing 100036, China
  • Received:2020-06-29 Revised:2020-08-17 Published:2020-09-01

基于敏感度的YOLO网络集成剪枝算法

张江永1,2, 徐智勇1, 张建林1, 许涛3   

  1. 1. 中国科学院光电技术研究所, 成都 610209;
    2. 中国科学院大学 电子电气与通信工程学院, 北京 100049;
    3. 中国船舶工业系统工程研究院, 北京 100036
  • 作者简介:张江永(1995-),男,硕士研究生,主研方向为机器学习、深度学习、模型压缩;徐智勇(通信作者)、张建林,研究员;许涛,高级工程师。
  • 基金资助:
    国家重点研发计划(G158207)。

Abstract: Deep Convolutional Neural Network(CNN) require considerable storage space and operation times, which hampers their application and deployment on platforms with limited resources.The pruning algorithms can alleviate this problem, but the ones based on a single method for parameter importance evaluation or feature reconstruction have poor generalization performance.To address the problem, an integrated pruning algorithm based on sensitivity is proposed.The algorithm employs sparse scaling factor of BN layer to reduce the density of the layers with many convolutional kernels in YOLO.Then three methods for parameter importance evaluation are used to sort the convolutional kernels by importance.The ratio of to-be-pruned parts of each layer is determined according to the sensitivity.Experimental results show that the proposed algorithm reduces the parameter number of YOLOv3 by 80.5% and YOLOv3-tiny by 92.6%.Compared with the pruning algorithm based on network lightweight method, the proposed algorithm can better improve the detection accuracy and the generalization performance of the pruned model.

Key words: Convolutional Neural Network(CNN), sensitivity, integrated pruning algorithm, YOLO network, importance evaluation

摘要: 深层卷积神经网络所需的计算量和存储空间严重制约了其在资源有限平台上的应用与部署。针对基于单一参数重要性评价或者特征重建的剪枝算法泛化能力较差的问题,提出基于敏感度的集成剪枝算法,利用BN层的缩放因子稀疏YOLO网络中卷积核个数较多的冗余层,结合3种参数重要性评价方法对卷积核做重要性排序,并根据敏感度确定每一层的剪枝比率。实验结果表明,该剪枝算法对于YOLOv3和YOLOv3-tiny网络分别缩减80.5%和92.6%的参数量,并且相比基于网络轻量化方法的剪枝算法提升了网络模型压缩后的检测精度和泛化能力。

关键词: 卷积神经网络, 敏感度, 集成剪枝算法, YOLO网络, 重要性评价

CLC Number: