Research and Application of Lightweight Object Detection Algorithm

doi:10.19678/j.issn.1000-3428.0059168

Abstract

Abstract: The existing target detection algorithms based on convolutional neural networks have achieved a high accuracy, but the accuracy gain comes at the cost of detection speed, making it difficult for the algorithms to implement real-time detection with limited computing power.To solve this problem, a series of lightweight methods are adopted based on the YOLO target detection algorithm.The methods employ Mobilenetv1 to replace the basic network of Darknet53, and depthwise separable convolutions to replace the 3×3 standard convolutions in the YOLO head part.On this basis, the convolution layer filter is sorted and pruned according to sensitivity.Finally, C++ inference algorithms are deployed on the embedded GPU TX2 platform.The test results on the VOC data set show that the improved algorithm provides an acceleration of 2.4 times while the accuracy is reduced by only 0.75 percentage points.Additionally, the memory occupied by the improved model is only 21.5% of that occupied by the original model.

Key words: object detection, lightweight, depthwise separable convolution, pruning, embedded GPU, C++ inferred deployment

摘要： 基于卷积神经网络的目标检测算法在追求较高精度的同时，忽略了检测速度，使得算法难以在有限算力的情况下实现实时检测。在YOLO目标检测算法的基础上，采用一系列轻量化的方法，运用Mobilenetv1网络替换Darknet53基础网络，将YOLO head部分3×3标准卷积替换为深度可分离卷积，根据灵敏度对卷积层滤波器进行排序和修剪，并在嵌入式GPU TX2平台上进行C++推理部署。在VOC数据集上的测试结果表明，改进算法在精度仅下降0.75个百分点的前提下实现了2.4倍加速，模型占用内存仅为原来的21.5%。

关键词: 目标检测, 轻量化, 深度可分离卷积, 剪枝, 嵌入式GPU, C++推理部署

CLC Number:

TP391.41

HUANG Jingsong, ZUO Haorui, ZHANG Jianlin. Research and Application of Lightweight Object Detection Algorithm[J]. Computer Engineering, 2021, 47(10): 236-241.

黄靖淞, 左颢睿, 张建林. 轻量化目标检测算法研究及应用[J]. 计算机工程, 2021, 47(10): 236-241.

/ / Recommend / Download Citations

URL: http://www.ecice06.com/EN/10.19678/j.issn.1000-3428.0059168

http://www.ecice06.com/EN/Y2021/V47/I10/236

Figures/Tables 8

References

[1] 胡挺, 祝永新, 田犁, 等.面向移动平台的轻量级卷积神经网络架构[J].计算机工程, 2019, 45(1):17-22. HU T, ZHU Y X, TIAN L, et al.Lightweight convolutional neural network architecture for mobile platforms[J].Computer Engineering, 2019, 45(1):17-22.(in Chinese)
[2] REDMON J, DIVVALA S, GIRSHICK R, et al.You only look once:unified, real-time object detection[C]//Proceedings of IEEE International Conference on Computer Vision & Pattern Recognition.Washington D.C., USA:IEEE Press, 2016:259-268.
[3] REDMON J, FARHADI A.Yolo9000:better, faster, stronger[C]//Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition.Washington D.C., USA:IEEE Press, 2017:6517-6525.
[4] REDMON J, FARHADI A.Yolov3:an incremental improvement[EB/OL].[2020-07-01].https://arXivpreprintarXiv:1804.02767.
[5] GIRSHICK R.Fast R-CNN[C]//Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition.Washington D.C., USA:IEEE Press, 2015:1440-1448.
[6] HOWARD A G, ZHU M, CHEN B, et al.Mobilenets:efficient convolutional neural networks for mobile vision applications[EB/OL].[2020-07-01].https://arXivpreprintarXiv:1704.04861.
[7] SANDLER M, HOWARD A, ZHU M, et al.MobileNetV2:inverted residuals and linear bottlenecks[C]//Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition.Washington D.C., USA:IEEE Press, 2018:4510-4520.
[8] HOWARD A, SANDLER M, CHU G, et al.Searching for MobileNetV3[C]//Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition.Washington D.C., USA:IEEE Press, 2019:589-571.
[9] 周舟, 王海鹏, 徐丰, 等.基于通道剪枝的SAR图像舰船检测优化算法[J].上海航天, 2020, 37(4):48-54. ZHOU Z, WANG H P, XU F, et al.A SAR image ship detection optimization algorithm based on channel pruning[J].Shanghai Aerospace, 2020, 37(4):48-54.(in Chinese)
[10] 占浩, 朱振才, 张永合, 等.基于残差网络的图像序列闭环检测方法[J/OL].激光与光电子学进展:1-13[2020-10-11].http://kns.cnki.net/kcms/detail/31.1690.TN.20200916.1254.008.html. ZHAN H, ZHU Z C, ZHANG Y H, et al.Image sequence closed-loop detection method based on residual network[J/OL].Progress in Laser and Optoelectronics:1-13[2020-10-11].http://kns.cnki.net/kcms/detail/31.1690.TN.20200916.1254.008.html. (in Chinese)
[11] ZHAO H P, ZHOU Y, ZHANG Let al.Mixed YOLOv3-LITE:a lightweight real-time object detection method[J].Sensors, 2020, 20(7):1861-1875.
[12] 赵文清, 严海, 邵绪强.改进的非极大值抑制算法的目标检测[J].中国图象图形学报, 2018, 23(11):1676-1685. ZHAO W Q, YAN H, SHAO X Q.Target detection based on improved non-maximum suppression algorithm[J].Journal of Image and Graphics, 2018, 23(11):1676-1685.(in Chinese)
[13] 王韦祥, 周欣, 何小海, 等.基于改进MobileNet网络的人脸表情识别[J].计算机应用与软件, 2020, 37(4):137-144. WANG W X, ZHOU X, HE X H, et al.Facial expression recognition based on improved MobileNet network[J].Computer Applications and Software, 2020, 37(4):137-144.(in Chinese)
[14] 曹渝昆, 桂丽嫒.基于深度可分离卷积的轻量级时间卷积网络设计[J].计算机工程, 2020, 46(9):95-100, 109. CAO Y K, GUI L Y.Design of lightweight time convolutional network based on deep separable convolution[J].Computer Engineering, 2020, 46(9):95-100, 109.(in Chinese)
[15] 靳丽蕾, 杨文柱, 王思乐, 等.一种用于卷积神经网络压缩的混合剪枝方法[J].小型微型计算机系统, 2018, 39(12):2596-2601. JIN L L, YANG W Z, WANG S L, et al.Mixed pruning method for convolutional neural network compression[J].Journal of Chinese Computer Systems, 2018, 39(12):2596-2601.(in Chinese)
[16] LI H, KADAV A, DURDANOVIC I, et al.Pruning filters for efficient ConvNets[EB/OL].[2020-07-01].https://www.researchgate.net/publication.
[17] BOOIL O.Machine learning intelligent systems[D].[S.1.]:University of Sistan and Baluchestan, 2020.
[18] HE Y, LIU P, WANG Z, et al.Filter pruning via geometric median for deep convolutional neural networks acceleration[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.Washington D.C., USA:IEEE Press, 2019:324-336.
[19] 齐健.NVIDIA Jetson TX2平台:加速发展小型化人工智能终端[J].智能制造, 2017(5):20-21. QI J.NVIDIA Jetson TX2 platform:accelerating the development of miniaturized artificial intelligence terminals[J].Intelligent Manufacturing, 2017(5):20-21.(in Chinese)
[20] 张永合.Jetson TX2平台的最小应用系统硬件设计[J].单片机与嵌入式系统应用, 2019, 19(10):52-54. ZHANG Y H.Minimum application system hardware design of Jetson TX2 platform[J].Single-chip Microcomputer and Embedded System Applications, 2019, 19(10):52-54.(in Chinese)
[21] 马艳军, 于佃海, 吴甜, 等.飞桨:源于产业实践的开源深度学习平台[J].数据与计算发展前沿, 2019, 5(1):105-115. MA Y J, YU D H, WU T, et al.Flying paddle:an open source deep learning platform derived from industrial practice[J].Frontiers in Data and Computing Development, 2019, 5(1):105-115.(in Chinese)
[22] 韩梦梦.大型CMake类项目源码分析方法的研究与实现[D].北京:北京交通大学, 2019. HAN M M.Research and implementation of source code analysis methods for large-scale CMake projects[D].Beijing:Beijing Jiaotong University, 2019.(in Chinese)

Please choose a citation manager

Content to export