基于改进YOLOv4算法的轻量化网络设计与实现

doi:10.19678/j.issn.1000-3428.0060948

计算机工程 ›› 2022, Vol. 48 ›› Issue (3): 181-188. doi: 10.19678/j.issn.1000-3428.0060948

基于改进YOLOv4算法的轻量化网络设计与实现

孔维刚¹, 李文婧², 王秋艳¹, 曹鹏程³, 宋庆增¹

1. 天津工业大学计算机科学与技术学院, 天津 300387;
2. 天津工业大学电气工程与自动化学院, 天津 300387;
3. 中国电子科技集团公司信息科学研究院, 北京 100086

收稿日期:2021-02-26 修回日期:2021-04-15 发布日期:2021-04-27
作者简介:孔维刚(1997-),男,硕士研究生,主研方向为FPGA开发、嵌入式系统开发、深度学习;李文婧,硕士研究生;王秋艳,副教授、博士;曹鹏程,博士;宋庆增(通信作者),副教授、博士。
基金资助:
国家自然科学基金（61802281，61702366）；天津市自然科学基金（18JCQNJC70300，19JCYBJC15800）；天津市教委科研计划项目（2018KJ215，2020KJ112，KYQD1817）。

Design and Implementation of Lightweight Network Based on Improved YOLOv4 Algorithm

KONG Weigang¹, LI Wenjing², WANG Qiuyan¹, CAO Pengcheng³, SONG Qingzeng¹

1. School of Computer Science and Technology, Tiangong University, Tianjin 300387, China;
2. School of Electrical Engineering and Automation, Tiangong University, Tianjin 300387, China;
3. Information Science Academy of China Electronics Technology Group Corporation, Beijing 100086, China

Received:2021-02-26 Revised:2021-04-15 Published:2021-04-27

摘要/Abstract

摘要： 在嵌入式设备上进行目标检测时易受能耗和功耗等限制，使得传统目标检测算法效果不佳。为此，对YOLOv4算法进行优化，设计YOLOv4-Mini网络结构，将其特征提取网络由CSPDarkNet53改为MobileNetv3-large并进行INT8量化处理，其中网络结构利用PW和DW卷积操作代替传统卷积操作以大幅减少计算量。采用SE模块为通道施加注意力机制，激活函数层运用h-swish非线性激活函数，在保证精度的情况下降低网络计算量。同时，通过量化感知训练将权重转为INT8类型，以实现模型轻量化，进一步降低网络参数量和计算量，从而在嵌入式设备上完成无人机数据集的目标检测任务。在NVIDIA Jetson Xavier NX设备上进行测试，结果显示，YOLOv4-MobileNetv3网络的mAP为34.3%，FPS为30，YOLOv4-Mini网络的mAP为32.5%，FPS为73，表明YOLOv4-Mini网络能够在低功耗、低能耗的嵌入式设备上完成目标实时检测任务。

关键词: 目标检测, 模型压缩, 嵌入式设备, 轻量化神经网络, 模型量化, Jetson Xavier NX设备

Abstract: Target detection using embedded devices is limited by energy and power consumption, deteriorating the performance of traditional target detection algorithms.Therefore, to address this issue, the YOLOv4 algorithm is optimized;the YOLOv4-Mini network structure is designed;the feature extraction network is changed from CSPDarkNet53 to MobileNetv3-large;and INT8 quantization processing is carried out.The network structure uses PW and DW convolution operations to replace the traditional convolution operation to greatly reduce the amount of calculation.The SE module is used to apply attention mechanism to the channel, and h-swish nonlinear activation function is used in the activation function layer to reduce the amount of network calculation while ensuring accuracy. Concurrently, the weight is transformed into INT8 type through quantitative perception training to realize the lightweight of the model and further reduce the amount of network parameters and computation, in addition to completing the target detection task of UAV data set on embedded devices.The test results on NVIDIA Jetson Xavier NX show that the mAP of YOLOv4-MobileNetv3 network is 34.3%, the FPS is 30, the mAP of YOLOv4-Mini network is 32.5%, and the FPS is 73 indicating that the YOLOv4-Mini network can complete the target real-time detection task on the embedded device with low power consumption and low energy consumption.

Key words: target detection, model compression, embedded device, lightweight neural network, model quantification, Jetson Xavier NX equipment

中图分类号:

TP391

孔维刚, 李文婧, 王秋艳, 曹鹏程, 宋庆增. 基于改进YOLOv4算法的轻量化网络设计与实现[J]. 计算机工程, 2022, 48(3): 181-188.

KONG Weigang, LI Wenjing, WANG Qiuyan, CAO Pengcheng, SONG Qingzeng. Design and Implementation of Lightweight Network Based on Improved YOLOv4 Algorithm[J]. Computer Engineering, 2022, 48(3): 181-188.

https://www.ecice06.com/CN/Y2022/V48/I3/181

图/表 11

20220331202347

20220331202350

20220331202353

20220331202356

20220331202359

20220331202402

20220331202405

20220331202408

20220331202411

20220331202416

20220331202419

参考文献

[1] GIRSHICK R, DONAHUE J, DARRELL T, et al.Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of IEEE Conference on Computer Vision & Pattern Recognition.Washington D.C., USA:IEEE Computer Society, 2014:580-587.
[2] GIRSHICK R.Fast R-CNN[C]//Proceedings of IEEE International Conference on Computer Vision.Washington D.C., USA:IEEE Press, 2015:1440-1448.
[3] REN S, HE K, GIRSHICK R, et al.Faster R-CNN:towards real-time object detection with region proposal networks[J].IEEE Transactions on Pattern Analysis & Machine Intelligence, 2017, 39(6):1137-1149.
[4] LIU W, ANGUELOV D, ERHAN D, et al.SSD:single shot multibox detector[C]//Proceedings of European Conference on Computer Vision.Berlin, Germany:Springer, 2016:21-37.
[5] REDMON J, DIVVALA S, GIRSHICK R, et al.You Only Look Once:unified, real time objecct detection[C]//Proceedings of European Conference on Computer Vision and Pattern Recognition.Berlin, Germany:Springer, 2017:6517-6525.
[6] REDMON J, FARHADI A.YOLO9000:better, faster, stronger[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.Washington D.C., USA:IEEE Press, 2017:6517-6525.
[7] REDMON J, FARHADI A.YOLOv3:an incremental improvement[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.Washington D.C., USA:IEEE Press, 2018:1804-1823.
[8] BOCHKOVSKIY A, WANG C Y, LIAO H Y M.YOLOv4:optimal speed and accuracy of object detection[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.Washington D.C., USA:IEEE Press, 2020:102-123.
[9] HOWARD A, SANDLER M, CHU G, et al.Searching for MobileNetv3[EB/OL].[2021-01-05].https://arxiv.org/pdf/1905.02244v3.pdf.
[10] ZHANG Z, HE T, ZHANG H, et al.Bag of freebies for training object detection neural networks[EB/OL].(2019-04-12)[2021-01-05].https://arxiv.org/abs/1902.04103.
[11] JIE H, LI S, GANG S.Squeeze-and-excitation networks[EB/OL].[2021-01-05].https://arxiv.org/pdf/1709.01507.pdf.
[12] HE K, ZHANG X, REN S, et al.Spatial pyramid pooling in deep convolutional networks for visual recognition[J].IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(9):1904-1916.
[13] HOWARD A G, ZHU M, CHEN B, et al.MobileNets:efficient convolutional neural networks for mobile vision applications[EB/OL].[2021-01-05].https://arxiv.org/pdf/1704.04861.pdf.
[14] SANDLER M, HOWARD A, ZHU M, et al.MobileNetv2:inverted residuals and linear bottlenecks[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.Washington D.C., USA:IEEE Press, 2018:4510-4520.
[15] JACOB B, KLIGYS S, CHEN B, et al.Quantization and training of neural networks for efficient integer-arithmetic-only inference[EB/OL].[2021-01-05].https://arxiv.org/pdf/1712.05877.pdf.
[16] CHEN Y H, EMER J, SZE V.Eyeriss:a spatial architecture for energy-efficient dataflow for convolutional neural networks[EB/OL].[2021-01-05].https://www.rle.mit.edu/eems/wp-content/uploads/2016/04/eyeriss_isca_2016.pdf.
[17] JOUPPI N P, YOUNG C, PATIL N, et al.In-datacenter performance analysis of a tensor processing unit[C]//Proceedings of 2017 ACM/IEEE Annual International Symposium on Computer Architecture.Washington D.C., USA:IEEE Press, 2017:1-12.
[18] JOUPPI N, YOUNG C, PATIL N, et al.Motivation for and evaluation of the first tensor processing unit[J].IEEE Micro, 2018, 38(3):10-19.
[19] MISRA D.Mish:a self regularized non-monotonic neural activation function[EB/OL].[2021-01-05].https://arxiv.org/pdf/1908.08681.pdf.
[20] SERRA T, KUMAR A, RAMALINGAM S.Scaling up exact neural network compression by ReLU stability[EB/OL].[2021-01-05].https://arxiv.org/pdf/2102.07804.pdf.
[21] ZHANG S H, LI X L, ZONG M, et al.Learning k for kNN classification[J].ACM Transactions on Intelligent Systems and Technology, 2017, 8(8):1-19.

选择文件类型/文献管理软件名称

选择包含的内容

基于改进YOLOv4算法的轻量化网络设计与实现

Design and Implementation of Lightweight Network Based on Improved YOLOv4 Algorithm

RichHTML

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

图/表 11

参考文献

相关文章 15

编辑推荐

Metrics

本文评价

[1]	张天鹏, 韩晶, 吕学强. 基于多任务学习的超分辨率辅助小目标检测[J]. 计算机工程, 2024, 50(9): 304-312.
[2]	曾钰琦, 刘博, 钟柏昌, 钟瑾. 智慧教育下基于改进YOLOv8的学生课堂行为检测算法[J]. 计算机工程, 2024, 50(9): 344-355.
[3]	饶日昕, 王怡文, 曾砺志, 童心恬, 赵海涛. 面向废旧电缆检测的轻量化网络模型[J]. 计算机工程, 2024, 50(8): 22-30.
[4]	王昱婷, 刘志明, 万亚平, 朱涛. 基于可见光与红外图像的弱光条件下目标检测[J]. 计算机工程, 2024, 50(8): 270-281.
[5]	贵向泉, 刘世清, 李立, 秦庆松, 李唐艳. 基于改进YOLOv8的景区行人检测算法[J]. 计算机工程, 2024, 50(7): 342-351.
[6]	孙帮勇, 马铭, 于涛. 基于区域特征强化的多尺度伪装目标检测方法[J]. 计算机工程, 2024, 50(5): 209-219.
[7]	刘仕兵, 周诗涵. 高铁接触网绝缘子检测算法研究[J]. 计算机工程, 2024, 50(5): 200-208.
[8]	赵继达, 甄国涌, 储成群. 基于YOLOv8的无人机图像目标检测算法[J]. 计算机工程, 2024, 50(4): 113-120.
[9]	崔丽群, 曹华维. 基于改进YOLOv5的遥感图像目标检测[J]. 计算机工程, 2024, 50(4): 228-236.
[10]	周金涛, 高迪驹, 刘志全. 基于全景视觉的无人船水面障碍物检测方法[J]. 计算机工程, 2024, 50(2): 113-121.
[11]	王非凡, 陈希爱, 任卫红, 管宇, 韩志, 唐延东. 基于图像自适应增强的低照度目标检测算法[J]. 计算机工程, 2024, 50(10): 352-361.
[12]	兰红, 王惠钊. 结合轻量化与多尺度融合的交通标志检测算法[J]. 计算机工程, 2024, 50(10): 381-392.
[13]	王子豪, 方成, 李丽萍, 鹿存跃. 基于热力图预测的免“锚框”人物目标检测算法[J]. 计算机工程, 2024, 50(10): 51-60.
[14]	袁昊, 葛海波, 辛世澳, 胥冬梅, 杨雨迪. 基于深度纹理特征的伪装目标边缘细化检测[J]. 计算机工程, 2024, 50(10): 89-99.
[15]	罗偲, 李凯扬, 吴吉花, 任鹏. 基于对抗注意力机制的水下遮挡目标检测算法[J]. 计算机工程, 2024, 50(10): 313-321.

模态框（Modal）标题

选择文件类型/文献管理软件名称

选择包含的内容

基于改进YOLOv4算法的轻量化网络设计与实现

Design and Implementation of Lightweight Network Based on Improved YOLOv4 Algorithm

RichHTML

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

图/表 11

参考文献

相关文章 15

编辑推荐

Metrics

本文评价