作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2022, Vol. 48 ›› Issue (8): 206-214. doi: 10.19678/j.issn.1000-3428.0062216

• 图形图像处理 • 上一篇    下一篇

轻量化的YOLOv4目标检测算法

张宝朋, 康谦泽, 李佳萌, 郭俊宇, 陈少华   

  1. 大连交通大学 计算机与通信工程学院, 辽宁 大连 116028
  • 收稿日期:2021-07-30 修回日期:2021-09-30 发布日期:2021-10-11
  • 作者简介:张宝朋(1995-),男,硕士研究生,主研方向为FPGA开发、车辆信息控制、目标检测;康谦泽、李佳萌、郭俊宇,硕士研究生;陈少华,教授、博士。
  • 基金资助:
    辽宁省自然科学基金面上项目“高速列车无线健康管理通信系统关键技术研究”(2021-MS-298);辽宁省教育厅科学研究项目“列控系统故障诊断和预警机制的研究”(JDL2020006)。

Lightweight YOLOv4 Target Detection Algorithm

ZHANG Baopeng, KANG Qianze, LI Jiameng, GUO Junyu, CHEN Shaohua   

  1. School of Computer and Communication Engineering, Dalian Jiaotong University, Dalian, Liaoning 116028, China
  • Received:2021-07-30 Revised:2021-09-30 Published:2021-10-11

摘要: YOLOv4目标检测算法主干网络庞大且参数量和计算量过多,难以部署在算力和存储资源有限的移动端嵌入式设备上。提出一种改进的YOLOv4目标检测算法,使用轻量化的ShuffleNet V2网络作为主干特征提取网络,更换模型激活函数及扩大卷积核,同时将YOLOv4网络中的普通卷积替换为深度可分离卷积,降低算法参数量、计算量和模型占用空间。在ShuffleNet V2网络结构的改进过程中分析并剪裁其基本组件,利用2个3 × 3卷积核级联的方式增强网络感受野,并使用Mish激活函数进一步提升网络检测精度和模型推理速度。在GPU平台和VisDrone 2020数据集上的实验结果表明,与YOLOv4算法相比,改进的YOLOv4算法在牺牲1.8个百分点的检测精度情况下,提高了27%的检测速度,压缩了23.7%的模型容量,并且能够充分发挥ZYNQ平台并行高速数据处理及低功耗的优势。

关键词: YOLOv4目标检测, ShuffleNet V2网络模型, 卷积运算, 轻量化网络, ZYNQ平台

Abstract: The YOLOv4 algorithm has a large backbone network and involves a large amount of parameters, due to which it is difficult to use the algorithmfor mobile embedded devices with limited computing power and storage.Aiming at solving these disadvantages, this study proposes an improved YOLOv4 target detection algorithm.It uses the lightweight ShuffleNetV2 network as the backbone feature extraction network, replaces the model activation function, and expands the convolution kernel.Moreover, the ordinary convolutions in the YOLOv4 network are replaced with depthwise separable convolutions, which reduce the amount of algorithm parameters and computations, and the size of the network model.Using the improved ShuffleNetV2 network structure, the basic components of the network structure are analyzed and clipped, the network receptive field is improved by cascading two 3×3 convolution cores, and the Mish activation function is used to further improve the network accuracy and speed of model reasoning.The experimental results on the GPU platform and the VisDrone 2020 dataset show that compared with the YOLOv4 algorithm, the proposed YOLOv4 algorithm improves the detection speed by 27% and compresses the algorithm capacity by 23.7%;however, detection accuracy is reduced by 1.8 percentage points.At the same time, the proposed algorithm can give full play to the advantages of parallel high-speed data processing and low power consumption of the ZYNQ platform.

Key words: YOLOv4 target detection, ShuffleNet V2 network model, convolution operation, lightweight network, ZYNQ platform

中图分类号: