作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2023, Vol. 49 ›› Issue (12): 252-261. doi: 10.19678/j.issn.1000-3428.0067820

• 开发研究与工程应用 • 上一篇    下一篇

基于改进YOLOX-m的安全帽佩戴检测

王晓龙1, 江波2   

  1. 1. 上海华讯网络系统有限公司 行业数智事业部, 上海 200127
    2. 中国电子科技集团公司第三十二研究所, 上海 201808
  • 收稿日期:2023-06-08 出版日期:2023-12-15 发布日期:2023-12-14
  • 作者简介:

    王晓龙(1980—),男,高级工程师、硕士,主研方向为智能信息处理、数据建模和分析

    江波,研究员、博士生导师

Safety Helmet Wearing Detection Based on Improved YOLOX-m

Xiaolong WANG1, Bo JIANG2   

  1. 1. Industry Digital Intelligence Division, ECCOM Network System Co., Ltd., Shanghai 200127, China
    2. The 32nd Research Institute of China Electronics Technology Group Corporation, Shanghai 201808, China
  • Received:2023-06-08 Online:2023-12-15 Published:2023-12-14

摘要:

安全帽佩戴检测是安全监控系统中的重要组成部分,其检测精度取决于目标分类、小目标检测、域迁移差异等因素。针对现有基于YOLOX-m模型的安全帽佩戴检测算法通常存在分类精度较低、检测目标不完整、轻量化模型性能下降等问题,构建一种基于多阶段网络训练策略的改进YOLOX-m模型。首先对YOLOX-m主干特征网络卷积块的堆叠次数进行重新设计,在减小网络规模的同时最大化模型性能,然后将残差化重参视觉几何组与快速空间金字塔池化相结合,提高检测精度和推理速度。设计一种多阶段网络训练策略,将训练集和测试集拆分成多个组,并结合推理阶段生成的伪标签进行多次网络训练,以减少域迁移差异,获得更高的检测精度。实验结果表明,与YOLOX-m模型相比,改进YOLOX-m模型的推理延迟降低了5 ms,模型大小减少了4.7 MB,检测精度提高了1.26个百分点。

关键词: 安全帽佩戴检测, 深度学习, 残差化重参视觉几何组, 快速空间金字塔池化, 多阶段网络训练策略

Abstract:

The safety helmet wearing detection is a crucial part of the security monitoring system. Its precision depends on object classification, small-object detection, domain transfer discrepancy, and other factors. Existing algorithms based on YOLOX-m for safety helmet wearing detection have drawbacks of reduced classification precision, incomplete detection targets, and degraded performance of lightweight models. An improved YOLOX-m model based on a multi-stage network training strategy is proposed to solve these problems. First, the number of stacks of convolution blocks of the YOLOX-m backbone feature network is redesigned to maximize the performance of the model while reducing the network. Next, the Residual Re-parameterized Visual Geometry Group(Res-RepVGG) is combined with Spatial Pyramid Pooling-Fast(SPPF) to improve the detection accuracy and reasoning speed. In addition, a multi-stage network training strategy is proposed, which divides the training and test sets into multiple groups and combines the pseudo labels generated in the inference stage for multiple network training to reduce the domain transfer difference and improve the detection accuracy. The experimental results show that compared with YOLOX-m, the improved YOLOX-m exhibits improved performance in helmet wearing detection in three aspects: the delay is reduced by 5 ms, the model size is reduced by 4.7 MB, and the average accuracy is improved by 1.26 percentage points.

Key words: safety helmet wearing detection, deep learning, Residual Re-parameterized Visual Geometry Group(Res-RepVGG), Spatial Pyramid Pooling-Fast(SPPF), multi-stage network training strategy