作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2023, Vol. 49 ›› Issue (12): 194-204. doi: 10.19678/j.issn.1000-3428.0066520

• 图形图像处理 • 上一篇    下一篇

基于CGS-Ghost YOLO的交通标志检测研究

赵宏, 冯宇博   

  1. 兰州理工大学 计算机与通信学院, 兰州 730050
  • 收稿日期:2022-12-14 出版日期:2023-12-15 发布日期:2023-12-11
  • 作者简介:

    赵宏(1971—),男,教授、博士、博士生导师,主研方向为计算机视觉、自然语言处理、深度学习

    冯宇博,硕士研究生

  • 基金资助:
    国家自然科学基金(62166025); 甘肃省重点研发计划(21YF5GA073)

Research on Traffic Sign Detection Based on CGS-Ghost YOLO

Hong ZHAO, Yubo FENG   

  1. School of Computer and Communication, Lanzhou University of Technology, Lanzhou 730050, China
  • Received:2022-12-14 Online:2023-12-15 Published:2023-12-11

摘要:

在交通标志检测任务中,YOLOv5检测算法在复杂的环境和路况下存在漏检、错检及模型参数量过大等问题。为此,提出一种改进的CGS-Ghost YOLO检测模型。YOLOv5在图片输入后使用Focus模块进行下采样,增加较多参数,CGS-Ghost YOLO模型使用StemBlock模块替换Focus模块进行采样,能够在维持精度的同时减少参数,并通过引入坐标注意力机制,强化特征中的语义信息和位置信息,提高模型的特征提取能力。设计SMU激活函数与组归一化相结合的CGS卷积模块,避免训练过程中Batch Size大小对模型所造成的影响,在使用GhostConv减少模型参数的同时,提升模型的检测精度。在此基础上,通过$ \alpha $-CIoU Loss+VFocal Loss损失函数,改善交通标志检测任务中正负样本不平衡的问题,提升模型整体性能,Neck部分使用Bi-FPN双向特征金字塔网络,实现检测目标多尺度特征的有效融合。实验结果表明,改进的CGS-Ghost YOLO模型在交通标志检测数据集TT100K中的平均精度均值达到93.1%,相较于原始模型提高了11.3个百分点,模型参数量相较于原始模型降低了21.2个百分点。此外,该网络模型优化了卷积层及下采样部分,在大幅减少模型参数的同时提高了模型检测精度。

关键词: 深度学习, 目标检测, YOLOv5检测算法, 注意力机制, CGS Conv模块

Abstract:

In tasks involving traffic sign detection, the YOLOv5 detection algorithm encounters several issues including missed detections, erroneous detections, and a complex model in complex environments and road conditions. To address these challenges, an improved CGS-Ghost YOLO detection model is proposed. YOLOv5 uses the focus module for sampling, which introduces more parameters. In this study, the StemBlock module is used to replace the focus module for sampling after input, which can reduce the number of parameters while maintaining the accuracy. CGS-Ghost YOLO uses a Coordinate Attention(CA) mechanism, which improves the semantic and location information within the features and enhances the feature extraction ability of the model. Additionally, a CGS convolution module, which combines the SMU activation function with GroupNorm(GN) normalization, is proposed. The CGS convolution module is designed to avoid the influence of the batch Size on the model during training and improve model performance. This study aims to use GhostConv to reduce the number of model parameters and effectively improve the detection accuracy of the model.The loss function, $ \alpha $-CIoU Loss+VFocal Loss, is used to solve the problem of unbalanced positive and negative samples in traffic sign detection tasks and improve the overall performance of the model. The neck part uses a Bi-FPN bidirectional feature pyramid network, ensuring that the multi-scale features of the detection target are effectively fused. The results of an experiment on the TT100K traffic sign detection dataset show that the detection accuracy of the improved CGS-Ghost YOLO model reaches 93.1%, which is 11.3 percentage points higher than the accuracy achieved by the original model. Additionally, the proposed network model reduces the model parameter quantity by 21.2 percentage points compared to the original model. In summary, the network model proposed in this study optimizes the convolution layer and the downsampling part, thus considerably reducing the model parameters while enhancing the model detection accuracy.

Key words: deep learning, object detection, YOLOv5 detection algorithm, attention mechanism, CGS Conv module