作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2022, Vol. 48 ›› Issue (10): 306-312. doi: 10.19678/j.issn.1000-3428.0063188

• 开发研究与工程应用 • 上一篇    下一篇

基于新型算子采样优化的交通标志检测网络

陈春辉, 马社祥   

  1. 天津理工大学 集成电路科学与工程学院, 天津 300384
  • 收稿日期:2021-11-11 修回日期:2021-12-14 发布日期:2021-12-15
  • 作者简介:陈春辉(1997—),男,硕士研究生,主研方向为计算机视觉、深度学习;马社祥,教授、博士。
  • 基金资助:
    国家自然科学基金(61371108,61601326)。

Traffic Sign Detection Network Based on New Operator Sampling Optimization

CHEN Chunhui, MA Shexiang   

  1. School of Integrated Circuit Science and Engineering, Tianjin University of Technology, Tianjin 300384, China
  • Received:2021-11-11 Revised:2021-12-14 Published:2021-12-15

摘要: 传统基于卷积神经网络的交通标志检测网络采用堆叠大量卷积核的方式进行下采样,限制了卷积神经网络的感受野建模,难以灵活地调整内部参数,从而丢失图像的细节信息,导致小目标与遮挡目标的检测精度与定位精度降低。提出基于YOLOv5采样优化的交通标志检测网络。以新型算子作为基础架构,采用自卷积方式灵活提取不同通道的特征,并构建跨阶段注意力机制模块,以增加各通道特征的重要性权值,从而提高小目标的检测能力。通过改进的通道聚合网络实现多尺度语义信息与细节特征的融合与增强,同时利用K-means聚类算法生成更适合交通标志的先验框,在非极大值抑制算法中引入距离交并比函数对预测框进行后处理,避免错误抑制复杂场景下被遮挡的目标,从而提高定位精度。在中国交通标志数据集上的实验结果表明,当交并比阈值为0.5时,该网络的平均精度均值为95.8%,与YOLOv5网络相比模型参数量减少了15.7%,在满足实时性的同时具有较优的小目标检测性能。

关键词: 交通标志检测, 特征融合, 自注意力算子, 小目标, 注意力机制

Abstract: The traditional traffic sign detection network based on the Convolutional Neural Network (CNN) adopts the method of stacking many convolution kernels for downsampling.This method limits the receptive field modeling of the CNN, making it difficult to flexibly adjust the internal parameters.Thus, the details of the image are lost, resulting in decreased detection accuracy of small and occluded targets.This study proposes a traffic sign detection network based on YOLOv5 sampling optimization.With the new operator as the basic framework, the features of different channels are flexibly extracted via self-convolution.A cross stage attention mechanism module is constructed to increase the importance weight of the features of each channel to improve the detection ability for small targets.The fusion and enhancement of multi-scale semantic information and detailed features are realized through the improved Path Aggregation Network(PAN).In addition, the K-means clustering algorithm is used to generate a suitable a priori frame for traffic signs.The Distance Intersection Over Union (DIOU) function is introduced in the Non-Maximum Suppression (NMS) algorithm to postprocess the prediction frame to avoid erroneously suppressing the occluded targets in complex scenes, thereby improving the positioning accuracy.The experimental results for the Chinese traffic sign dataset show that when the Intersection Over Union(IOU) threshold is 0.5, the mean Average Precision(mAP) of the proposed network is 95.8%. Compared with the YOLOv5 network, the model parameters decreased by 15.7%, exhibiting improved small target detection performance while satisfying real-time requirements.

Key words: traffic sign detection, feature fusion, self-attention operator, small target, attention mechanism

中图分类号: