Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering ›› 2022, Vol. 48 ›› Issue (4): 39-49. doi: 10.19678/j.issn.1000-3428.0061502

• Research Hotspots and Reviews • Previous Articles     Next Articles

Research on Mask Wearing Detection Method Based on YOLOv5 Enhancement Model

PENG Cheng1,2, ZHANG Qiaohong1, TANG Zhaohui2, GUI Weihua2   

  1. 1. School of Computer Science, Hunan University of Technology, Zhuzhou, Hunan 412007, China;
    2. School of Automation, Central South University, Changsha 410083, China
  • Received:2021-04-28 Revised:2021-06-18 Published:2021-07-15

基于YOLOv5增强模型的口罩佩戴检测方法研究

彭成1,2, 张乔虹1, 唐朝晖2, 桂卫华2   

  1. 1. 湖南工业大学 计算机学院, 湖南株洲 412007;
    2. 中南大学 自动化学院, 长沙 410083
  • 作者简介:彭成(1982—),男,副教授、博士、博士后,主研方向为工业大数据分析;张乔虹(通信作者),学士;唐朝晖,教授、博士生导师;桂卫华,教授、博士生导师、中国工程院院士。
  • 基金资助:
    国家自然科学基金面上项目(61871432,61771492);湖南省自然科学基金(2020JJ4275,2019JJ6008,2019JJ60054);湖南省大学生研究性学习和创新性实验计划项目(S201911535027)。

Abstract: Face mask wearing detection is a very important measure in epidemic prevention and control in public places.Intelligent and efficient detection of mask wearing is of great significance in realizing the automation and digitization of epidemic prevention and control.Convolution deep neural networks are feasible for end-to-end face mask wearing detection, but convolution deep neural networks have a complex structure, large number of parameters, and floating-point calculation, resulting in high computation overhead and memory requirements, which greatly limit their application in terminal devices with limited resources.To facilitate the face mask wearing supervision function and realize model compression for accelerated detection under multi-scale conditions, a lightweight enhanced network model based on improved YOLOv5 is proposed.Design GhostBottleneckCSP and ShuffleConv modules with fewer parameters and less computation replace C3 and some Conv modules in the original YOLOv5 network, to reduce the computation overload in the process of feature channel fusion and enhance the ability of feature expression.The experimental results show that the recognition accuracy of the model is more than 95%.On the premise of almost no loss of accuracy, the number of parameters and calculation load of the model are only 34.24% and 33.54% of the original YOLOv5 network, respectively, while the running speed on GPU and CPU is increased by 13.64% and 28.25% respectively, thereby reducing the requirements of the model for memory storage and computing power, making it more suitable for deployment on mobile terminals with limited resources.

Key words: deep learning, mask wearing detection, YOLOv5 network, GhostBottleneckCSP module, ShuffleConv module

摘要: 人脸口罩佩戴检测是公共场所疫情防控中极为重要的措施,智能、高效地检测口罩佩戴情况对实现疫情防控的自动化和数字化具有重要意义。使用卷积类深度神经网络实现端到端的人脸口罩佩戴检测具有可行性,但卷积类神经网络具有结构复杂、参数量和浮点计算量庞大的特点,从而产生较高的计算开销和内存需求,极大地限制了其在资源有限的终端设备上的应用。为了使人脸口罩佩戴监督功能更易获取,并实现多尺度条件下的模型压缩和加速检测,提出一种基于改进YOLOv5的轻量化增强网络模型。设计参数量和计算量更小的GhostBottleneckCSP和ShuffleConv模块并替换原YOLOv5网络中的C3及部分Conv模块,以降低特征通道融合过程中的计算量并增强特征表达能力。实验结果表明,该模型的识别精度达95%以上,模型在精度近乎无损失的前提下,参数量和计算量分别仅为原YOLOv5网络的34.24%和33.54%,且在GPU和CPU上的运行速度分别提升13.64%和28.25%,降低了模型对内存存储及计算能力的要求,更适用于在资源有限的移动端部署。

关键词: 深度学习, 口罩佩戴检测, YOLOv5网络, GhostBottleneckCSP模块, ShuffleConv模块

CLC Number: