作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2023, Vol. 49 ›› Issue (9): 313-320. doi: 10.19678/j.issn.1000-3428.0065697

• 开发研究与工程应用 • 上一篇    

监控视角下密集人群口罩佩戴检测算法

孙龙, 张荣芬*, 刘宇红, 饶庭漓   

  1. 贵州大学 大数据与信息工程学院, 贵阳 550025
  • 收稿日期:2022-09-07 出版日期:2023-09-15 发布日期:2022-12-07
  • 通讯作者: 张荣芬
  • 作者简介:

    孙龙(1999—),男,硕士研究生,主研方向为目标检测、智能安防监控

    刘宇红,教授

    饶庭漓,硕士研究生

  • 基金资助:
    贵州省科学技术基金(黔科合基础-ZK[2021]重点001)

Mask Wearing Detection Algorithm for Dense Crowds from a Monitoring Perspective

Long SUN, Rongfen ZHANG*, Yuhong LIU, Tingli RAO   

  1. College of Big Data and Information Engineering, Guizhou University, Guiyang 550025, China
  • Received:2022-09-07 Online:2023-09-15 Published:2022-12-07
  • Contact: Rongfen ZHANG

摘要:

针对密集人群场景的口罩佩戴检测面临着监控视角下目标密集、互相遮挡、目标小、人脸透视畸变等难题, 同时,涵盖不规范佩戴口罩场景的公开数据集也较为缺乏。提出一种基于YOLO-v5改进的监控视角下密集人群口罩佩戴检测算法MDDC-YOLO。利用空洞卷积构造多分支感受野模块MRF-C3替换YOLO-v5中常规C3模块,解决密集人群中小目标占比大的问题。使用Repulsion Loss基于样本边界框排斥吸引的原则提高模型抗遮挡能力,并充分利用训练过程中的遮挡正样本。在此基础上, 引入ECA注意力机制进行特征通道最优化选择,并提出基于透视变换的离线数据增强方法, 结合使用更适用于生成更多小目标样本的Mosaic-9数据增强方法,解决监控视角下密集人群口罩佩戴数据集缺乏的问题。实验结果表明,MDDC-YOLO算法相较于YOLO-v5算法mAP提升6.5个百分点,并达到32帧/s的检测速度,满足密集人群口罩佩戴检测的应用需求。

关键词: 口罩佩戴检测, 密集人群, 多分支空洞卷积, ECA注意力机制, Repulsion Loss

Abstract:

In dense crowds scenario, dense targets under the monitoring perspective, mutual occlusion, small targets, and face perspective distortion cause problems in mask wearing detection. Meanwhile, public datasets covering incorrectly worn masks are also lacking. Therefore, this paper proposes a mask wearing detection algorithm from a monitoring perspective, MDDC-YOLO, based on the YOLO-v5 improvement. In view of the large proportion of small- and medium-sized targets in dense population, the conventional C3 module in YOLO-v5 is replaced with the MRF-C3 module of the atrous convolutional structure. The anti-occlusion ability of the model is also improved by using Repulsion Loss based on the principle of repulsion attraction of the sample bounding box, and the masking positive sample is fully utilized during the training process. An Efficient Channel Attention(ECA) mechanism is further introduced for optimal selection of feature channels. Finally, to address the lack of mask wearing data in the crowd from a monitoring perspective, an offline data enhancement method based on perspective transformation is proposed. The proposed Mosaic-9 data enhancement generates additional small target samples to address this problem. The experimental results show that the MDDC-YOLO algorithm provides 6.5 percentage points mAP improvement compared with YOLO-v5, thereby reaching a detection speed of 32 frame/s, which satisfies the application requirements of mask-wearing detection in dense populations.

Key words: mask wearing detection, dense crowds, multi-branch atrous convolution, Efficient Channel Attention(ECA) mechanism, Repulsion Loss