作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2022, Vol. 48 ›› Issue (7): 42-50. doi: 10.19678/j.issn.1000-3428.0062231

• 热点与综述 • 上一篇    下一篇

多尺度特征融合的轻量化口罩佩戴检测算法

叶茂, 马杰, 王倩, 武麟   

  1. 河北工业大学 电子信息工程学院, 天津 300401
  • 收稿日期:2021-08-01 修回日期:2021-09-16 出版日期:2022-07-15 发布日期:2022-07-12
  • 作者简介:叶茂(1997—),女,硕士研究生,主研方向为计算机视觉;马杰(通信作者),教授、博士;王倩、武麟,硕士研究生。
  • 基金资助:
    河北省自然科学基金(F2020202045)。

Lightweight Mask-Wearing Detection Algorithm with Multi-Scale Feature Fusion

YE Mao, MA Jie, WANG Qian, WU Lin   

  1. School of Electronic Information Engineering, Hebei University of Technology, Tianjin 300401, China
  • Received:2021-08-01 Revised:2021-09-16 Online:2022-07-15 Published:2022-07-12

摘要: 科学规范地佩戴口罩是预防新冠、流感等呼吸道传染病的有效方法,在当前疫情形势下,正确佩戴口罩显得尤为重要。已有的口罩佩戴检测算法多数存在结构复杂、训练难度较高和特征提取不足等问题,为此,提出一种多尺度特征融合的轻量化口罩佩戴检测算法L-MFFN-YOLO。以YOLOv4-Tiny网络为基础,L-MFFN-YOLO改进原始残差结构,使用轻量化残差模块促进模型快速收敛,在有效降低模型计算量的同时保证检测精度。在原网络这2个尺度的基础上增加特征分支,以增强低特征层的信息表达能力并降低小目标的漏检率。通过多层级交叉融合结构最大程度地提取有用信息,从而提高特征利用率。除佩戴和未佩戴口罩2种情况外,在数据集中新增口罩佩戴不正确的类别并进行手工标注,实验结果表明,L-MFFN-YOLO算法的模型大小仅为5.8 MB,较原始网络YOLOv4-Tiny,其模型规模减小76%,mAP提高5.25个百分点,CPU下的处理时间快14 ms,能在资源受限的设备中满足口罩佩戴检测任务对准确率和实时性的要求。

关键词: 口罩佩戴检测, 轻量化检测算法, 残差结构, 低特征层, 多层级交叉融合

Abstract: Standardized usage of face masks is effective as a non-pharmaceutical intervention to prevent the spread of infectious respiratory diseases, such as COVID-19 and influenza.In the current epidemic situation, wearing face masks correctly is especially important.Most existing mask-wearing detection algorithms involve problems such as complex structures, high training difficulty, and insufficient feature extraction.Therefore, this study proposes a lightweight mask-wearing detection algorithm based on multi-scale feature fusion and the YOLOv4-Tiny network, called L-MFFN-YOLO.L-MFFN-YOLO improves on the original residual structure and uses a lightweight residual module to promote rapid convergence.Moreover, it reduces the computational load while ensuring detection accuracy. Based on the original network's 13×13 and 26×26 feature maps, 52×52 feature branches are added to enhance the ability of the lower feature layer to express information and reduce the false negative rate for small targets.On this basis, a Multi-level Cross Fusion(MCF) structure is used to maximally extract useful information so as to improve feature utilization.In addition to detecting mask-wearing, a category of masks worn incorrectly is added to the dataset and manually labeled.The experimental results show that the size of the proposed L-MFFN-YOLO model is only 5.8 MB, which is 76% smaller than that of the original YOLOv4-Tiny.Moreover, the mean Average Precision(mAP) of the proposed approach is 5.25 percentage points higher, and its processing time is 14 ms faster on an equivalent CPU.These results demonstrate that the proposed approach can meet the requirements of accuracy and real-time operation in resource-constrained devices to detect faces wearing masks.

Key words: mask-wearing detection, lightweight detection algorithm, residual structure, lower characteristic layer, Multi-level Cross Fusion(MCF)

中图分类号: