作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2024, Vol. 50 ›› Issue (3): 277-289. doi: 10.19678/j.issn.1000-3428.0067416

• 开发研究与工程应用 • 上一篇    下一篇

基于SwinT-YOLOX模型的自动扶梯行人安全检测算法

侯颖1,2,*(), 杨林1, 胡鑫1, 贺顺1,2, 宋婉莹1,2, 赵谦1,2   

  1. 1. 西安科技大学通信与信息工程学院, 陕西 西安 710054
    2. 西安科技大学西安市网络融合通信重点实验室, 陕西 西安 710054
  • 收稿日期:2023-04-16 出版日期:2024-03-15 发布日期:2023-08-09
  • 通讯作者: 侯颖
  • 基金资助:
    国家自然科学基金(62071481); 国家自然科学基金(61901358); 陕西省科技厅工业攻关项目(2022GY-115)

Automatic Escalator Pedestrian Safety Detection Algorithm Based on SwinT-YOLOX Model

Ying HOU1,2,*(), Lin YANG1, Xin HU1, Shun HE1,2, Wanying SONG1,2, Qian ZHAO1,2   

  1. 1. College of Communication and Information Engineering, Xi'an University of Science and Technology, Xi'an 710054, Shannxi, China
    2. Xi'an Laboratory of Network Convergence Communication, Xi'an University of Science and Technology, Xi'an 710054, Shannxi, China
  • Received:2023-04-16 Online:2024-03-15 Published:2023-08-09
  • Contact: Ying HOU

摘要:

自动扶梯被广泛应用在公共场合,乘客摔倒事故如果不能被及时发现并处理,会造成严重的人身伤害,因此实现自动扶梯智能化监控管理势在必行。受自动扶梯运行环境复杂、行人多以及局部遮挡情况的影响,传统的人体姿态特征摔倒检测模型效果不佳且检测速度减慢。融合Swin Transformer和YOLOX目标检测算法的优秀策略,提出一种基于SwinT-YOLOX网络模型的自动扶梯行人摔倒检测算法。采用Swin Transformer模型作为骨干网络,颈部网络使用添加注意力机制的YOLOX模型,进一步提升特征图的多样性和表达能力。此外,利用漏斗修正线性单元视觉激活函数构建CBF模块,改进颈部网络和Head网络结构,从而获得更优的特征检测性能。实验结果表明,针对自建扶梯行人摔倒数据库和网络采集实际扶梯行人摔倒事故,与AlphaPose、OpenPose、YOLOv5等算法相比,该算法检测性能明显提高,行人摔倒平均检测精度可以达到95.92%,检测帧率为24.08帧/s,能够快速、精准地检测到乘客摔倒事故发生,监控管理平台立刻采取安全急停措施以保证乘客安全。

关键词: 自动扶梯, 摔倒检测, 深度学习, YOLOX模型, Swin Transformer模型, 漏斗修正线性单元视觉激活函数

Abstract:

Escalators are widely used in public places. If passenger fall accidents cannot be detected and handled in a timely manner, they will cause serious personal injury. Therefore, it is imperative to achieve intelligent monitoring and management of escalators. Owing to the complex operating environment, large number of pedestrians, and local occlusion of escalators, traditional human posture feature fall detection models have poor performance and slow detection speed. A pedestrian fall detection algorithm for escalators is proposed based on the SwinT-YOLOX network model, which combines the excellent strategy of the Swin Transformer and YOLOX object detection algorithms. Adopting the Swin Transformer model as the backbone network, the neck network uses the YOLOX model with an added attention mechanism to further enhance the diversity and expression ability of feature maps. In addition, utilizing the Funnel Rectified Linear Unit (FReLU) visual activation function to construct a CBF module improves the structure of the neck and Head networks, thereby achieving better feature detection performance. The experimental results demonstrate that compared with algorithms such as AlphaPose, OpenPose, and YOLOv5, the detection performance of this algorithm is significantly improved for self-built escalator pedestrian fall databases and network collection of actual escalator pedestrian fall accidents. The average detection accuracy of pedestrian falls can reach 95.92%, with a detection frame rate of 24.08 frames/s, which can quickly and accurately detect the occurrence of passenger fall accidents. The monitoring management platform immediately takes safety emergency stop measures to ensure passenger safety.

Key words: automatic escalator, fall detection, deep learning, YOLOX model, Swin Transformer model, Funnel Rectified Linear Unit(FReLU) visual activation function