An improved dense pedestrian detection algorithm based on YOLOv8-n in complex scenes

doi:10.19678/j.issn.1000-3428.0070531

Abstract

Dense pedestrian detection is a key problem in building crowd-flow monitoring systems for large public places. To address the difficulty of detecting small targets under crowd occlusion and the need for a lightweight, deployable model, this paper proposes CAD-YOLO (CGDown-Adaptive Fusion Module-Dyhead), an improved dense pedestrian detection model based on YOLOv8-n. First, a CGDown downsampling module is embedded; its efficient context-extraction mechanism alleviates the loss of context features that conventional object detectors suffer in dense scenes, and strengthens the model's ability to capture dense pedestrian features and focus on small targets. Second, a BiFPN-Adaptive structure is designed and the neck network is rebuilt around it; by adaptively fusing feature information of different scales, the model extracts the features of occluded pedestrians and of small and medium-sized pedestrians more accurately, while the parameter count and computational cost are greatly reduced. Third, the dynamic detection head Dyhead is introduced and combined with a newly added 160×160 small-target detection layer, so that the model captures fine features in dense small-target regions more precisely and misses fewer detections in occluded scenes. Experimental results show that, compared with YOLOv8-n, CAD-YOLO improves detection accuracy by 5.1% on the CrowdHuman dataset and by 2.1% on the WiderPerson dataset. Despite this improvement, CAD-YOLO has only 2.9M parameters and a computational cost of 12.3 GFLOPs, meeting the low-power, high-accuracy requirements of deployment on edge or mobile devices.
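The adaptive multi-scale fusion described in the abstract can be illustrated with a minimal sketch. The abstract does not specify how BiFPN-Adaptive computes its fusion weights, so the code below assumes the fast normalized fusion rule from the original BiFPN design (non-negative learnable weights divided by their sum); the function name `fast_normalized_fusion`, the `eps` value, and the toy feature maps are hypothetical and are not taken from the paper.

```python
import numpy as np


def fast_normalized_fusion(features, raw_weights, eps=1e-4):
    """Fuse same-shaped feature maps with non-negative, normalized weights.

    Assumption: this follows the generic BiFPN fast normalized fusion rule,
    not necessarily the paper's BiFPN-Adaptive module.
    """
    # Clamp the (normally learnable) weights to be non-negative, like a ReLU.
    w = np.maximum(np.asarray(raw_weights, dtype=np.float64), 0.0)
    # Normalize so the weights sum to (almost) 1; eps avoids division by zero.
    w = w / (w.sum() + eps)
    # Weighted sum of the input feature maps.
    fused = sum(wi * f for wi, f in zip(w, features))
    return fused, w


# Toy inputs: two feature maps already resized to the same (C, H, W) shape.
p_shallow = np.full((2, 4, 4), 1.0)  # stands in for a high-resolution level
p_deep = np.full((2, 4, 4), 3.0)     # stands in for an upsampled deeper level

fused, weights = fast_normalized_fusion([p_shallow, p_deep], [1.0, 3.0])
print(weights)         # roughly [0.25, 0.75]
print(fused[0, 0, 0])  # roughly 2.5
```

In a trained network the raw weights would be learned per fusion node, so each scale's contribution adapts to the data rather than being fixed as in this toy example.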


CHEN Haixiu, CHEN Ziang, FANG Weizhi, LU Haitao, HUANG Zijie, CHENG Rong. An improved dense pedestrian detection algorithm based on YOLOv8-n in complex scenes[J]. Computer Engineering, doi: 10.19678/j.issn.1000-3428.0070531.



URL: https://www.ecice06.com/EN/10.19678/j.issn.1000-3428.0070531

