基于改进YOLOv8的景区行人检测算法

doi:10.19678/j.issn.1000-3428.0068125

摘要/Abstract

摘要： 针对当前景区行人检测存在检测精度低、算法参数量大和现有公开数据集在小目标检测上存在限制等问题，本文创建了TAPDataset行人检测数据集，填补了现有数据集在小目标检测方面的不足；并基于YOLOv8算法，提出了一种检测精度高、硬件要求低的新模型YOLOv8-L。首先，引入了DepthSepConv轻量化卷积模块，降低了模型的参数量和计算量。其次，采用BiFormer注意力机制和上采样算子CARAFE，加强了模型对图像的语义理解和信息融合能力，显著提升了模型的检测精度。最后，增加了一层小目标检测层，来提取更多的浅层特征，从而有效的改善模型对小目标的检测性能。使用TAPDataset、VOC 2007及TAP+VOC数据集验证算法的有效性。实验结果表明，与YOLOv8相比，在TAPDataset数据集上FPS基本不变的情况下，模型的参数量减少了18.06%，mAP@0.5提高了5.51%，mAP@0.5:0.95提高了6.03%；在VOC 2007数据集上，模型的参数量减少了13.6%，mAP@0.5提高了3.96%，mAP@0.5:0.95提高了6.39%；在TAP+VOC数据集上，模型的参数量减少了14.02%，mAP@0.5提高了4.49%，mAP@0.5:0.95提高了5.68%；改进后的算法具有更强的泛化性能，能够更好的适用于景区行人检测任务。

Abstract: Aiming at the problems of low detection accuracy, large number of algorithm parameters and limitations of existing public datasets on small target detection in the current scenic pedestrian detection, this paper creates the TAPDataset pedestrian detection dataset, which fills the shortcomings of the existing datasets on small target detection; and based on the YOLOv8 algorithm, it proposes a new model with high detection accuracy and low hardware requirements, the YOLOv8-L .First,the lightweight convolution module DepthSepConv is introduced to reduce the number of parameters and computation of the model. Second, the BiFormer attention mechanism and the CARAFE upsampling operator are used to strengthen the model's semantic understanding of images and information fusion ability, which significantly improves the model's detection accuracy. Finally, a small target detection layer is added to extract more shallow features, which effectively improves the model's detection performance for small targets. The effectiveness of the algorithm is verified using TAPDataset, VOC 2007 and TAP+VOC datasets. The experimental results show that compared with YOLOv8, the amount of parameters of the model is reduced by 18.06% on the TAPDataset dataset with the FPS basically unchanged, mAP@0.5 improved by 5.51% and mAP@0.5:0.95 improved by 6.03%; on the VOC 2007 dataset, the amount of parameters of the model is reduced by 13.6%, and mAP@ 0.5 improved by 3.96%, mAP@0.5:0.95 improved by6.39%; on the TAP+VOC dataset, the parameter amount of the model decreased by 14.02%, mAP@0.5 improved by 4.49%, mAP@0.5:0.95 improved by5.68%; the improved algorithm has stronger generalization performance and can be better applied to the scenic pedestrian detection task.

贵向泉, 刘世清, 李立, 秦庆松, 李唐艳. 基于改进YOLOv8的景区行人检测算法[J]. 计算机工程, doi: 10.19678/j.issn.1000-3428.0068125.

GUI Xiangquan, LIU Shiqing, LI Li, QING Qingsong, LI Tangyan. Pedestrian detection algorithm for scenic spots based on improved YOLOv8[J]. Computer Engineering, doi: 10.19678/j.issn.1000-3428.0068125.

参考文献

[1] Tang Chang, Chunping Hou. "RGBD salient object detection by structured low-rank matrix recovery and Laplacian constraint." Transactions of Tianjin University 23 (2017): 176-183.
[2] 王亮,张超.一种基于 YOLOv5 的轻量型行人检测方法[J]. 工业控制计算机，2023，36(04):84-86+89. Wang Liang,Zhang Chao. A lightweight pedestrian detection method based on YOLOv5[J].Industrial Control Computer, 2023,36(04):84-86+89.(in Chinese)[3] Zou Zhengxia,et al.Object detection in 20 years:Asurvey.Proceedingsofthe IEE-E (2023).
[4] Terven,Juan,and Diana Cordova-Esparza. A Comprehensive Review of YOLO: From YOLOv1 to YOLOv8 and Beyond. arXiv preprint arXiv:2304.00501 (2023).
[5] 邱天衡,王玲,王鹏,等.基于改进YOLOv5 的目标检测算法研究[J].计算机工程与应用，2022，58(13):63-73. Qiu Tianheng,Wang Ling,Wang Peng,et al. Research on target detection algorithm based on improved YOLOv5[J]. Computer Engineering and Application, 2022,58(13):63-73. (in Chinese)
[6] 孙传猛,王燕平,王冲,等.融合改进YOLOv3 与三次样条插值的煤岩界面识别方法[J].采矿与岩层控制工程学报,2022,4(01):81-90. SUN Chuanmeng, WANG Yanping, WANG Chong, et al. Coal-rock interface recognition method integrating improved YOLOv3 and cubic spline interpolation[J]. Journal of Mining andRock Control Engineering,2022,4(01):81-90.
[7] 严开忠,马国梁,许立松,等.基于改进YOLOv3 的机载平台目标检测算法[J].电光与控制，2021，28(05):70-74. YAN Kaizhong, MA Guoliang, XU Lisong,et al. Tar-get detection algorithm for airborne platform based on improved YOLOv3[J]. Electro-Optics and Control, 2021, 28(05):70-74. (in Chinese)
[8] 王艺成,张国良,张自杰.基于改进YOLOv5 的小目标检测方法[J].计算机与现代化，2023(05):100-105. WANG Yicheng,ZHANG Guoliang,ZHANG Zijie. Sm-all target detection method based on improved YOLOv5[J]. Computer and Modernization, 2023(05):100-105.(in Chinese)
[9] Cui Cheng,et al.PP-LCNet: A lightweig-ht CPU convol-utional neural network.arXiv preprint arXiv:2109.15099 (2021).
[10] 李文豪,周斌,胡波,张子涵.基于轻量化网络的遮挡人脸检测[J].中南民族大学学报(自然科学版)，2022， 41(03):339-346. W.H. Li, B. Zhou, B. Hu, Z.H. Zhang. Occluded face detection based on lightweight network[J]. Journal of Central South University for Nationalities (Natural Science Edition),2022, 41(03):339-346. (in Chinese)
[11] Zhu Lei,et al.BiFormer:Vision Transfor-mer with Bi- Level Routing Attention.arXiv preprint arXiv:2303.08810(2023).
[12] Wang Jiaqi,et al. Carafe: Content-aware reasse-mbly of features. Proceedings of the IEEE/CV-F international conference on computer vision.2019.
[13] 王程,刘元盛,刘圣杰.基于改进YOLOv4的小目标行人检测算法 [J]. 计算机工程， 2023 ， 49(02):296-302+313.DOI:10.19678/j.issn.1000-3428.006 3623. WANG Cheng,LIU Yuansheng,LIU Shengjie. Small t-arget pedestrian detection algorithm based on improv-ed YOLOv4[J]. Computer Engin-eering, 2023, 49(02):296-302+313. doi:10.19678/j.issn.1000-3428.0063623. (in Chinese)
[14] Zhan Y,Yu J, et al. Multi-task CompositionalNetwork for Visual Relationship Det-ection. IntJ Comput Vis 128, 2146–2165(2020).
[15] 李闻,李小春,闫昊雷.基于改进YOLO v3 的PCB缺陷检测 [J].电光与控制，2022，29(04):106-111. LI Wen, LI Xiaochun, YAN Haolei. PCB defect detection based on improved YOLO v3[J]. Electro-Optics and Control, 2022, 29(04):106-111. (in Chinese)
[16] 洪松,高定国.基于YOLOv3的车辆和行人检测方法[J].电脑知识与技术， 2020 ， 16(08):192-193+198.DOI:10.14004/j.cnki.ckt.2020.0944. HONG Song,GAO Dingguo. Vehicle and pedestrian detection method based on YOLOv3[J]. Computer Knowledge and Technology, 2020, 16(08):192-193+198.DOI:10.14004/j.cnki.ckt.2020.0944. (in Chinese)
[17] Hong, W; Ma, Z;et al. Detection of Green Asparagus in Complex Environments Based on the ImprovedYOLOv5 Algorithm. Sensors 2023, 23, 1562.
[18] Wang Q, Feng W, Yao L, et al . TPH-YOLOv5-Air: Airport Confusing Object Detection via Adaptively Spatial Feature Fusion. Remote Sens. 2023, 15, 3883.
[19] Yue X, Qi K, Na X, Zhang Y, et al .Improved YOL-Ov8-Seg Network for Instance Segmentation of Heal- thy and DiseasedTomato Plants in the Growth Sta- ge. Agriculture 2023, 13, 1643.
[20] Liu S, Qi L, et al. Path aggregation network for insta-nce segmentation. In Proceedings of the IEEE Conference on Comp-uter Vision and Pattern Recognition (C-VPR). 2018, p8759–8768.
[21] 杨永波,李栋.改进YOLOv5的轻量级安全帽佩戴检测算法[J].计算机工程与应用,2022,58(09):201-207. YANG Yongbo,LI Dong. A lightweight helmet wear detection algorithm to improve YOLOv5[J]. Computer Engineering and Applications, 2022,58(09):201-207. (in Chinese)
[22] Wu D, Jiang S, et al. Detection of Camellia oleifera Fruit in Complex Scenes by Using YOLOv7 and Data Augmentation. Appl. Sci. 2022, 12, 11318.
[23] Aboah, Armstrong,et al.“Real-time Multi-Class Helmet Violation Detection Using Few-Shot Data Sampling Technique and YOLOv8.” 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (2023): 5350-5358.
[24] Agorku, Geoffery et al. “Real-Time Helmet Violation Detection Using YOLOv5 and Ensemble Learning.” ArXiv abs/2304.09246 (2023).
[25] Lawal OM. YOLOv5-LiNet: A lightweight network for fruits instance segmentation. PLoS One. 2023 Mar 2;18(3):e0282297.

选择文件类型/文献管理软件名称

选择包含的内容