作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程

• •    

基于改进YOLOv8的轻量化无人机图像目标检测算法

  • 发布日期:2024-12-03

Lightweight Target Detection Algorithm in UAV Images Based on Improved YOLOv8

  • Published:2024-12-03

摘要: 针对无人机图像中小目标实例多、目标间存在遮挡所导致的漏检、误检等现象,提出了一种基于改进YOLOv8的轻量化无人机图像小目标检测算法。在颈部引入三特征编码器、尺度序列特征融合模块,增强了网络对不同尺度特征的提取能力;同时,设计了小目标检测层,并与改进的颈部特征提取网络进行融合,在头部引入一个额外的检测头,减小小目标特征的损失,增强网络对小目标的识别能力;此外,针对CIoU的缺陷,结合Wise-IoU、inner-IoU和MPDIoU,提出了一种回归损失算法Wise-inner-MPDIoU;最后,为了实现算法在移动端和嵌入式场景下的轻量化应用需求,进行了基于幅度的层自适应稀疏化剪枝,在保证模型精度的同时,进一步压缩了模型的大小。实验结果表明,改进后的模型相比于原YOLOv8s模型,在mAP50提高6.8%的同时,参数量下降76.1%,计算量下降17.1%,模型大小降低73.5%,达到了优秀的提升模型检测精度及轻量化效果,具有很强的实用意义。

Abstract: In view of the phenomenon of missed detection and false detection caused by many small target instances and occlusion between targets in drone images, this paper proposes a lightweight small target detection algorithm in UAV images based on improved YOLOv8. The triple feature encoder and scale sequence feature fusion module are introduced in the neck to enhance the network's ability to extract features of different scales. Furthermore, a small target detection layer is designed and fused with the improved neck feature extraction network, and an additional detection head is introduced in the head to reduce the loss of small target features and enhance the network's recognition ability of small targets. Additionally, in view of the defects of CIoU, a regression loss algorithm Wise-inner-MPDIoU is proposed by combining Wise-IoU, inner-IoU and MPDIoU. Finally, in order to realize the lightweight application requirements of the algorithm in mobile and embedded scenarios, amplitude-based layer adaptive sparse pruning is performed to further compress the model size while ensuring the accuracy of the model. Experimental results demonstrate that compared with the original YOLOv8s model, the improved model this paper proposed improves mAP50 by 6.8%, while the number of parameters decreases by 76.1%, the amount of computation decreases by 17.1%, and the model size decreases by 73.5%. This improved model achieves excellent accuracy improvement, lightweight effect, and practical value.