Author Login Chief Editor Login Reviewer Login Editor Login Remote Office

Computer Engineering

   

Air-to-Ground Dense Small Target Detection Algorithm Based on Multi-scale Feature Fusion

  

  • Published:2025-07-08

基于多尺度特征融合的空对地密集小目标检测算法

Abstract: UAV low-altitude aerial photography technology has been widely used in a variety of fields, such as data acquisition, security monitoring, and terrain mapping, and has significantly improved operational efficiency. However, since aerial images usually cover a wide area, the detection targets are often small in size, dense and unevenly distributed, which limits the accuracy of target detection. To address the above problems, a target detection algorithm TOD-YOLO is proposed based on an improved YOLOv8 network. First, an RFAConv downsampling module is introduced in the backbone network part, which provides effective attention to the convolutional kernel without increasing the number of parameters, allowing the network to focus on smaller targets in the feature extraction phase. Secondly, the feature fusion module CSPOKM and a new feature fusion path are proposed to increase the focus on small targets and improve the performance of the feature fusion network. Then, based on the depth separable convolution DWConv and EMA attention mechanism, the lightweight attention detection head DWA-Head is designed to reduce the number of parameters while improving the accuracy of detection. Finally, in order to solve the problem that CIoU is easy to misdetect and miss detection in complex scenes with high density of small targets, DA-MPDIoU, a loss function that can dynamically adjust the positive and negative sample coefficients, is designed to assign higher weights to small target samples that are easy to miss and difficult to classify, and to optimize the training results. Experimental results show that compared with the original YOLOv8 algorithm, the improved algorithm improves mAP@0.5 and mAP@0.5:0.95 on the VisDrone2019 dataset by 9.6 percentage points and 6.8 percentage points respectively, and further generalization experiments are carried out on the DOTA dataset. The experimental results show that the algorithm in this paper exhibits significant advantages and potentials in small target detection tasks.

摘要: 无人机低空航拍技术在数据采集、安全监控以及地形测绘等多个领域中得到了广泛的应用,显著提高了作业效率。然而在无人机航拍目标检测中,检测目标往往存在尺寸较小,密集且分布不均等问题。为此提出一种基于改进YOLOv8网络的目标检测算法TOD-YOLO。首先,在主干网络部分引入RFAConv下采样模块,在增加较少参数量的情况下给卷积核提供了有效的注意力,使得网络在特征提取阶段聚焦于更小的目标。其次,提出特征融合模块CSPOKM以及一种新的特征融合路径,提高模型对小目标特征信息的提取能力,提升特征融合网络的性能。然后,设计轻量级注意力检测头DWA-Head替代了原有的检测头,降低参数量的同时提升了模型的目标检测精度。最后,提出可动态调节正负样本系数的损失函数DA-MPDIoU,为易漏检、难分类的小目标样本分配更高的权重,解决了CIoU在小目标密度大的复杂场景下容易产生误检和漏检的问题。实验结果显示,改进的算法在VisDrone2019数据集上的mAP@0.5和mAP@0.5:0.95指标分别提高了9.6个百分点和6.8个百分点,并进一步在DOTA数据集上进行泛化实验,充分验证了算法在小目标检测任务上有明显的优势和潜力。