Author Login Chief Editor Login Reviewer Login Editor Login Remote Office

Computer Engineering ›› 2026, Vol. 52 ›› Issue (4): 200-213. doi: 10.19678/j.issn.1000-3428.0070252

• Computer Vision and Image Processing • Previous Articles    

Target Detection Algorithm for Remote Sensing Images with Multi-Scale Information Enhancement

YANG Lu, LIU Junjie, YU Xiang   

  1. School of Communications and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
  • Received:2024-08-14 Revised:2024-10-21 Published:2024-11-20

多尺度信息增强的遥感图像目标检测算法

杨路, 刘俊杰, 余翔   

  1. 重庆邮电大学通信与信息工程学院, 重庆 400065
  • 作者简介:杨路,女,高级工程师、硕士,主研方向为机器学习、计算机视觉;刘俊杰(通信作者),硕士研究生,E-mail:s220131052@stu.cqupt.edu.cn;余翔,教授。
  • 基金资助:
    国家自然科学基金(62176035)。

Abstract: Feature extraction from remote sensing images with complex backgrounds is challenging, and he accuracy is low due to the high density of small targets and significant scale variations. To address these challenges, this paper proposes a multi-scale information-enhanced target detection algorithm based on YOLOv5s: Deep Learning YOLO(DL-YOLO). First, the improved algorithm employs cavity convolutional fast spatial pyramid pooling designed based on Spatial Pyramid Pooling-Fast (SPPF) at the top of the backbone network. This improves the feature extraction capability of the network by fusing the detailed information of the multi-scale targets with the semantic information through the Receptive Field Enhancement Block (RFEB). Second, the improved algorithm incorporates a Lightweight and Efficient Detection Head (LEDH), which is based on the Decoupling Head (DH) of YOLOv6. The original detection head is replaced with the LEDH, which features a lightweight cavity Global Depth Convolution (GDConv) module, to improve the correlation learning of classification and regression tasks. The LEDH also employs lightweight convolution for lightweighting purposes, which enhances the target detection accuracy at different scales and reduces the number of decoupling head parameters. The results of the experiment on the DIOR dataset demonstrate that the proposed DL-YOLO algorithm increases precision, recall, mAP@0.5, and mAP by 1.6, 2.1, 2.1, and 4.7 percentage points, respectively, compared with YOLOv5s. The all-around score of the proposed algorithm surpasses those of several current exceptional target detection algorithms; hence, it is feasible for detecting targets in remote sensing images at multiple scales.

Key words: remote sensing images, complex background, YOLOv5s algorithm, multi-scale target detection, Decoupling Header (DH)

摘要: 针对复杂背景遥感图像中小目标密集、目标尺度变化大等因素给目标检测带来的特征提取困难、精度不佳的问题,在YOLOv5s基础上提出一种多尺度信息增强的目标检测算法——深度学习YOLO(DL-YOLO)。首先,改进算法在主干网络顶部采用基于快速空间金字塔池化设计的空洞卷积快速空间金字塔池化,通过其中的感受野增强模块(RFEB)融合多尺度目标的细节信息与语义信息,提高网络的特征提取能力。其次,改进算法的检测头部分采用以YOLOv6s解耦头(DH)为基础设计的轻量高效解耦头(LEDH)来替换原有的检测头,在该解耦头中设计了轻量化空洞全局深度卷积(GDConv)模块来增强分类与回归任务关联信息的学习,以及引用轻量化卷积实现轻量化,在提高各尺度目标检测精度的同时,降低解耦头参数量。在DIOR数据集上的实验结果表明,与YOLOv5s相比,提出的DL-YOLO算法在精确率、召回率、mAP@0.5、mAP上分别提高了1.6、2.1、2.1和4.7百分点,综合指标超过了现有优秀的目标检测算法,对遥感图像中多尺度目标检测具有实际应用意义。

关键词: 遥感图像, 复杂背景, YOLOv5s算法, 多尺度目标检测, 解耦头

CLC Number: