基于特征交织与仿生中央凹聚焦的实时小目标检测方法

doi:10.19678/j.issn.1000-3428.0253460

摘要/Abstract

摘要： 复杂场景下的小目标检测任务长期面临两大技术瓶颈：一是微弱目标特征在深层神经网络中极易衰减，二是环境背景噪声干扰严重。针对上述瓶颈，本研究提出一种端到端实时小目标检测模型WF-DETR。特征提取阶段设计特征交织网络，摒弃简单层级堆叠方式，采用异构特征交织策略；借助跨层级特征互校正机制，将深层语义信息与浅层几何细节紧密交织并双向校准，在保证高层语义强度的同时，有效抑制特征传递过程中的空间信息衰减，缓解小目标特征丢失问题。颈部网络部分受人类视觉生理机制启发，提出FoveaFormer模块，通过自适应稀疏注意力机制与门控单元模拟人眼中央凹成像机制，动态过滤背景冗余噪声，聚焦高价值目标区域，显著提升特征纯度。此外，引入哈尔小波下采样算子重构下采样过程，从频域角度克服传统池化导致的高频纹理细节不可逆丢失弊端，进一步增强小目标特征辨识度。在VisDrone2019基准数据集上的实验结果表明，模型mAP@0.5:0.95达23.7%，推理速度高达166.3 FPS。实验结果充分验证WF-DETR在复杂背景小目标检测任务中的实时性与优越性。

Abstract: Small object detection in complex scenarios has long grappled with two major technical bottlenecks: the propensity for weak object features to attenuate within deep neural networks, and the severe interference caused by environmental background noise. To address these challenges, this study proposes WF-DETR, an end-to-end real-time small object detection model. In the feature extraction stage, a Feature Weaving Network (WeaveNet) is designed. Diverging from simple hierarchical stacking, WeaveNet employs a heterogeneous feature weaving strategy. Leveraging a cross-level feature mutual correction mechanism, it tightly interweaves and bidirectionally calibrates deep semantic information with shallow geometric details. This approach effectively suppresses the attenuation of spatial information during feature transmission and mitigates small object feature loss, all while maintaining high-level semantic strength. Inspired by human visual physiological mechanisms, the neck network incorporates a FoveaFormer module. By simulating the human foveal imaging mechanism via adaptive sparse attention and gating units, this module dynamically filters redundant background noise and focuses on high-value target regions, significantly enhancing feature purity. Furthermore, a Haar Wavelet Downsample (HWD) operator is introduced to reconstruct the downsampling process. From a frequency domain perspective, this overcomes the irreversible loss of high-frequency texture details caused by traditional pooling, further augmenting the discriminability of small object features. Experimental results on the VisDrone2019 benchmark dataset demonstrate that the proposed model achieves mAP@0.5:0.95 of 23.7% and an inference speed of 166.3 FPS. These results fully validate the real-time performance and superiority of WF-DETR in small object detection tasks within complex backgrounds.

谢斌红, 孙晓松, 张睿. 基于特征交织与仿生中央凹聚焦的实时小目标检测方法[J]. 计算机工程, doi: 10.19678/j.issn.1000-3428.0253460.

XIE Binhong, SUN Xiaosong, ZHANG Rui. Real-Time Small Object Detection via Feature Weaving and Bio-inspired Foveal Focusing[J]. Computer Engineering, doi: 10.19678/j.issn.1000-3428.0253460.

参考文献

[1] Wei W, Cheng Y, He J, et al. A review of small object detection based on deep learning[J]. Neural Computing and Applications, 2024, 36(12): 6283-6303.
[2] WANG H, LIU C, CAI Y, et al. YOLOv8-QSD: An improved small object detection algorithm for autonomous vehicles based on YOLOv8[J]. IEEE Transactions on Instrumentation and Measurement, 2024, 73: 1-16.
[3] ZHAO T, KIBLAWI S, USUYAMA N, et al. Boltzmann attention sampling for image analysis with small objects[C]//Proceedings of the Computer Vision and Pattern Recognition Conference. 2025: 25950-25959.
[4] 邝先验,王星星,王龙锋,等.基于改进YOLO的多尺度聚合遥感图像小目标检测算法[J].科学技术与工程,2025,25(20):8560-8570. (KUANG X Y, WANG X X, WANG L F, et al. Small target detection algorithm for multi-scale aggregate remote sensing images based on improved YOLO[J]. Science Technology and Engineering, 2025, 25(20): 8560-8570.)
[5] Redmon J, Divvala S, Girshick R, et al. You only look once: Unified, real-time object detection[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2016: 779-788.
[6] Liu W, Anguelov D, Erhan D, et al. Ssd: Single shot multibox detector[C]//European conference on computer vision. Cham: Springer International Publishing, 2016: 21-37.
[7] Tian Z, Shen C, Chen H, et al. FCOS: A simple and strong anchor-free object detector[J]. IEEE transactions on pattern analysis and machine intelligence, 2020, 44(4): 1922-1933.
[8] 陈海秀,陈子昂,宁馨,等.改进YOLOv10n的密集行人小目标检测方法[J/OL].小型微型计算机系统,1-10[2025-0930].https://link.cnki.net/urlid/21.1106.tp.20250618.1645.002.(CHEN H X, CHEN Z A, NING X, et al. Improved YOLOv10n Dense Pedestrian Small Target Detection Method[J/OL], Journal of Chinese Computer Systems, 1-10[2025-0930]. https://link.cnki.net/urlid/21.1106.tp.20250618.1645.002.).
[9] 李牧,陶启婷,柯熙政.基于改进YOLOv5的小目标交通标志检测算法[J].计算机应用,2025,45(S1):239-244.DOI:CNKI:SU-N:JSJY.0.2025-S1-041.(LI M, TAO Q, KE X. Detection algorithm for small-target traffic signs based on improved YOLOv5[J], Journal of ComputerApplications, 2025, 45(S1): 239-244. DOI: CNKI: SUN: JSJY. 0.2025-S1-041.)
[10] Xiao Y, Di N. SOD-YOLO: A lightweight small object detection framework[J]. Scientific Reports, 2024, 14(1): 25624.
[11] Li M, Chen Y, Zhang T, et al. TA-YOLO: a lightweight small object detection model based on multi-dimensional trans-attention module for remote sensing images[J]. Complex & Intelligent Systems, 2024, 10(4): 5459-5473.
[12] Carion N, Massa F, Synnaeve G, et al. End-to-end object detection with transformers[C]//European conference on computer vision. Cham: Springer International Publishing, 2020: 213-229.
[13] Zhao Y, Lv W, Xu S, et al. Detrs beat yolos on real-time object detection[C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2024: 16965-16974.
[14] Peng Y, Li H, Wu P, et al. D-FINE: Redefine regression task in DETRs as fine-grained distribution refinement[J]. arXiv preprint arXiv:2410.13842, 2024.
[15] Huang S, Lu Z, Cun X, et al. Deim: Detr with improved matching for fast convergence[C]//Proceedings of the computer vision and pattern recognition conference. 2025: 15162-15171.
[16] 刘思元,高凯,雍龙泉.改进RT-DETR的航拍小目标检测算法[J].计算机工程与应用,2025,61(04):272-281.(LIU S Y,GAO K, YONG L Q. Improved RT-DETR Algorithm for Aerial Small Object Detection[J]. Computer Engineering and Applications, 2025, 61(04): 272-281.)
[17] Dong Y, Xu F, Guo J. LKR-DETR: Small object detection in remote sensing images based on multi-large kernel convolution[J]. Journal of Real-Time Image Processing, 2025, 22(1): 46.
[18] 田红鹏,李志强,杨赛.改进RT-DETR的航拍图像小目标检测算法[J/OL].计算机工程,1-14[2025-11-18].https://doi.org/10.19678/j.issn.1000-3428.0252661.(TIAN H P, LI Z Q, YANG S. An Improved Algorithm for Small Object Detection in UAV Aerial Images Based on RT-DETR[J/OL]. ComputerEngineering, 1-14[2025-11-18]. https://doi.org/10.19678/j.issn.1000-3428.0252661.)
[19] Liu Y, Ji Y, Ren Q, et al. An Innovative Small-Target Detection Approach Against Information Attenuation: Fusing Enhanced Programmable Gradient Information and a Novel Mamba Module[J]. Sensors, 2025, 25(7): 2117.
[20] Li Q, Song W. FPH-DEIM: A Lightweight Underwater Biological Object Detection Algorithm Based on Improved DEIM[J]. Applied System Innovation, 2025, 8(5): 123.
[21] ZHOU Y, WEI Y. UAV-DETR: an enhanced RT-DETR architecture for efficient small object detection in UAV imagery[J]. Sensors, 2025, 25(15): 4582.
[22] Shi D, Zhao C, Shao J, et al. Context-Aware Enhanced Feature Refinement for small object detection with Deformable DETR[J]. Frontiers in Neurorobotics, 2025, 19: 1588565.
[23] Zhu X, Su W, Lu L, et al. Deformable detr: Deformable transformers for end-to-end object detection[J]. arXiv preprint arXiv:2010.04159,2020.
[24] Roh B, Shin J W, Shin W, et al. Sparse detr: Efficient end-to-end object detection with learnable sparsity[J]. arXiv preprint arXiv:2111.14330,2021.
[25] XU G, LIAO W, ZHANG X, et al. Haar wavelet downsampling: A simple but effective downsampling module for semantic segmentation[J]. Pattern recognition, 2023, 143:109819.
[26] Cao Y, He Z, Wang L, et al. VisDrone-DET2021: The vision meets drone object detection challenge results[C]//Proceedings of the IEEE/CVF International conference on computer vision. 2021: 2847-2854.
[27] Yu X, Gong Y, Jiang N, et al. Scale match for tiny person detection[C]//Proceedings of the IEEE/CVF winter conference on applications of computer vision. 2020: 1257-1265.
[28] Xia G S, Bai X, Ding J, et al. DOTA: A large-scale dataset for object detection in aerial images[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2018: 3974-3983.
[29] 杨树莹,葛华勇.基于AMD-YOLOv8的无人机图像小目标检测[J/OL].计算机系统应用,1-11[2025-09-17].https://doi.org/10.15888/j.cnki.csa.009994.(YANG S Y, GE H Y. Small target detection in UAV images based on AMD-YOLOv8[J/OL]. Computer system application, 1-11[2025-09-17]. https://doi.org/10.15888/j.cnki.csa.009994.)
[30] 刘臣杰,刘巍,杨雯迪,等.DEPA-YOLO：无人机视角下的小目标检测模型[J/OL].计算机科学与探索,1-16[2025-09-17].https://link.cnki.net/urlid/11.5602.tp.20250826.0912.004.(LIU C J, LIU W, YANG W D, et al.DEPA-YOLO：A drone-based small object detection model[J/OL]. Journal of Frontiers of Computer Science and Technology, 1-16[2025-09-17]. https://link.cnki.net/urlid/11.5602.tp.20250826.0912.004.)
[31] 蒲小莉,赖惠成,高古学.BF-YOLO:基于YOLOv8改进的小目标检测算法[J].计算机工程与科学,2025,47(08):1425-1436.(PU X L, LAI H C, GAO G X. BF-YOLO: An improved small object detection algorithm based on YOLOv8[J]. Computer Engineering⪼ience, 2025, 47(08): 1425-1436.)
[32] Wang H, Liu J, Zhao J, et al. Precision and speed: LSOD-YOLO for lightweight small object detection[J]. Expert Systems with Applications, 2025, 269: 126440.
[33] Tang S, Zhang S, Fang Y. HIC-YOLOv5: Improved YOLOv5 for small object detection[C]//2024 IEEE international conference on robotics and automation (ICRA). IEEE, 2024: 6614-6619.
[34] 程鑫淼,张雪松,曹冰洁,等.改进RT-DETR的小目标检测方法研究[J].计算机工程与应用,2025,61(15):144-155.(CHENG X M, ZHANG X S, CAO B J, et al.Research on small object detection method of improved RT-DETR[J]. Computer Engineering and Applications, 2025, 61(15): 144-155.)
[35] Zhang C, Yang J. Emsd-detr: efficient small object detection for UAV aerial images based on enhanced RT-DETR model: C. Zhang, J. Yang[J]. The Journal of Supercomputing, 2025, 81(9): 1052.
[36] Ren S, He K, Girshick R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks[J]. IEEE transactions on pattern analysis and machine intelligence, 2016, 39(6): 1137-1149.
[37] Cai Z, Vasconcelos N. Cascade r-cnn: Delving into high quality object detection[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2018: 6154-6162.
[38] Wang J, Gao J, Zhang B. A small object detection model in aerial images based on CPDD-YOLOv8[J]. Scientific Reports, 2025, 15(1): 770

选择文件类型/文献管理软件名称

选择包含的内容