Author Login Chief Editor Login Reviewer Login Editor Login Remote Office

Computer Engineering ›› 2026, Vol. 52 ›› Issue (3): 119-127. doi: 10.19678/j.issn.1000-3428.0070159

• Computer Vision and Image Processing • Previous Articles     Next Articles

BS-YOLO: A Small Object Detection Algorithm Based on BSAM Attention Mechanism and SCConv

CAO Jiwei1,2, LUO Fei1,*(), DING Weichao1   

  1. 1. School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200000, China
    2. Shanghai Key Laboratory of Computer Software Evaluating and Testing, Shanghai 200000, China
  • Received:2024-07-22 Revised:2024-09-13 Online:2026-03-15 Published:2024-12-02
  • Contact: LUO Fei

BS-YOLO: 基于BSAM注意力机制和SCConv的小目标检测算法

曹继卫1,2, 罗飞1,*(), 丁炜超1   

  1. 1. 华东理工大学信息科学与工程学院, 上海 200000
    2. 上海市计算机软件评测重点实验室, 上海 200000
  • 通讯作者: 罗飞
  • 作者简介:

    曹继卫, 男, 硕士研究生, 主研方向为目标检测与识别

    罗飞(通信作者),副教授

    丁炜超, 副教授

  • 基金资助:
    上海市自然科学基金(22ZR1416500)

Abstract:

In recent years, there has been significant progress in terms of accuracy and robustness of deep-learning-based algorithms for object detection that have been widely applied in industry. However, in the field of small object detection, currently used object detection algorithms suffer from high rates of missed detections and false positives. Therefore, in this study, a YOLO small object detection algorithm, viz., BS-YOLO, which is based on SCConv and BSAM attention mechanism, is developed. First, in response to the problem of the large amount of redundant information generated in the feature extraction network, a new module, viz., C3SC, is proposed to reconstruct the backbone network using SCConv. This module reduces redundant information in both spatial and channel aspects of the extracted feature maps, thereby improving the quality of the feature maps extracted by the backbone network, and in turn enhancing detection accuracy. Second, a new attention mechanism, viz., BSAM, is proposed by combining CBAM and the BiFormer self-attention mechanism, by which weights are allocated reasonably in both spatial and channel aspects, making the feature map more focused on effective information and suppressing background interference. Finally, to solve the problem of uneven distribution of difficult and easy samples in terms of small object detection, Slideloss is used to optimize the loss function, thereby improving the effectiveness of the algorithm for small object detection. The experimental results obtained using the RSOD dataset show that the BS-YOLO algorithm has a precision of 94.2%, a recall rate of 91.6%, and a mAP@0.5 of 95.9%, corresponding to improvements of 3.3, 0.1, and 3.6 percentage, respectively, compared to the original YOLOv5 algorithm. This indicates that the BS-YOLO algorithm can effectively improve the accuracy of small object detection and reduce the missed detection rate.

Key words: small object detection, attention mechanism, feature purification, computer vision, deep learning

摘要:

近年来, 基于深度学习的目标检测算法在准确率和鲁棒性等方面取得了巨大进步, 并且在工业界得到广泛应用。但是, 在小目标检测领域, 当前的目标检测算法仍然存在漏检率和误检率高的问题。因此, 提出一种基于SCConv和BSAM注意力机制的YOLO小目标检测算法BS-YOLO。首先, 针对特征提取网络存在大量冗余信息的问题, 利用SCConv重构主干网络, 提出一种新的模块C3SC, 对提取到的特征图从空间和通道两个方面减少冗余信息, 提升主干网络提取到的特征图质量, 从而提高检测精度; 其次, 结合CBAM和BiFormer自注意力机制提出一种新的注意力机制BSAM, 在空间和通道两个方面合理分配权重, 使特征图更加关注有效信息, 抑制背景的干扰; 最后, 为了解决小目标检测存在的难易样本分布不均的问题, 利用Slideloss优化损失函数, 从而提高小目标检测的效果。在RSOD数据集上的实验结果表明, BS-YOLO算法的精确率为94.2%, 召回率为91.6%, 均值平均精度(mAP@0.5)为95.9%, 相对于原始的YOLOv5算法, 分别提高了3.3、0.1、3.6百分点, 表明BS-YOLO算法可以有效提高小目标检测的精度, 降低漏检率。

关键词: 小目标检测, 注意力机制, 特征提纯, 计算机视觉, 深度学习