Lettuce Core Detection Algorithm Based on Improved YOLOv5s

doi:10.19678/j.issn.1000-3428.0069727

Abstract:

Precise crop row detection is a key technology in intelligent agriculture and is important for the navigation and picking operations of unmanned harvesters. Slanting, displacement, and lodging during lettuce growth make crop row extraction inaccurate. This study recasts the problem as object detection on the lettuce core region and proposes a detection algorithm that targets the cores of mature lettuce. The algorithm is based on YOLOv5s, a widely adopted object detection framework. A dynamic convolution module embedded in the backbone network filters background interference out of the feature maps in a dynamically aware manner and preserves important detail features in local regions, which strengthens the network's ability to learn lettuce core features. A multi-scale fusion module based on dilated convolution and weight sharing is introduced into the Feature Pyramid Network (FPN) structure so that target structural information is retained after repeated downsampling, which benefits the detection of small targets such as lettuce cores. The CARAFE upsampling operation is introduced to make full use of contextual information during feature extraction, further improving the extraction of small-target features. In addition, a new loss function based on the Wasserstein distance and SIoU is proposed to address the sensitivity of traditional IoU-based methods to the position of small targets and to speed up network fitting. Experimental results show that the improved algorithm achieves an average precision of 0.586 and a recall of 0.574 for lettuce core extraction, which are 6.1 and 6.3 percentage points higher, respectively, than those of YOLOv5s.
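To illustrate why a Wasserstein-based term helps with small targets, the sketch below compares plain IoU with the commonly used normalized Wasserstein distance (NWD), in which each box is modeled as a 2D Gaussian. This is only an illustration under stated assumptions: the abstract does not give the exact loss, so the function names, the `(cx, cy, w, h)` box format, and the normalizing constant `c` are hypothetical choices, and plain IoU stands in for SIoU.

```python
import math


def iou(a, b):
    """Plain IoU of two boxes given as (cx, cy, w, h)."""
    ax1, ay1 = a[0] - a[2] / 2, a[1] - a[3] / 2
    ax2, ay2 = a[0] + a[2] / 2, a[1] + a[3] / 2
    bx1, by1 = b[0] - b[2] / 2, b[1] - b[3] / 2
    bx2, by2 = b[0] + b[2] / 2, b[1] + b[3] / 2
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = a[2] * a[3] + b[2] * b[3] - inter
    return inter / union if union > 0 else 0.0


def nwd(a, b, c=12.8):
    """Normalized Wasserstein distance between two boxes.

    Each box is treated as a Gaussian with mean (cx, cy) and standard
    deviations (w/2, h/2). `c` is a dataset-dependent constant; the
    default here is an assumed value, not one taken from the paper.
    """
    w2_squared = (
        (a[0] - b[0]) ** 2
        + (a[1] - b[1]) ** 2
        + ((a[2] - b[2]) / 2) ** 2
        + ((a[3] - b[3]) / 2) ** 2
    )
    return math.exp(-math.sqrt(w2_squared) / c)


# An 8x8 ground-truth box and two predictions shifted by 4 and 8 pixels.
gt = (50.0, 50.0, 8.0, 8.0)
near = (54.0, 50.0, 8.0, 8.0)
far = (58.0, 50.0, 8.0, 8.0)

for name, pred in (("shift 4 px", near), ("shift 8 px", far)):
    print(name, "IoU =", round(iou(gt, pred), 3), "NWD =", round(nwd(gt, pred), 3))
```

For the 8-pixel shift the boxes no longer overlap, so IoU is exactly 0 and gives no gradient signal, while NWD still decreases smoothly with the offset. A loss of the form `1 - nwd(...)` therefore stays informative for small boxes.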
After the network outputs the coordinates of the lettuce cores, the least squares method fits a straight line to these coordinate points to obtain the centerline of the lettuce crop row. The algorithm markedly reduces the missed detections of lettuce cores that the original YOLOv5s model produces under different lighting conditions, enabling more accurate extraction of the crop row centerline.
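The line-fitting step can be sketched as follows. This is a minimal example, assuming that detections arrive as `(x1, y1, x2, y2)` pixel boxes and that rows run roughly along the image's vertical axis, so the line is parameterized as x = k*y + b to avoid an infinite slope. The helper names and the parameterization are assumptions for illustration, not details from the paper.

```python
def box_centers(boxes):
    """Convert (x1, y1, x2, y2) boxes to (x, y) center points."""
    return [((x1 + x2) / 2, (y1 + y2) / 2) for x1, y1, x2, y2 in boxes]


def fit_row_centerline(points):
    """Ordinary least-squares fit of x = k * y + b to (x, y) points.

    Returns (k, b). Fitting x as a function of y keeps the slope finite
    for rows that are close to vertical in the image.
    """
    n = len(points)
    if n < 2:
        raise ValueError("need at least two points to fit a line")
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    s_yy = sum((y - mean_y) ** 2 for _, y in points)
    if s_yy == 0:
        raise ValueError("all points share the same y; line is undefined")
    s_xy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    k = s_xy / s_yy
    b = mean_x - k * mean_y
    return k, b


# Three hypothetical core detections from one crop row.
boxes = [(90, -10, 110, 10), (140, 90, 160, 110), (190, 190, 210, 210)]
k, b = fit_row_centerline(box_centers(boxes))
print("centerline: x = %.3f * y + %.3f" % (k, b))
```

In practice the detected centers would first need to be grouped by row before each group is fitted; that grouping step is outside this sketch.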

Key words: object detection network, small target detection, attention mechanism, multi-scale feature fusion, crop row centerline

DAI Yinqiao, XIAO Wulong, LI Bailin, LI Li. Lettuce Core Detection Algorithm Based on Improved YOLOv5s[J]. Computer Engineering, 2026, 52(6): 352-364.

https://www.ecice06.com/CN/Y2026/V52/I6/352

Figures/Tables (18)

Fig.1 Comparison of attention mechanism implementation effects

Fig.2 BRA attention mechanism principle

Fig.3 Bi-CBS convolutional module structure

Fig.4 RFE module structure

Fig.5 Multi-scale fusion module structure

Fig.6 CARAFE upsampling operator principle

Fig.7 Structure of the improved model

Fig.8 Comparison before and after image processing

Fig.9 Comparison of bounding box loss curves

Fig.10 Comparison of detection effects under strong light conditions

Fig.11 Comparison of detection effects under low light conditions

Fig.12 Comparison between the crop row centerline and the actual crop row centerline

