Improved DeepLabv3+ Road Surface Crack Detection Method

doi:10.19678/j.issn.1000-3428.0069114

Abstract:

Effective detection of road surface cracks is key to maintaining road safety and prolonging road service life. Traditional road surface crack detection methods have difficulty identifying fine cracks, produce discontinuous segmentations, and achieve low segmentation accuracy. To address these problems, an improved DeepLabv3+ road surface crack detection method is proposed that aims to reduce the number of model parameters while improving detection accuracy. First, the backbone of the baseline DeepLabv3+ model is replaced with an optimized MobileNetv2 network to reduce the number of parameters and the complexity of the model and to increase running speed. Second, a Strip Pooling Module (SPM) is integrated into the Atrous Spatial Pyramid Pooling (ASPP) module so that the network captures more crack context information and preserves the features of fine crack segments. Finally, a Convolutional Block Attention Module (CBAM) is introduced so that the network focuses more on the pixel regions that are decisive for crack detection, which strengthens the feature representation of crack images. Experimental results show that the improved DeepLabv3+ model achieves a Mean Pixel Accuracy (MPA) of 87.85%, a Mean Intersection over Union (MIoU) of 80.53%, an accuracy of 97.51%, a precision of 88.65%, and an F1 score of 88.24%, which are 1.77%, 2.03%, 0.30%, 2.25%, and 1.51% higher, respectively, than those of the baseline DeepLabv3+ model and also higher than those of the U-Net, HR-Net, and PSP-Net models. In addition, the improved model has 6.382×10⁶ parameters, 88.3% of the parameter count of the baseline model; it offers better real-time performance and is better suited to road surface crack detection.

Key words: crack detection, semantic segmentation, convolutional neural network, strip pooling module, attention mechanism
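
The evaluation indices named in the abstract (MPA, MIoU, accuracy, precision, F1) can all be derived from a pixel-level confusion matrix. The sketch below uses the standard textbook definitions; the exact formulas and averaging used in the paper are not given in this excerpt, so treating precision and F1 as crack-class quantities is an assumption, and the function name `segmentation_metrics` and the example counts are purely illustrative.

```python
def segmentation_metrics(conf, crack=1):
    """Compute common segmentation metrics from a confusion matrix.

    conf[i][j] is the number of pixels whose true class is i and whose
    predicted class is j. Assumes every class occurs at least once in
    both the ground truth and the prediction (no zero denominators).
    """
    n = len(conf)
    total = sum(sum(row) for row in conf)
    row_sums = [sum(conf[i]) for i in range(n)]                       # true pixels per class
    col_sums = [sum(conf[i][j] for i in range(n)) for j in range(n)]  # predicted pixels per class
    diag = [conf[i][i] for i in range(n)]                             # correctly labeled pixels

    # Per-class pixel accuracy and intersection over union.
    pixel_acc = [diag[i] / row_sums[i] for i in range(n)]
    iou = [diag[i] / (row_sums[i] + col_sums[i] - diag[i]) for i in range(n)]

    # Precision, recall and F1 for the crack class (assumed convention).
    tp = diag[crack]
    precision = tp / col_sums[crack]
    recall = tp / row_sums[crack]

    return {
        "accuracy": sum(diag) / total,
        "mpa": sum(pixel_acc) / n,
        "miou": sum(iou) / n,
        "precision": precision,
        "recall": recall,
        "f1": 2 * precision * recall / (precision + recall),
    }


# Hypothetical counts for a 100-pixel image: class 0 = background, class 1 = crack.
example = [[90, 2],
           [3, 5]]
metrics = segmentation_metrics(example)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

Because crack pixels are usually a small fraction of the image, overall accuracy can stay high even when the crack class is segmented poorly, which is one reason class-averaged indices such as MPA and MIoU are reported alongside it.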

YANG Ping, ZHANG Xi. Improved DeepLabv3+ Road Surface Crack Detection Method[J]. Computer Engineering, 2025, 51(4): 261-270.

https://www.ecice06.com/CN/Y2025/V51/I4/261

Figures/Tables: 11

Fig.1 Network structure of the DeepLabv3+ model

Fig.2 Network structure of the improved DeepLabv3+ model

Fig.3 Comparison between conventional pooling and strip pooling

Fig.4 Structure of the SPM

Fig.5 Structure of the CBAM

Fig.6 Loss curves of different semantic segmentation models

Fig.7 Comparison of crack detection results of different models

Fig.8 Comparison of crack detection results of different models under complex backgrounds
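
Figs. 3 and 4 concern strip pooling. As a rough illustration of the pooling step only, the sketch below averages a single-channel feature map along full rows and full columns, broadcasts the two strips back to the original size, sums them, and uses a sigmoid of the result to gate the input. The learned convolutions of a real SPM are omitted, and the way the paper integrates the SPM into ASPP is not described in this excerpt, so the function `strip_pool` and the toy input are assumptions made for illustration.

```python
import math


def strip_pool(feature):
    """Simplified strip pooling on an H x W map (list of lists of floats).

    Returns the row averages (one value per row), the column averages
    (one value per column), the fused H x W map, and the gated output.
    Learned convolution layers are intentionally left out.
    """
    h, w = len(feature), len(feature[0])

    # Horizontal strips: average over each full row (H x 1).
    row_avg = [sum(row) / w for row in feature]
    # Vertical strips: average over each full column (1 x W).
    col_avg = [sum(feature[i][j] for i in range(h)) / h for j in range(w)]

    # Broadcast both strips back to H x W and add them.
    fused = [[row_avg[i] + col_avg[j] for j in range(w)] for i in range(h)]

    # Sigmoid gate applied element-wise to the input.
    gated = [[feature[i][j] / (1.0 + math.exp(-fused[i][j])) for j in range(w)]
             for i in range(h)]
    return row_avg, col_avg, fused, gated


# Toy map with a one-pixel-wide vertical "crack" in the third column.
crack_map = [[0.0, 0.0, 1.0, 0.0],
             [0.0, 0.0, 1.0, 0.0],
             [0.0, 0.0, 1.0, 0.0]]
row_avg, col_avg, fused, gated = strip_pool(crack_map)
print("row averages:   ", row_avg)
print("column averages:", col_avg)
print("fused first row:", fused[0])
```

In this toy input the column strip aligned with the crack keeps its full response, whereas a square averaging window covering the same pixels would mix in the surrounding background. This is the intuition behind using long, narrow pooling windows for thin, elongated structures.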
