Author Login Chief Editor Login Reviewer Login Editor Login Remote Office

Computer Engineering ›› 2026, Vol. 52 ›› Issue (4): 187-199. doi: 10.19678/j.issn.1000-3428.0070182

• Computer Vision and Image Processing • Previous Articles     Next Articles

Research on Pavement Anomaly Detection Technology Based on Improved RT-DETR

LIU Ze1, SONG Tinglun2,*(), SHI Xianrang1, SU Yang1, ZHAO Qun1   

  1. 1. College of Energy and Power Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, Jiangsu, China
    2. Chery Automobile Co., Ltd., Wuhu 241007, Anhui, China
  • Received:2024-07-26 Revised:2024-09-02 Online:2026-04-15 Published:2024-11-19
  • Contact: SONG Tinglun

基于改进RT-DETR的路面异常检测技术研究

刘泽1, 宋廷伦2,*(), 石先让1, 苏洋1, 赵群1   

  1. 1. 南京航空航天大学能源与动力学院, 江苏 南京 210016
    2. 奇瑞汽车股份有限公司, 安徽 芜湖 241007
  • 通讯作者: 宋廷伦
  • 作者简介:

    刘泽, 男, 硕士研究生, 主研方向为计算机视觉、目标检测

    宋廷伦(通信作者), 教授、博士

    石先让, 博士研究生

    苏洋, 博士研究生

    赵群, 硕士研究生

Abstract:

Pavement anomaly detection holds significant practical importance for ensuring driving safety, optimizing traffic management, and enhancing driving experience. To address the challenges posed by variations in the size, shape, and color of pavement anomalies, and complex environmental interferences that lower detection accuracy and efficiency, this study proposes an improved Real-Time Detection Transformer (RT-DETR)-based technology for pavement anomaly target detection. First, a Large Receptive Field Element-wise Multiplication Block (LRFEM_Block) is designed to replace the BasicBlock module in the original backbone network, effectively enhancing feature expression capabilities based on the element-wise multiplication principle. Next, a Generalized Efficient Layer Aggregation Network (GELAN) is introduced and combined with multi-scale LRFEM_Block modules to design a Multiplicative-based Layer Aggregation Intra-scale Feature Interaction (MLA-IFI) structure, which improves the computational efficiency and performance of the neck network for deep features and optimizes the gradient propagation path. Additionally, the Selective Boundary Aggregation (SBA) concept is employed to construct a Bidirectional Adaptive Boundary Fusion Feature Pyramid Network (BABF-FPN) multi-scale feature fusion module, adaptively aggregating features of different resolutions bidirectionally and promoting the refinement of small object boundaries. Experimental results show that the improved method achieves a 3.4 and 4.7 percentage point increase in mAP@0.5 on a self-built dataset and the RDD2022 public dataset, respectively, outperforming other models. Moreover, it reduces the number of parameters and computational load by 24.5% and 11.2%, respectively, with a detection speed of 74 frame/s, thereby satisfying the deployment requirements for in-vehicle pavement anomaly detection.

Key words: pavement anomaly detection, Real-Time Detection Transformer (RT-DETR) algorithm, element-wise multiplication, Generalized Efficient Layer Aggregation Network (GELAN) structure, recalibration

摘要:

路面异常检测对于保障行车安全、优化交通管理和驾乘体验具有重要的现实意义。针对路面异常物体在尺寸、形状及颜色等方面跨度较大, 以及复杂环境干扰导致检测精度和效率较低的问题, 提出一种改进实时检测Transformer(RT-DETR)的路面异常目标检测技术。首先, 设计大感受野元素乘法模块(LRFEM_Block)替代原主干网络中的BasicBlock模块, 该模块依据元素乘法原理有效增强特征表达能力。其次, 引入广义高效层聚合网络(GELAN)思想, 并结合多尺度LRFEM_Block设计一种基于元素乘法的层聚合尺度内特征交互(MLA-IFI)结构, 提高颈部网络对深层特征的计算效率和性能, 优化梯度传播路径。此外, 引入自适应选择边界聚合(SBA)思想构建双向自适应边界融合特征金字塔网络(BABF-FPN)多尺度特征融合模块, 自适应双向聚合不同分辨率特征, 促进小目标物体模糊边界的细粒度化。实验结果表明, 改进方法在自建数据集和RDD2022公开数据集上的mAP@0.5相较于基线算法分别提升3.4和4.7百分点, 均优于其他测试模型, 并且参数量和计算量分别减少24.5%和11.2%, 算法检测速度达到74帧/s, 更加契合车载路面异常检测的部署需求。

关键词: 路面异常检测, 实时检测Transformer算法, 元素乘法, 广义高效层聚合网络结构, 重校准