Adaptive Spatial Transformation Method for Vehicle Detection Based on Roadside Cameras

doi:10.19678/j.issn.1000-3428.0068539

Abstract

Abstract:

To address the challenges in vehicle detection accuracy and efficiency using roadside cameras, this study presents an innovative vehicle detection framework that synergizes Convolutional Neural Network (CNN) and the Transformer architecture. Given the intricacies of traffic scenarios, we devise an adaptive spatial Transformer and combine it with ResNet50 to form a robust backbone network capable of managing diverse vehicle orientations and scales. We further refine the Transformer's input using position encodings grounded on angles and distances to ensure optimal spatial information utilization. A channel-space attention mechanism is incorporated to enhance the global contextual understanding of the images. In the decoding phase, the autoregressive approach is eschewed, facilitating parallel decoding of multiple targets, and the target query embeddings are integrated for vehicle detection tasks. Empirical evaluations on the UA-DETRAC, IITM-hetra and a proprietary dataset yield mAP@0.5 scores of 96.42%, 87.82% and 98.64%, respectively, surpassing benchmarked models across various scales. Ablation experiments underscore the pivotal role of each component in achieving superior performance.

Key words: adaptive spatial transformation, Transformer, vehicle detection, channel-space attention mechanism, roadside camera

摘要：

为了提高基于路侧相机的车辆检测的准确性和效率, 研究了融合卷积神经网络(CNN)与Transformer模型的车辆检测模型。针对复杂的交通场景, 设计了自适应空间Transformer, 将其与ResNet50结合构建了可以应对车辆视角和尺度变换的主干网络。设计了基于角度和距离的位置编码, 优化Transformer模型输入, 使模型充分利用图像中的空间信息, 并采用了通道空间注意力模块, 以更好地捕获图像中的上下文信息。在解码器部分, 去除了自回归机制, 允许模型并行解码多个目标, 并引入了目标查询集嵌入, 使其更适应车辆检测任务。实验结果表明, 所提模型在UA-DETRAC、IITM-hetra和自采数据集上的mAP@0.5分别达到96.42%、87.82%和98.64%, 在所有尺寸上均超越了其他对比模型。消融实验进一步验证了各模块对性能的关键贡献。

关键词: 自适应空间变换, Transformer, 车辆检测, 通道空间注意力机制, 路侧相机

HUA Jiabao, ZHANG Jingrui, ZHU Fumin, CHEN Lu. Adaptive Spatial Transformation Method for Vehicle Detection Based on Roadside Cameras[J]. Computer Engineering, 2025, 51(6): 349-359.

华家宝, 张京瑞, 朱福民, 陈璐. 基于路侧相机的自适应空间变换车辆检测方法[J]. 计算机工程, 2025, 51(6): 349-359.

/ Recommend / Download Citations

URL: https://www.ecice06.com/EN/10.19678/j.issn.1000-3428.0068539

https://www.ecice06.com/EN/Y2025/V51/I6/349

Figures/Tables 10

Fig.1 General framework of the model

Fig.2 Structure of the improved Transformer model

Fig.3 Schematic of distance coding and angle coding

Fig.4 Channel-space attention module

Fig.5 Example of UA-DETRAC dataset

Fig.6 Comparative experiments for different layer numbers of Encoder and Decoder

Fig.7 Visualization of vehicle detection results

References 28

1	郭宇阳, 胡伟超, 戴帅, 等. 面向路侧交通监控场景的轻量车辆检测模型. 计算机工程与应用, 2022, 58 (6): 192- 199.
	GUO Y Y , HU W C , DAI S , et al. Lightweight vehicle detection model for roadside traffic monitoring scenarios. Computer Engineering and Applications, 2022, 58 (6): 192- 199.
2	李松江, 耿兰兰, 王鹏. 基于改进Yolov4的车辆目标检测. 计算机工程, 2023, 49 (4): 272- 280. doi: 10.19678/j.issn.1000-3428.0062943
	LI S J , GENG L L , WANG P . Vehicle target detection based on improved Yolov4. Computer Engineering, 2023, 49 (4): 272- 280. doi: 10.19678/j.issn.1000-3428.0062943
3	毛其超, 贾瑞生, 左羚群, 等. 基于深度学习的交通监控视频车辆检测算法. 计算机应用与软件, 2020, 37 (9): 111-117, 164.
	MAO Q C , JIA R S , ZUO L Q , et al. A traffic surveillance video vehicle detection method based on deep learning. Computer Applications and Software, 2020, 37 (9): 111-117, 164.
4	SUN Z , BEBIS G , MILLER R . On-road vehicle detection: a review. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2006, 28 (5): 694- 711. doi: 10.1109/TPAMI.2006.104
5	WANG Z , ZHAN J , DUAN C , et al. A review of vehicle detection techniques for intelligent vehicles. IEEE Transactions on Neural Networks and Learning Systems, 2023, 34 (8): 3811- 3831. doi: 10.1109/TNNLS.2021.3128968
6	MAJOR B, FONTIJNE D, ANSARI A, et al. Vehicle detection with automotive radar using deep learning on range-azimuth-Doppler tensors[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops. Washington D.C., USA: IEEE, 2019.
7	ANTONY J J , SUCHETHA M . Vision based vehicle detection: a literature review. International Journal of Applied Engineering Research, 2016, 11 (5): 3128- 3133.
8	SUN Z , BEBIS G , MILLER R . Monocular precrash vehicle detection: features and classifiers. IEEE Transactions on Image Processing, 2006, 15 (7): 2019- 2034. doi: 10.1109/TIP.2006.877062
9	WEN X Z , SHAO L , FANG W , et al. Efficient feature selection and classification for vehicle detection. IEEE Transactions on Circuits and Systems for Video Technology, 2015, 25 (3): 508- 517. doi: 10.1109/TCSVT.2014.2358031
10	HASSABALLAH M , KENK M A , MUHAMMAD K , et al. Vehicle detection and tracking in adverse weather using a deep learning framework. IEEE Transactions on Intelligent Transportation Systems, 2020, 22 (7): 4230- 4242.
11	李琳辉, 伦智梅, 连静, 等. 基于卷积神经网络的道路车辆检测方法. 吉林大学学报(工学版), 2017, 47 (2): 384- 391.
	LI L H , LUN Z M , LIAN J , et al. A convolutional neural network based approach for road vehicle detection. Journal of Jilin University(Engineering and Technology Edition), 2017, 47 (2): 384- 391.
12	WANG H , YU Y , CAI Y , et al. A comparative study of state-of-the-art deep learning algorithms for vehicle detection. IEEE Intelligent Transportation Systems Magazine, 2019, 11 (2): 82- 95. doi: 10.1109/MITS.2019.2903518
13	SONG H , LIANG H , LI H , et al. Vision-based vehicle detection and counting system using deep learning in highway scenes. European Transport Research Review, 2019, 11 (1): 1- 16. doi: 10.1186/s12544-018-0328-2
14	李凯, 林宇舜, 吴晓琳, 等. 基于多尺度融合与注意力机制的小目标车辆检测. 浙江大学学报(工学版), 2022, 56 (11): 2241- 2250. doi: 10.3785/j.issn.1008-973X.2022.11.015
	LI K , LIN Y S , WU X L , et al. Small-target vehicle detection based on multi-scale fusion and attention mechanism. Journal of Zhejiang University(Engineering Science), 2022, 56 (11): 2241- 2250. doi: 10.3785/j.issn.1008-973X.2022.11.015
15	GOMAA A , MINEMATSU T , ABDELWAHAB M M , et al. Faster CNN-based vehicle detection and counting strategy for fixed camera scenes. Multimedia Tools and Applications, 2022, 81 (18): 25443- 25471. doi: 10.1007/s11042-022-12370-9
16	FAN Q, BROWN L, SMITH J. A closer look at Faster R-CNN for vehicle detection[C]// Proceedings of 2016 IEEE Intelligent Vehicles Symposium (Ⅳ). Washington D.C., USA: IEEE, 2016.
17	HAUSLER S, GARG S, CHAKRAVARTY P, et al. DisPlacing objects: improving dynamic vehicle detection via visual place recognition under adverse conditions[EB/OL]. [2023-09-10]. http://arxiv.org/abs/2306.17536.
18	DESHMUKH P , SATYANARAYANA G S R , MAJHI S , et al. Swin Transformer based vehicle detection in undisciplined traffic environment. Expert Systems with Applications, 2023, 213, 118992. doi: 10.1016/j.eswa.2022.118992
19	HENDRYCKS D, GIMPEL K. Gaussian error Linear Units (GeLUs)[EB/OL]. [2023-09-10]. http://arxiv.org/abs/1606.08415.
20	KINGMA D P, BA J. Adam: a method for stochastic optimization[EB/OL]. [2023-09-10]. http://arxiv.org/abs/1412.6980.
21	WEN L , DU D , CAI Z , et al. UA-DETRAC: a new benchmark and protocol for multi-object detection and tracking. Computer Vision and Image Understanding, 2020, 193, 102907. doi: 10.1016/j.cviu.2020.102907
22	DEEPAK M, AVINASH R, GITAKRISHNAN R, et al. Training a deep learning architecture for vehicle detection using limited heterogeneous traffic data[C]//Proceedings of 201810th International Conference on Communication Systems & Networks (COMSNETS). Washington D.C., USA: IEEE, 2018.
23	GIRSHICK R. Fast R-CNN[C]//Proceedings of the IEEE International Conference on Computer Vision. Washington D.C., USA: IEEE, 2015: 1440-1448.
24	NORKOBIL S S , ABDUSALOMOV A , JAMIL M K , et al. A YOLOv6-based improved fire detection approach for smart city environments. Sensors, 2023, 23 (6): 3161. doi: 10.3390/s23063161
25	DUAN K, BAI S, XIE L, et al. CenterNet: keypoint triplets for object detection[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. Washington D.C., USA: IEEE, 2019: 6569-6578.
26	CARION N, MASSA F, SYNNAEVE G, et al. End-to-end object detection with transformers[C]//Proceedings of European Conference on Computer Vision. Berlin, Germany: Springer International Publishing, 2020: 213-229.
27	ZHU X, SU W, LU L, et al. Deformable DETR: deformable Transformers for end-to-end object detection[EB/OL]. [2023-09-10]. http://arxiv.org/abs/2010.04159.
28	YAO J W, LI C M, SUN K Q, et al. NDC-scene: boost monocular 3D semantic scene completion in normalized device coordinates space[C]//Proceedings of 2023 IEEE/CVF International Conference on Computer Vision (ICCV). Washington D.C., USA: IEEE, 2023.

[1]	ZHANG Rui, ZHANG Xueying, CHEN Guijun, HUANG Lixia. Emotion Recognition in EEG Based on Granger Causality and Brain Regions Frequency Bands Transformer Model [J]. Computer Engineering, 2025, 51(6): 311-319.
[2]	DENG Zexian, ZHANG Yungui, ZHANG Lin. Research on Multi-Dimensional Time Series Classification Based on the Pre-Trained Recursive Transformer-Mixer [J]. Computer Engineering, 2025, 51(5): 154-165.
[3]	CHEN Ziyan, WANG Xiaolong, HE Di, AN Guocheng. Lightweight Vehicle Detection Network Based on Improved YOLOv8 [J]. Computer Engineering, 2025, 51(5): 314-325.
[4]	SUN Ziwen, QIAN Lizhi, YUAN Guanglin, YANG Chuandong, LING Chong. Transformer Object Tracking Method Based on Real-Time Dynamic Template Update [J]. Computer Engineering, 2025, 51(4): 158-168.
[5]	ZHANG Anqin, DING Zhifeng. Network Anomaly Detection Integrating Dynamic Graph Embedding and Transformer Autoencoder [J]. Computer Engineering, 2025, 51(4): 47-56.
[6]	GAO Rui, AN Guocheng, ZOU Danping, PEI Ling. Semi-Supervised Vehicle Detection Algorithm Based on Improved YOLOv5 [J]. Computer Engineering, 2025, 51(3): 300-309.
[7]	AN Guocheng, WANG Xiaolong, JIANG Bo, XING Jian. Prohibited Parking Detection Algorithm for Highway Service Area in Complex Environment [J]. Computer Engineering, 2025, 51(2): 356-364.
[8]	WANG Yang, SONG Shijia, WANG Heqin, YUAN Zhenyu, ZHAO Lijun, WU Qilin. Estimation of Local Illumination Consistency Based on Improved Vision Transformer [J]. Computer Engineering, 2025, 51(2): 312-321.
[9]	YANG Hongju, JI Chang. Research on Learning-Driven Image Compression Algorithm [J]. Computer Engineering, 2025, 51(1): 190-197.
[10]	LIU Zhong, TANG Hong, WANG Ningzhe, ZHU Chuanrun. Text Summarization Method Incorporating RNN and Sparse Self-Attention [J]. Computer Engineering, 2025, 51(1): 312-320.
[11]	ZHOU Yu, XIE Wei, Kwong Tak Wu, JIANG Jianmin. Reconstruction of Video Snapshot Compressive Imaging Based on Triple Self-Attention [J]. Computer Engineering, 2025, 51(1): 20-30.
[12]	XIAO Chaoen, LI Zifan, ZHANG Lei, WANG Jianxin, QIAN Siyuan. Differential Cryptanalysis Based on Transformer Model and Attention Mechanism [J]. Computer Engineering, 2025, 51(1): 156-163.
[13]	LI Junyi, LI Xiangyang, LONG Chaoxun, LI Haiyan, LI Hongsong, YU Pengfei. Wild Mushroom Classification Based on Multi-level Region Selection and Cross-layer Feature Fusion [J]. Computer Engineering, 2024, 50(9): 179-188.
[14]	QU Xiaoya, LI Bing, WEN Liqiang. Research on Event Extraction for Administrative Law Enforcement Case Texts [J]. Computer Engineering, 2024, 50(9): 63-71.
[15]	WANG Yanguo, LÜ Pengyuan, LAN Jinjiang, LIU Mingzhe, QIN Guanjun, ZHANG Shuohua, ZHOU Yu. Wind Turbine Fault Classification Method Based on Adversarial Training and Transformer [J]. Computer Engineering, 2024, 50(9): 377-384.

Please choose a citation manager

Content to export