[1] Redmon J, Divvala S, Girshick R, et al. You only look once: Unified, real-time object detection[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2016: 779-788.
[2] Carion N, Massa F, Synnaeve G, et al. End-to-end object detection with transformers[C]//European conference on computer vision. Cham: Springer International Publishing, 2020: 213-229.
[3] Zhao Y, Lv W, Xu S, et al. Detrs beat yolos on real-time object detection[C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2024: 16965-16974.
[4] Dalal N, Triggs B. Histograms of oriented gradients for human detection[C]//2005 IEEE computer society conference on computer vision and pattern recognition (CVPR'05). Ieee, 2005, 1: 886-893.
[5] Girshick R, Donahue J, Darrell T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2014: 580-587.
[6] Ren S, He K, Girshick R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks[J]. IEEE transactions on pattern analysis and machine intelligence, 2016, 39(6): 1137-1149.
[7] He K, Gkioxari G, Dollár P, et al. Mask r-cnn[C]//Proceedings of the IEEE international conference on computer vision. 2017: 2961-2969.
[8] Varghese R, Sambath M. Yolov8: A novel object detection algorithm with enhanced performance and robustness[C]//2024 International conference on advances in data engineering and intelligent computing systems (ADICS). IEEE, 2024: 1-6.
[9] Tian Y, Ye Q, Doermann D. Yolov12: Attention-centric real-time object detectors[EB/OL]. arXiv preprint arXiv:2502.12524, 2025.
[10] Yao Z, Ai J, Li B, et al. Efficient detr: improving end-to-end object detector with dense prior[J]. arXiv preprint arXiv:2104.01318, 2021.
[11] Zhu X, Su W, Lu L, et al. Deformable detr: Deformable transformers for end-to-end object detection[J]. arXiv preprint arXiv:2010.04159, 2020.
[12] Hou X, Liu M, Zhang S, et al. Relation detr: Exploring explicit position relation prior for object detection[C]//European Conference on Computer Vision. Cham: Springer Nature Switzerland, 2024: 89-105.
[13] Huang S, Lu Z, Cun X, et al. Deim: Detr with improved matching for fast convergence[C]//Proceedings of the Computer Vision and Pattern Recognition Conference. 2025: 15162-15171.
[14] 赵小虎,谢礼逊,慕灯聪,等.基于TCM-YOLO网络的金属表面缺陷检测方法[J].计算机工程,2025,51(06):338-348.
X H Zhao, L X Xie, D C Mu et al. Metal surface defect detection method based on TCM-YOLO network [J]. Computer Engineering, 2025, 51(06): 338-348.
[15] 张旭,陈慈发,董方敏,等.基于改进YOLOv7的PCB缺陷检测算法[J].计算机工程, 2024, 50(12): 318-328.
Zhang X, Cifa C, Fang M D et al. PCB defect detection algorithm based on improved YOLOv7[J]. Computer chenEngineering, 2024, 50(12): 318-328.
[16] 杨毅,桑庆兵.多尺度特征自适应融合的轻量化织物瑕疵检测[J].计算机工程,2022,48(12):288-295.
YANG Y, SANG Q B. Lightweight-fabric defect detection based on adaptive fusion of multiscale features[J]. Computer engineering, 2022, 48(12): 288-295.
[17] Vijayakumar A, Vairavasundaram S, Koilraj J A S, et al. Real-time visual intelligence for defect detection in pharmaceutical packaging[J]. Scientific Reports, 2024, 14(1): 18811.
[18] Meng W, Luo Y, Li X, et al. PolaFormer: Polarity-aware linear attention for vision transformers[J]. arXiv preprint arXiv:2501.15061, 2025.
[19] Fein-Ashley J, Gupta N, Kannan R, et al. SPECTRE: An FFT-Based Efficient Drop-In Replacement to Self-Attention for Long Contexts[J]. arXiv preprint arXiv:2502.18394, 2025.
[20] Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need[EB/OL]. Advances in neural information processing systems, 2017, 30.
[21] Yang J, Liu S, Wu J, et al. Pinwheel-shaped convolution and scale-based dynamic loss for infrared small target detection[C]//Proceedings of the AAAI Conference on Artificial Intelligence. 2025, 39(9): 9202-9210.
[22] Xu S, Zheng S, Xu W, et al. Hcf-net: Hierarchical context fusion network for infrared small object detection[C]//2024 IEEE International Conference on Multimedia and Expo (ICME). IEEE, 2024: 1-6.
[23] Du D, Zhu P, Wen L, et al. VisDrone-DET2019: The vision meets drone object detection in image challenge results[C]//Proceedings of the IEEE/CVF international conference on computer vision workshops. 2019: 0-0.
[24] Wang A, Chen H, Liu L, et al. Yolov10: Real-time end-to-end object detection[J]. Advances in Neural Information Processing Systems, 2024, 37: 107984-108011.
[25] Khanam R, Hussain M. Yolov11: An overview of the key architectural enhancements[J]. arXiv preprint arXiv:2410.17725, 2024.
[26] Tian Y, Ye Q, Doermann D. Yolov12: Attention-centric real-time object detectors[J]. arXiv preprint arXiv:2502.12524, 2025.
[27] Zhang H, Liu K, Gan Z, et al. UAV-DETR: efficient end-to-end object detection for unmanned aerial vehicle imagery[J]. arXiv preprint arXiv:2501.01855, 2025.
|