[1] LIN T Y, GOYAL P, GIRSHICK R, et al. Focal loss for dense object detection[C]//Proceedings of IEEE International Conference on Computer Vision. Washington D.C., USA: IEEE Press, 2017: 2999-3007.
[2] REDMON J, FARHADI A. YOLO9000: better, faster, stronger[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. Washington D.C., USA: IEEE Press, 2017: 6517-6525.
[3] ZHANG H K, CHANG H, MA B P, et al. Cascade RetinaNet: maintaining consistency for single-stage object detection[EB/OL]. [2021-06-10]. https://arxiv.org/abs/1907.06881.
[4] LI Y X, REN F B. Light-weight RetinaNet for object detection[EB/OL]. [2021-06-10]. https://arxiv.org/abs/1905.10011.
[5] SUN P Z, ZHANG R F, JIANG Y, et al. Sparse R-CNN: end-to-end object detection with learnable proposals[C]//Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington D.C., USA: IEEE Press, 2021: 14449-14458.
[6] WU H Y, REN D J, LÜ Y Z, et al. Bubble detection on the surface of medical empty bottles based on improved RetinaNet[J]. Journal of Sichuan University (Natural Science Edition), 2020, 57(6): 1090-1095. (in Chinese)
[7] YAN J W, ZHANG L W, ZHAO Y, et al. Image recognition of Rosa roxburghii fruit by improved RetinaNet[J]. Journal of Chinese Agricultural Mechanization, 2021, 42(3): 78-83. (in Chinese)
[8] LIN T Y, DOLLÁR P, GIRSHICK R, et al. Feature pyramid networks for object detection[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. Washington D.C., USA: IEEE Press, 2017: 936-944.
[9] HE K M, ZHANG X Y, REN S Q, et al. Deep residual learning for image recognition[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. Washington D.C., USA: IEEE Press, 2016: 770-778.
[10] QIN Z Q, ZHANG P Y, WU F, et al. FcaNet: frequency channel attention networks[EB/OL]. [2021-06-10]. https://arxiv.org/abs/2012.11879.
[11] RUSSAKOVSKY O, DENG J, SU H, et al. ImageNet large scale visual recognition challenge[J]. International Journal of Computer Vision, 2015, 115(3): 211-252.
[12] LIU S, QI L, QIN H F, et al. Path aggregation network for instance segmentation[C]//Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington D.C., USA: IEEE Press, 2018: 8759-8768.
[13] PANG J M, CHEN K, SHI J P, et al. Libra R-CNN: towards balanced learning for object detection[C]//Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington D.C., USA: IEEE Press, 2019: 821-830.
[14] ZHENG Z H, WANG P, LIU W, et al. Distance-IoU loss: faster and better learning for bounding box regression[EB/OL]. [2021-06-10]. https://arxiv.org/abs/1911.08287.
[15] LIN T Y, MAIRE M, BELONGIE S, et al. Microsoft COCO: common objects in context[C]//Proceedings of European Conference on Computer Vision. Berlin, Germany: Springer, 2014: 740-755.
[16] EVERINGHAM M, VAN GOOL L, WILLIAMS C K I, et al. The PASCAL visual object classes challenge[J]. International Journal of Computer Vision, 2010, 88(2): 303-338.
[17] AHMED N, NATARAJAN T, RAO K R. Discrete cosine transform[J]. IEEE Transactions on Computers, 1974, 23(1): 90-93.
[18] WANG X L, GIRSHICK R, GUPTA A, et al. Non-local neural networks[EB/OL]. [2021-06-10]. https://arxiv.org/abs/1711.07971.
[19] LONG J, SHELHAMER E, DARRELL T. Fully convolutional networks for semantic segmentation[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. Washington D.C., USA: IEEE Press, 2015: 3431-3440.
[20] PASZKE A, GROSS S, CHINTALA S, et al. Automatic differentiation in PyTorch[EB/OL]. [2021-06-10]. https://openreview.net/forum?id=BJJsrmfCZ.
[21] TIAN Z, SHEN C H, CHEN H, et al. FCOS: fully convolutional one-stage object detection[C]//Proceedings of IEEE/CVF International Conference on Computer Vision. Washington D.C., USA: IEEE Press, 2019: 9626-9635.
[22] GUO C X, FAN B, ZHANG Q, et al. AugFPN: improving multi-scale feature learning for object detection[C]//Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington D.C., USA: IEEE Press, 2020: 12592-12601.
[23] CAO Y H, CHEN K, LOY C C, et al. Prime sample attention in object detection[C]//Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington D.C., USA: IEEE Press, 2020: 11580-11588.
[24] REN S Q, HE K M, GIRSHICK R, et al. Faster R-CNN: towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6): 1137-1149.
[25] HE K, GKIOXARI G, DOLLÁR P, et al. Mask R-CNN[C]//Proceedings of IEEE International Conference on Computer Vision. Washington D.C., USA: IEEE Press, 2017: 2980-2988.
[26] WANG T C, ANWER R M, CHOLAKKAL H, et al. Learning rich features at high-speed for single-shot object detection[C]//Proceedings of IEEE/CVF International Conference on Computer Vision. Washington D.C., USA: IEEE Press, 2019: 1971-1980.
[27] WANG S R, GONG Y C, XING J L, et al. RDSNet: a new deep architecture for reciprocal object detection and instance segmentation[EB/OL]. [2021-06-10]. https://arxiv.org/abs/1912.05070.
[28] WANG J Q, ZHANG W W, CAO Y H, et al. Side-aware boundary localization for more precise object detection[C]//Proceedings of European Conference on Computer Vision. Berlin, Germany: Springer, 2020: 403-419.