[1] RUSSAKOVSKY O,DENG J,SU H,et al.ImageNet large scale visual recognition challenge[J].International Journal of Computer Vision,2015,115(3):211-252. [2] ZHU Rui,ZHANG Shifeng,WANG Xiaobo,et al.ScratchDet:exploring to train single-shot object detectors from scratch[EB/OL].(2018-10-19)[2019-09-01].https://arxiv.org/abs/1810.08425v3. [3] SHEN Zhiqiang,LIU Zhuang,LI Jianguo,et al.Dsod:Learning deeply supervised object detectors from scratch[C]//Proceedings of International Conference on Computer Vision.Venice,Italy:IEEE Press,2017:1937-1945. [4] SANTURKAR S,TSIPRAS D,ILYAS A,et al.How does batch normalization help optimization?[EB/OL].(2018-05-29)[2019-09-01].https://arxiv.org/abs/1805.11604. [5] LIN T Y,GOYAL P,GIRSHICK R,et al.Focal loss for dense object detection[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2020,42(2):318-327. [6] SIMONYAN K,ZISSERMAN A.Very deep convolutional networks for large-scale image recognition[EB/OL].(2014-09-04)[2019-09-01].https://arxiv.org/abs/1409.1556. [7] HE Kaiming,ZHANG Xiangyu,REN Shaoqing,et al.Deep residual learning for image recognition[C]//Proceedings of IEEE Conference on Computer Vision & Pattern Recognition. Las Vegas,USA:IEEE Press,2016:2-8. [8] FU C Y,LIU W,RANGA A,et al.DSSD:deconvolutional single shot detector[EB/OL].(2017-01-23)[2019-09-01].https://arxiv.org/abs/1701.06659. [9] EVERINGHAM M,GOOL L V,WILLIAMS C,et al.Pascal visual object classes challenge results[J].International Journal of Computer Vision,2010,88:303-307. [10] EVERINGHAM M,VAN GOOL L,WILLIAMS C K I,et al.The Pascal Visual Object Classes (VOC) challenge[J].International Journal of Computer Vision,2010,88(2):303-338. [11] SHEN Z Q,SHI H H,ROGERIO F,et al.Learning object detectors from scratch with gated recurrent feature pyramids[EB/OL].(2017-12-04)[2019-09-01].https://arxiv.org/abs/1712.00886v1. [12] IOFFE S,SZEGEDY C.Batch normalization:accelerating deep network training by reducing internal covariate shift[C]//Proceedings of International Conference on International Conference on Machine Learning.Lille,France:[s.n.],2015:21-29. [13] DAI J F,QI H Z,XIONG Y W,et al.Deformable convolutional networks[C]//Proceedings of 2017 IEEE International Conference on Computer Vision.Venice,Italy:IEEE Press,2017:764-773. [14] SHELHAMER E,LONG J,DARRELL T.Fully convolutional networks for semantic segmentation[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,39(4):640-651. [15] LI Hongyan,LI Chungeng,AN Jubai,et al.Attention mechanism improves CNN remote sensing image object detection[J].Journal of Image and Graphics,2019,24(8):1400-1408. 李红艳,李春庚,安居白,等.注意力机制改进卷积神经网络的遥感图像目标检测[J].中国图象图形学报,2019,24(8):1400-1408. [16] LIU W,ANGUELOV D,ERHAN D,et al.SSD:single shot multibox detector[C]//Proceedings of ECCV'16.Amsterdam,Holland:Springer International Publishing,2016:21-37. [17] REDMON J,FARHADI A.YOLO9000:better,faster,stronger[C]//Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition.Honolulu,USA:IEEE Press,2017. [18] REN S Q,HE K M,GIRSHICK R,et al.Faster R-CNN:towards real-time object detection with region proposal networks[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,39(6):1137-1149. [19] REN Yun,ZHU Changren,XIAO Shunping.Deformable faster R-CNN with aggregating multi-layer features for partially occluded object detection in optical remote sensing images[J].Remote Sensing,2018,10(9):1470-1478. |