[1] KRIZHEVSKY A,SUTSKEVER I,HINTON G E.ImageNet classification with deep convolutional neural networks[J].Communications of the ACM,2017,60(6):84-90. [2] REDMON J,DIVVALA S,GIRSHICK R,et al.You only look once:unified,real-time object detection[C]//Proceedings of Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2016:779-788. [3] LONG J,SHELHAMER E,DARRELL T.Fully convolutional networks for semantic segmentation[C]//Proceedings of Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2015:3431-3440. [4] YUAN Y S,CHENG H,SESTER M.Keypoints-based deep feature fusion for cooperative vehicle detection of autonomous driving[J].IEEE Robotics and Automation Letters,2022,7(2):3054-3061. [5] NAUMOV M,MUDIGERE D,SHI H J M,et al.Deep learning recommendation model for personalization and recommendation systems[EB/OL].[2022-03-05].https://arxiv.org/abs/1906.00091v1. [6] HATAMIZADEH A,TANG Y C,NATH V,et al.UNETR:Transformers for 3D medical image segmentation[C]//Proceedings of Winter Conference on Applications of Computer Vision.Washington D.C.,USA:IEEE Press,2022:1748-1758. [7] DENG J,DONG W,SOCHER R,et al.ImageNet:a large-scale hierarchical image database[C]//Proceedings of Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2009:248-255. [8] ZHOU B L,LAPEDRIZA A,KHOSLA A,et al.Places:a 10 million image database for scene recognition[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2018,40(6):1452-1464. [9] SUN C,SHRIVASTAVA A,SINGH S,et al.Revisiting unreasonable effectiveness of data in deep learning era[C]//Proceedings of International Conference on Computer Vision.Washington D.C.,USA:IEEE Press,2017:843-852. [10] PENG Y X,HE X T,ZHAO J J.Object-part attention model for fine-grained image classification[J].IEEE Transactions on Image Processing,2018,27(3):1487-1500. [11] 桂江生,麻陈飞,包晓安,等.递归深度混合关注网络的细粒度图像分类方法[J].计算机工程,2019,45(5):205-209. GUI J S,MA C F,BAO X A,et al.Fine-grained image classification method for recurrent deep hybrid attention network[J].Computer Engineering,2019,45(5):205-209.(in Chinese) [12] 谭润,叶武剑,刘怡俊.结合双语义数据增强与目标定位的细粒度图像分类[J].计算机工程,2022,48(2):237-242,249. TAN R,YE W J,LIU Y J.Fine-grained image classification combining dual semantic data augmentation and target location[J].Computer Engineering,2022,48(2):237-242,249.(in Chinese) [13] LIN T Y,ROYCHOWDHURY A,MAJI S.Bilinear CNN models for fine-grained visual recognition[C]//Proceedings of International Conference on Computer Vision.Washington D.C.,USA:IEEE Press,2016:1449-1457. [14] YU C J,ZHAO X Y,ZHENG Q,et al.Hierarchical bilinear pooling for fine-grained visual recognition[C]//Proceedings of European Conference on Computer Vision.Berlin,Germany:Springer,2018:574-589. [15] WANG J Z,LI N Y,LUO Z M,et al.High-order-interaction for weakly supervised fine-grained visual categorization[J].Neurocomputing,2021,464:27-36. [16] KONG S,FOWLKES C.Low-rank bilinear pooling for fine-grained classification[C]//Proceedings of Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2017:365-374. [17] LIN T Y,MAJI S.Improved bilinear pooling with CNNs[EB/OL].[2022-03-05].https://arxiv.org/pdf/1707.06772.pdf. [18] MIN S B,XIE H T,TIAN Y L,et al.Adaptive bilinear pooling for fine-grained representation learning[C]//Proceedings of Multimedia Asia.New York,USA:ACM Press,2019:1-6. [19] LI P H,XIE J T,WANG Q L,et al.Towards faster training of global covariance pooling networks by iterative matrix square root normalization[C]//Proceedings of Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2018:947-955. [20] MIN S B,YAO H T,XIE H T,et al.Multi-objective matrix normalization for fine-grained visual recognition[J].IEEE Transactions on Image Processing,2020,29:4996-5009. [21] HU J,SHEN L,SUN G.Squeeze-and-excitation networks[C]//Proceedings of Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2018:7132-7141. [22] GAO Z L,XIE J T,WANG Q L,et al.Global second-order pooling convolutional networks[C]//Proceedings of Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2020:3019-3028. [23] SUN M,YUAN Y C,ZHOU F,et al.Multi-attention multi-class constraint for fine-grained image recognition[C]//Proceedings of the European Conference on Computer Vision.Berlin,Germany:Springer,2018:834-850. [24] ZHENG H L,FU J L,ZHA Z J,et al.Looking for the devil in the details:learning trilinear attention sampling network for fine-grained image recognition[C]//Proceedings of Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2020:5007-5016. [25] YANG Z,LUO T G,WANG D,et al.Learning to navigate for fine-grained classification[C]//Proceedings of European Conference on Computer Vision.Berlin,Germany:Springer,2018:438-454. [26] TAN M,WANG G J,ZHOU J,et al.Fine-grained classification via hierarchical bilinear pooling with aggregated slack mask[J].IEEE Access,2019,7:117944-117953. [27] WAH C,BRANSON S,WELINDER P,et al.The Caltech-UCSD Birds-200-2011 dataset[EB/OL].[2022-03-05].http://authors.library.caltech.edu/27452/1/CUB_200_2011.pdf. [28] KRAUSE J,STARK M,DENG J,et al.3D object representations for fine-grained categorization[C]//Proceedings of International Conference on Computer Vision Workshops.Washington D.C.,USA:IEEE Press,2013:554-561. [29] MAJI S,RAHTU E,KANNALA J,et al.Fine-grained visual classification of aircraft[EB/OL].[2022-03-05].https://arxiv.org/pdf/1306.5151.pdf. [30] HE K M,ZHANG X Y,REN S Q,et al.Deep residual learning for image recognition[C]//Proceedings of Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2016:770-778. [31] SIMONYAN K,ZISSERMAN A.Very deep convolutional networks for large-scale image recognition[EB/OL].[2022-03-05].https://arxiv.org/pdf/1409.1556.pdf. |