[1] WANG W X,HE Y H,CHEN G.UCaps network based on EM-routing algorithm for medical image segmentation[J].Computer Engineering,2022,48(2):268-274.(in Chinese)
[2] MU S Y,XU S G.Full-category robust license plate recognition based on character attention[J].Acta Automatica Sinica,2023,49(1):122-134.(in Chinese)
[3] ZHOU D M,ZHANG C L,TANG Y P,et al.Pedestrian re-identification model combining semantic segmentation and attention mechanism[J].Computer Engineering,2022,48(2):201-206.(in Chinese)
[4] JIANG H Y,WANG Y J,KANG J Y.A survey of object detection models and its optimization methods[J].Acta Automatica Sinica,2021,47(6):1232-1255.(in Chinese)
[5] JIANG Y,TAN Z,WANG J,et al.GiraffeDet:a heavy-neck paradigm for object detection[EB/OL].[2022-05-10].https://arxiv.org/abs/2202.04256.
[6] ZAIDI S S A,ANSARI M S,ASLAM A,et al.A survey of modern deep learning based object detection models[J].Digital Signal Processing,2022,126:103514.
[7] LI Y Q,LI C Z,LIU R Q,et al.Semi-supervised spatiotemporal Transformer networks for semantic segmentation of surgical instrument[J].Journal of Software,2022,33(4):1501-1515.(in Chinese)
[8] STRUDEL R,GARCIA R,LAPTEV I,et al.Segmenter:Transformer for semantic segmentation[C]//Proceedings of IEEE/CVF International Conference on Computer Vision.Washington D.C.,USA:IEEE Press,2022:7242-7252.
[9] XIE E,WANG W,YU Z,et al.SegFormer:simple and efficient design for semantic segmentation with Transformers[EB/OL].[2022-05-10].https://arxiv.org/abs/2105.15203.
[10] BOLYA D,ZHOU C,XIAO F,et al.YOLACT:real-time instance segmentation[EB/OL].[2022-05-10].https://arxiv.org/abs/1904.02689.
[11] WANG X L,KONG T,SHEN C H,et al.SOLO:segmenting objects by locations[C]//Proceedings of ECCV'20.Berlin,Germany:Springer,2020:649-665.
[12] PENG S D,JIANG W,PI H J,et al.Deep snake for real-time instance segmentation[C]//Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2020:8530-8539.
[13] DUAN K W,BAI S,XIE L X,et al.CenterNet:keypoint triplets for object detection[C]//Proceedings of IEEE/CVF International Conference on Computer Vision.Washington D.C.,USA:IEEE Press,2020:6568-6577.
[14] CHEN H,SUN K Y,TIAN Z,et al.BlendMask:top-down meets bottom-up for instance segmentation[C]//Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2020:8570-8578.
[15] CHEN X L,GIRSHICK R,HE K M,et al.TensorMask:a foundation for dense object segmentation[C]//Proceedings of IEEE/CVF International Conference on Computer Vision.Washington D.C.,USA:IEEE Press,2020:2061-2069.
[16] TIAN Z,SHEN C H,WANG X L,et al.BoxInst:high-performance instance segmentation with box annotations[C]//Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2021:5439-5448.
[17] HE K M,GKIOXARI G,DOLLÁR P,et al.Mask R-CNN[C]//Proceedings of IEEE International Conference on Computer Vision.Washington D.C.,USA:IEEE Press,2017:2980-2988.
[18] REN S Q,HE K M,GIRSHICK R,et al.Faster R-CNN:towards real-time object detection with region proposal networks[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,39(6):1137-1149.
[19] WANG K X,LIEW J H,ZOU Y T,et al.PANet:few-shot image semantic segmentation with prototype alignment[C]//Proceedings of IEEE/CVF International Conference on Computer Vision.Washington D.C.,USA:IEEE Press,2020:9196-9205.
[20] HUANG Z J,HUANG L C,GONG Y C,et al.Mask scoring R-CNN[C]//Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2020:6402-6411.
[21] SOFIIUK K,BARINOVA O,KONUSHIN A.AdaptIS:adaptive instance selection network[C]//Proceedings of IEEE/CVF International Conference on Computer Vision.Washington D.C.,USA:IEEE Press,2020:7354-7362.
[22] CHENG T H,WANG X G,HUANG L C,et al.Boundary-preserving Mask R-CNN[C]//Proceedings of ECCV'20.Berlin,Germany:Springer,2020:660-676.
[23] JIANG B Y,ZHANG J Y,HONG Y,et al.BCNet:learning body and cloth shape from a single image[C]//Proceedings of ECCV'20.Berlin,Germany:Springer,2020:18-35.
[24] KE L,DANELLJAN M,LI X,et al.Mask Transfiner for high-quality instance segmentation[EB/OL].[2022-05-10].https://arxiv.org/abs/2111.13673.
[25] NEVEN D,DE BRABANDERE B,PROESMANS M,et al.Instance segmentation by jointly optimizing spatial embeddings and clustering bandwidth[EB/OL].[2022-05-10].https://arxiv.org/abs/1906.11109.
[26] DING H,QIAO S Y,YUILLE A,et al.Deeply shape-guided cascade for instance segmentation[C]//Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2021:8274-8284.
[27] YUAN X D,KORTYLEWSKI A,SUN Y H,et al.Robust instance segmentation through reasoning about multi-object occlusion[C]//Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2021:11136-11145.
[28] TAN B,XUE N,BAI S,et al.PlaneTR:structure-guided Transformers for 3D plane recovery[C]//Proceedings of IEEE/CVF International Conference on Computer Vision.Washington D.C.,USA:IEEE Press,2022:4166-4175.
[29] GAO N Y,SHAN Y H,WANG Y P,et al.SSAP:single-shot instance segmentation with affinity pyramid[C]//Proceedings of IEEE/CVF International Conference on Computer Vision.Washington D.C.,USA:IEEE Press,2020:642-651.
[30] HUANG Z L,WANG X G,HUANG L C,et al.CCNet:criss-cross attention for semantic segmentation[C]//Proceedings of IEEE/CVF International Conference on Computer Vision.Washington D.C.,USA:IEEE Press,2020:603-612.
[31] ZHANG T Y,ZHANG X R,ZHU P,et al.Semantic attention and scale complementary network for instance segmentation in remote sensing images[J].IEEE Transactions on Cybernetics,2022,52(10):10999-11013.
[32] ZHANG H W,ZHANG D,GAO Z F,et al.Joint segmentation and quantification of main coronary vessels using dual-branch multi-scale attention network[C]//Proceedings of MICCAI'21.Berlin,Germany:Springer,2021:369-378.
[33] HU M,LI Y L,FANG L,et al.A2-FPN:attention aggregation based feature pyramid network for instance segmentation[C]//Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2021:15338-15347.
[34] LIN T Y,DOLLÁR P,GIRSHICK R,et al.Feature pyramid networks for object detection[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2017:936-944.
[35] WANG K Y,ZHANG L.Reconcile prediction consistency for balanced object detection[C]//Proceedings of IEEE/CVF International Conference on Computer Vision.Washington D.C.,USA:IEEE Press,2022:3611-3620.
[36] LIN T Y,MAIRE M,BELONGIE S,et al.Microsoft COCO:common objects in context[C]//Proceedings of ECCV'14.Berlin,Germany:Springer,2014:740-755.