Pedestrian Re-Identification Model Combining Semantic Segmentation and Attention Mechanism

doi:10.19678/j.issn.1000-3428.0060416

Abstract

Abstract: Pedestrian re-identification results are easily affected by changes in pedestrian posture, illumination, viewing angle, and background. To reduce such interference, existing pedestrian re-identification models usually divide each pedestrian image in a dataset into several blocks and extract local features to improve identification accuracy, but this introduces new problems, such as mismatched local features of the human body and the loss of contextual clues from non-human parts. To address these problems, an improved pedestrian re-identification model is proposed. By aligning the local features of the human semantic parsing network, the model strengthens the ability of the pedestrian semantic segmentation model to represent arbitrary pedestrian contours in an image. A local attention network is also used to capture the contextual clues from non-human parts that would otherwise be lost. Experimental results show that the proposed model achieves a mean Average Precision (mAP) of 83.5% on Market-1501, 80.8% on DukeMTMC, and 92.4% on CUHK03, and a Rank-1 accuracy of 90.2% on the DukeMTMC dataset. Compared with pedestrian re-identification models based on attention mechanisms, pedestrian semantic parsing, or the Partial Alignment Network (PAN), the proposed model shows stronger robustness and transferability.

Key words: human semantic parsing network, partial attention network, person re-identification, Partial Alignment Network (PAN), deep learning

CLC number: TP391

ZHOU Dongming, ZHANG Canlong, TANG Yanping, LI Zhixin. Pedestrian Re-Identification Model Combining Semantic Segmentation and Attention Mechanism[J]. Computer Engineering, 2022, 48(2): 201-206.

https://www.ecice06.com/CN/Y2022/V48/I2/201

Figures/Tables: 4

