Person Re-Identification Method with Multi-Scale Contrast Pooling Feature

Computer Engineering, 2022, Vol. 48, Issue (4): 292-298. doi: 10.19678/j.issn.1000-3428.0061508

LIU Xiaorong¹, LI Xiaoxia¹,², QIN Changhui¹

1. College of Information Engineering, Southwest University of Science and Technology, Mianyang, Sichuan 621000, China;
2. Sichuan Province Key Laboratory of Robotics in Special Environment, Mianyang, Sichuan 621010, China

Received: 2021-06-10 Revised: 2021-08-03 Published: 2022-04-14
About the authors: LIU Xiaorong (born 1997), female, master's student, whose main research interests are deep learning and pattern recognition; LI Xiaoxia (corresponding author), professor, Ph.D.; QIN Changhui, master's student.
Funding: National Natural Science Foundation of China (61771411); Sichuan Science and Technology Program (2019YJ0449, 2021YFG0383).

Abstract: Person re-identification uses computer vision to determine whether a specific person appears in an image or video sequence. Because of variations in pose, occlusion, and illumination, the features used by traditional person re-identification methods have limited expressive power, which reduces accuracy. To address this, a person re-identification method that fuses contrast pooling features at multiple scales is proposed. The residual network ResNet50 extracts multi-scale features from person images. At different levels of the network, global average pooling and global max pooling are applied to the input features; each pair of average-pooled and max-pooled features is subtracted, and the resulting difference feature is added to the max-pooled feature to obtain a highly discriminative contrast pooling feature. On this basis, the model is jointly optimized with triplet loss and cross-entropy loss to improve its generalization ability, and re-ranking is used to further improve performance. Experimental results show that the method achieves Rank-1 accuracies of 96.41% and 91.43% and mean average precision (mAP) values of 94.52% and 89.30% on the Market1501 and DukeMTMC-reID datasets, respectively, which is more accurate than methods such as SVDNet, GLAD, and PCB.

Keywords: person re-identification, multi-scale feature, contrast pooling feature, feature fusion, deep learning
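
The contrast pooling step described in the abstract can be illustrated with a minimal sketch in plain Python. It is not the authors' implementation. Several details are assumptions made only for illustration: the abstract does not state the sign of the subtraction, so the difference is taken as max minus average here (with the opposite sign and no intermediate layer, the sum would reduce to the average-pooled feature); any convolution or normalization layers between the steps are omitted; and fusion across scales is assumed to be plain concatenation. The names `contrast_pool` and `fuse_scales` are hypothetical.

```python
from typing import List

# A feature map is stored as channels x height x width nested lists.
FeatureMap = List[List[List[float]]]


def global_avg_pool(fmap: FeatureMap) -> List[float]:
    """Average each channel over all spatial positions."""
    return [sum(sum(row) for row in ch) / (len(ch) * len(ch[0])) for ch in fmap]


def global_max_pool(fmap: FeatureMap) -> List[float]:
    """Take the maximum of each channel over all spatial positions."""
    return [max(max(row) for row in ch) for ch in fmap]


def contrast_pool(fmap: FeatureMap) -> List[float]:
    """Add the pooled difference back onto the max-pooled feature.

    Assumption: the difference is (max - avg); the abstract does not
    specify the order of the subtraction.
    """
    avg = global_avg_pool(fmap)
    mx = global_max_pool(fmap)
    diff = [m - a for m, a in zip(mx, avg)]
    return [m + d for m, d in zip(mx, diff)]


def fuse_scales(fmaps: List[FeatureMap]) -> List[float]:
    """Assumption: features from different network stages are concatenated."""
    fused: List[float] = []
    for fmap in fmaps:
        fused.extend(contrast_pool(fmap))
    return fused


if __name__ == "__main__":
    # Two channels of a 2 x 2 feature map, standing in for one ResNet50 stage.
    stage = [[[1.0, 3.0], [2.0, 2.0]], [[0.0, 0.0], [0.0, 4.0]]]
    print(contrast_pool(stage))
    print(fuse_scales([stage, stage]))
```

In a real network the feature maps would come from different ResNet50 stages and have different channel counts, so the fused vector would simply be longer than in this toy input.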

CLC number: TP391.41
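
The joint optimization with triplet loss and cross-entropy loss mentioned in the abstract can be sketched for a single training triplet as below. This is a hedged illustration, not the paper's training code: the margin of 0.3, the equal weighting of the two terms, the Euclidean distance, and the function names are all assumptions, since the abstract does not give them.

```python
import math
from typing import Sequence


def cross_entropy(logits: Sequence[float], label: int) -> float:
    """Softmax cross-entropy for one sample, using the log-sum-exp trick."""
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[label]


def euclidean(a: Sequence[float], b: Sequence[float]) -> float:
    """Euclidean distance between two feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def triplet_loss(anchor: Sequence[float], positive: Sequence[float],
                 negative: Sequence[float], margin: float = 0.3) -> float:
    """Hinge on d(anchor, positive) - d(anchor, negative) + margin.

    Assumption: margin = 0.3 is an illustrative default only.
    """
    return max(0.0, euclidean(anchor, positive) - euclidean(anchor, negative) + margin)


def joint_loss(logits: Sequence[float], label: int,
               anchor: Sequence[float], positive: Sequence[float],
               negative: Sequence[float],
               margin: float = 0.3, weight: float = 1.0) -> float:
    """Assumption: the two losses are combined as a weighted sum."""
    return cross_entropy(logits, label) + weight * triplet_loss(
        anchor, positive, negative, margin)


if __name__ == "__main__":
    loss = joint_loss(
        logits=[2.0, 0.0], label=0,
        anchor=[0.0, 0.0], positive=[0.1, 0.0], negative=[3.0, 4.0],
    )
    print(loss)
```

The cross-entropy term pushes features toward the correct identity class, while the triplet term pulls images of the same person together and pushes different people apart in feature space; combining the two is a common choice in re-identification training.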

Cite this article: LIU Xiaorong, LI Xiaoxia, QIN Changhui. Person Re-Identification Method with Multi-Scale Contrast Pooling Feature[J]. Computer Engineering, 2022, 48(4): 292-298.

http://www.ecice06.com/CN/Y2022/V48/I4/292


