作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2022, Vol. 48 ›› Issue (2): 201-206. doi: 10.19678/j.issn.1000-3428.0060416

• 图形图像处理 • 上一篇    下一篇

联合语义分割与注意力机制的行人再识别模型

周东明1, 张灿龙1, 唐艳平2, 李志欣1   

  1. 1. 广西师范大学 广西多源信息挖掘与安全重点实验室, 广西 桂林 541004;
    2. 桂林电子科技大学 计算机与信息安全学院, 广西 桂林 541006
  • 收稿日期:2020-12-28 修回日期:2021-02-18 发布日期:2021-03-04
  • 作者简介:周东明(1995-),男,硕士研究生,主研方向为图像处理与模式识别、深度学习、机器学习;张灿龙(通信作者),教授、博士、博士生导师;唐艳平,讲师;李志欣,教授、博士、博士生导师。
  • 基金资助:
    国家自然科学基金(61866004,61966004,61962007);广西自然科学基金(2018GXNSFDA281009,2019GXNSFDA245018,2018GXNSFDA29400);广西“八桂学者”创新研究团队项目;广西多源信息挖掘与安全重点实验室基金(20-A-03-01);广西研究生教育创新计划项目(XYCSZ2020071)。

Pedestrian Re-Identification Model Combining Semantic Segmentation and Attention Mechanism

ZHOU Dongming1, ZHANG Canlong1, TANG Yanping2, LI Zhixin1   

  1. 1. Guangxi Key Laboratory of Multi-Source Information Mining and Security, Guangxi Normal University, Guilin, Guangxi 541004, China;
    2. School of Computer and Information Security, Guilin University of Electronic Science and Technology, Guilin, Guangxi 541006, China
  • Received:2020-12-28 Revised:2021-02-18 Published:2021-03-04

摘要: 受行人姿态变化、光照视角、背景变换等因素的影响,现有行人再识别模型通常对数据集中的行人分成若干块提取图像的局部特征进行辨识以提高识别精度,但存在人体局部特征不匹配、容易丢失非人体部件的上下文线索等问题。构建一种改进的行人再识别模型,通过将人体语义解析网络的局部特征进行对齐,增强行人语义分割模型对图像中行人任意轮廓的建模能力,利用局部注意力网络捕捉非人体部分丢失的语境线索。实验结果表明,该模型在Market-1501、DukeMTMC和CUHK03数据集上的平均精度均值分别达到83.5%、80.8%和92.4%,在DukeMTMC数据集上的Rank-1为90.2%,相比基于注意力机制、行人语义解析和局部对齐网络的行人再识别模型具有更强的鲁棒性和迁移性。

关键词: 人体语义解析网络, 局部注意力网络, 行人再识别, 局部对齐网络, 深度学习

Abstract: Pedestrian identification results are easily affected by pedestrian posture changes, illumination perspective, background transformation and other factors.To reduce such interference, the existing pedestrian re-identification models usually divide the pedestrians in a dataset into several pieces to extract the local features of the image and improve the identification accuracy, but this also presents new problems such as the mismatch between local features of the human body and the loss of contextual clues of non-human parts.In order to solve the above problems, an improved pedestrian re-identification model is proposed.By aligning the local features of the human semantic parsing network, the semantic segmentation model can perform better in modeling arbitrary contours of pedestrians in the image.The local attention network is also used to capture the lost contextual clues of non-human body parts.The experimental results show that the proposed model displays an average accuracy of 83.5% on Market-1501, 80.8% on DukeMTMC, and 92.4% on CUHK03.The Rank-1 value on the DukeMTMC dataset is 90.2%.Compared with the pedestrian re-identification models based on attention mechanism, pedestrian semantic parsing network or Partial Alignment Network(PAN), the proposed model has higher robustness and mobility.

Key words: human semantic parsing network, partial attention network, person re-identification, Partial Alignment Network(PAN), deep learning

中图分类号: