Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering ›› 2022, Vol. 48 ›› Issue (12): 281-287,295. doi: 10.19678/j.issn.1000-3428.0063136

• Development Research and Engineering Application • Previous Articles     Next Articles

Channel and Spatial Dual-Attention Network for Person Re-Identification

ZENG Tao, XUE Feng, YANG Tian   

  1. School of Computer and Information, Hefei University of Technology, Hefei 230601, China
  • Received:2021-11-04 Revised:2021-12-14 Published:2021-12-20

面向行人重识别的通道与空间双重注意力网络

曾涛, 薛峰, 杨添   

  1. 合肥工业大学 计算机与信息学院, 合肥 230601
  • 作者简介:曾涛(1997—),男,硕士研究生,主研方向为行人重识别;薛峰(通信作者),教授、博士;杨添,硕士研究生。
  • 基金资助:
    国家自然科学基金(61772170)。

Abstract: To address the challenge in obtaining the discriminative features of pedestrians in actual scenes due to changes in camera angle, pedestrian postures, object occlusions, low image resolutions, and misaligned pedestrian images, a Hybrid Pooling Channel Attention Module(HPCAM) and a Full Pixel Spatial Attention Module (FPSAM) are designed.Based on these two attention modules, a Channel and Spatial Dual-Attention Network (CSDA-Net) is proposed.The HPCAM module suppresses the interference of meaningless information and enhances the expression of salient features in the channel dimension to extract highly discriminative pedestrian features.The FPSAM module enhances the discrimination ability of pedestrian features in the spatial dimension and then improves the accuracy of person Re-Identification(ReID).By integrating the HPCAM and FPSAM modules into the traditional person ReID depth model framework in stages, attention features ranging from rough to fine ones are obtained.Experimental results show that the Rank-1 accuracies of CSDA-Net on mainstream datasets CUHK03, DukeMTMC-ReID, and Market1501 in the field of person ReID are 78.3%, 91.3%, and 96.0%, respectively, and its mean Average Precision(mAP) values are 80.0%, 82.1%, and 90.4%, respectively.Compared with MGN networks, the three networks mentioned above show higher Rank-1 accuracies by 14.9, 2.6, and 0.3 percentage points, respectively, and higher mAP values by 13.7, 3.7, and 3.5 percentage points, respectively.This indicates that the CSDA-Net can extract more robust and discriminative expression features.

Key words: person Re-Identification(ReID), dual-attention mechanism, pedestrian features, deep learning, mean Average Precision(mAP)

摘要: 针对现实场景下因受到摄像机视角变化、行人姿态变化、物体遮挡、图像低分辨率、行人图片未对齐等因素影响导致行人判别性特征难以获取的问题,设计混合池通道注意模块(HPCAM)和全像素空间注意力模块(FPSAM),并基于这两种注意力模块提出一种通道与空间双重注意力网络(CSDA-Net)。HPCAM模块能够在通道维度上抑制无用信息的干扰,增强显著性特征的表达,以提取得到判别性强的行人特征。FPSAM模块在空间维度上增强行人特征的判别能力,从而提高行人重识别的准确率。通过在传统行人重识别深度模型框架中分阶段融入HPCAM模块和FPSAM模块,获得由粗糙到细粒度的注意力特征。实验结果表明,CSDA-Net网络在行人重识别主流数据集CUHK03、DukeMTMC-ReID和Market1501上的Rank-1准确率分别为78.3%、91.3%和96.0%,平均精度均值(mAP)分别为80.0%、82.1%和90.4%,与MGN网络相比,Rank-1准确率分别提升14.9、2.6和0.3个百分点,mAP分别提升13.7、3.7和3.5个百分点,能够提取更具鲁棒性和判别性的表达特征。

关键词: 行人重识别, 双重注意力机制, 行人特征, 深度学习, 平均精度均值

CLC Number: