作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2025, Vol. 51 ›› Issue (1): 225-234. doi: 10.19678/j.issn.1000-3428.0068392

• 图形图像处理 • 上一篇    下一篇

结合特征融合和通道注意力的多分支换装行人重识别

胡涌涛, 黄洪琼*()   

  1. 上海海事大学信息工程学院, 上海 201306
  • 收稿日期:2023-09-17 出版日期:2025-01-15 发布日期:2024-04-16
  • 通讯作者: 黄洪琼
  • 基金资助:
    国家自然科学基金(61673259)

Multi-Branch Clothes-Changing Person Re-Identification with Feature Fusion and Channel Attention

HU Yongtao, HUANG Hongqiong*()   

  1. College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China
  • Received:2023-09-17 Online:2025-01-15 Published:2024-04-16
  • Contact: HUANG Hongqiong

摘要:

换装行人重识别(CC Re-ID)是行人重识别中的一个新兴研究课题, 旨在找出被换衣的行人。当前方法主要集中在使用多模态数据辅助解耦表征学习, 如通过脸、步态、身体轮廓等辅助数据解耦行人自身属性以减少服装影响, 但这些方法泛化能力较差, 需要大量额外工作。此外, 仅使用原始数据的方法对于相关信息的提取不够充分, 性能较弱。针对CC Re-ID存在的上述问题, 提出一种结合特征融合和通道注意力的多分支换装行人重识别方法(MBFC)。通过在主干网络中融入通道注意力机制, 在特征通道层面学习关键信息, 设计局部与全局特征融合方法以提高网络对行人细粒度特征的提取能力。此外, MBFC模型采用多分支结构, 使用服装对抗损失、交叉熵标签平滑损失等多种损失函数引导模型学习与服装无关的信息, 减少服装对模型的影响, 从而提取到更有效的行人信息。在PRCC和VC-Clothes数据集上进行广泛实验, 结果表明, 所提模型在RANK-1和平均精度均值(mAP)指标上优于对比的CC Re-ID方法。

关键词: 换装行人重识别, 多分支, 通道注意力, 特征融合, 注意力机制

Abstract:

Clothes-Changing Person Re-Identification (CC Re-ID) is an emerging research topic in person re-identification, which aims to retrieve pedestrians who have changed their clothes. To date, this task has not been thoroughly studied. Currently, the proposed methods mainly focus on using multi-modal data to assist in decoupling representation learning, such as decoupling the attributes of a pedestrian through auxiliary data such as face, gait, and body contours to reduce the influence of clothing; however, the generalization ability is poor, and additional work is needed to obtain auxiliary information. Furthermore, a method that uses only the original data is insufficient for extracting relevant information, and the performance of the model is poor. To solve the problem of CC Re-ID, a novel multi-branch CC Re-ID method combining feature fusion and channel attention, MBFC, is proposed. This method integrates the channel attention mechanism into the backbone network to learn key information at the feature channel level and designs local and global feature fusion methods to improve the ability of the network to extract fine-grained pedestrian features. In addition, the model adopts a multi-branch structure and uses multiple loss functions, such as clothing counter loss and smooth label cross-entropy loss, to guide the model in learning information unrelated to clothing, reduce the influence of clothing on the model, and thus extract more effective pedestrian information. The proposed model is extensively tested on the PRCC and VC-Clothes datasets. The experimental results indicate that the performance of the proposed model is superior to that of the most advanced CC Re-ID methods in terms of RANK-1 and mean Average Precision (mAP).

Key words: Clothes-Changing Person Re-Identification (CC Re-ID), multi-branch, channel attention, feature fusion, attention mechanism