作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2021, Vol. 47 ›› Issue (2): 314-320. doi: 10.19678/j.issn.1000-3428.0057107

• 图形图像处理 • 上一篇    

基于多级注意力跳跃连接网络的行人属性识别

王林, 李聪会   

  1. 西安理工大学 自动化与信息工程学院, 西安 710048
  • 收稿日期:2020-01-03 修回日期:2020-02-06 出版日期:2021-02-15 发布日期:2020-02-28
  • 作者简介:王林(1962-),男,教授、博士,主研方向为数据挖掘、图像处理;李聪会,硕士研究生。
  • 基金资助:
    陕西省科技计划重点项目(2017ZDCXL-GY-05-03)。

Pedestrian Attribute Recognition Based on Multi-Level Attention Skip Connection Network

WANG Lin, LI Conghui   

  1. School of Automation and Information Engineering, Xi'an University of Technology, Xi'an 710048, China
  • Received:2020-01-03 Revised:2020-02-06 Online:2021-02-15 Published:2020-02-28

摘要: 行人属性识别易受视角、尺度和光照等非理想自然条件变化的影响,且某些细粒度属性识别难度较大。为此,提出一种多级注意力跳跃连接网络MLASC-Net。在网络中间层,利用敏感注意力模块在通道及空间维度上对原特征向量进行筛选加权,设计多级跳跃连接结构来融合所提取的显著性特征。在网络顶层,改进多尺度金字塔池化以融合局部特征和全局特征。在网络输出层,结合验证损失算法自适应更新损失层,加速模型的收敛并提高精度。在PETA和RAP数据集上的实验结果表明,MLASC-Net的识别准确率相较原基准网络分别提高约4.62和6.54个百分点,其在识别效果和模型收敛速度上有明显优势,同时在非理想自然条件下具有良好的泛化能力,可有效提高网络对细粒度属性的鲁棒性。

关键词: 行人属性识别, 多级跳跃连接网络, 敏感注意力, 多尺度金字塔, 残差网络

Abstract: Pedestrian attribute recognition is susceptible to changes in non-ideal natural conditions,including perspective,scale and lighting,which increases the difficulty in fine-grained attribute recognition.To address the problem,this paper proposes a multi-level attention skip connection network,MLASC-Net.In the middle layer of the network,the sensitive attention module is used to filter and weight the original feature vectors in the channel and spatial dimensions,and a multi-level skip connection structure is designed to fuse the extracted salient features.In the top layer of the network,multi-scale pyramid pooling is improved to integrate local features and global features.In the output layer of the network,the verification of the loss trend algorithm is used to update the loss layer adaptively,accelerate the convergence of the model and increase accuracy.Experimental results on PETA and RAP datasets show that compared with the original benchmark network,MLASC-Net improves the recognition accuracy by about 4.62 and 6.54 percentage points,respectively.It has obvious advantages in recognition effect and model convergence speed,and has good generalization ability under non-ideal natural conditions,which can effectively improve the robustness of the network to fine-grained attributes.

Key words: pedestrian attribute recognition, multi-level skip connection network, sensitive attention, multi-scale pyramid, Residual Network (ResNet)

中图分类号: