作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2022, Vol. 48 ›› Issue (4): 240-246. doi: 10.19678/j.issn.1000-3428.0060912

• 图形图像处理 • 上一篇    下一篇

结合注意力机制与特征融合的静态手势识别算法

胡宗承1, 周亚同1, 史宝军2, 何昊1   

  1. 1. 河北工业大学 电子信息工程学院, 天津 300401;
    2. 河北工业大学 机械工程学院, 天津 300401
  • 收稿日期:2021-02-23 修回日期:2021-04-27 发布日期:2021-05-08
  • 作者简介:胡宗承(1993—),男,硕士研究生,主研方向为计算机视觉、图像处理;周亚同(通信作者)、史宝军,教授、博士、博士生导师;何昊,讲师、博士。
  • 基金资助:
    国家重点研发计划“智能机器人”专项子课题(2019YFB1312102);河北省自然科学基金(F2019202364)。

Static Gesture Recognition Algorithm Based on Attention Mechanism and Feature Fusion

HU Zongcheng1, ZHOU Yatong1, SHI Baojun2, HE Hao1   

  1. 1. School of Electronics and Information Engineering, Hebei University of Technology, Tianjin 300401, China;
    2. School of Mechanical Engineering, Hebei University of Technology, Tianjin 300401, China
  • Received:2021-02-23 Revised:2021-04-27 Published:2021-05-08

摘要: 卷积神经网络在手势识别领域应用广泛,但现有的卷积神经网络存在特征表征不足的问题,导致手势识别精度较低。提出一种轻量级静态手势识别算法r-mobilenetv2,通过串联通道注意力与空间注意力,将两者输出的特征图以跳跃连接的形式线性相加,得到一种全新的注意力机制。使用一维卷积调整低层特征的通道维度,将低级特征与经过上采样的高层特征进行空间维度匹配及通道维度匹配,并进行线性相加,其结果经卷积操作后与高层特征按通道维度连接,从而实现特征融合。在此基础上,将所提注意力机制与特征融合相结合,并用于改进后的轻量级网络MobileNetV2中,得到r-mobilenetv2算法。实验结果表明,与MobileNetV2算法相比,r-mobilenetv2算法的参数量降低了27%,错误率下降了1.82个百分点。

关键词: 注意力机制, 特征融合, 手势识别, 图片分类, 轻量级网络

Abstract: Convolutional neural networks are widely used in the field of gesture recognition, but the existing convolutional neural networks have the problem of insufficient feature representation, resulting in low gesture recognition accuracy.This study proposes a lightweight static gesture recognition algorithm, r-mobilenetv2.By concatenating the channel attention and spatial attention, the output characteristic graphs of the two are linearly added in the form of a jump connection to obtain a new attention mechanism.Simultaneously, the channel dimension of the low-level features is adjusted by one-dimensional convolution.The low-level features are matched with the up-sampled high-level features in the spatial and channel dimensions, and they are added linearly.The results are connected to high-level features according to the channel dimension after convolution to realize feature fusion.On this basis, the proposed attention mechanism is combined with feature fusion and applied to the improved lightweight network MobileNetV2 to obtain the r-mobilenetv2 algorithm.The experimental results show that, compared with the MobileNetV2 algorithm, the number of parameters and error rate of the r-mobilenetv2 algorithm are reduced by 27% and 1.82 percentage points, respectively.

Key words: attention mechanism, feature fusion, hand guesture recognition, image classification, lightweight network

中图分类号: