作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程

• 图形图像处理 • 上一篇    下一篇

基于训练图CNN特征的视频人体动作识别算法

曹晋其1,蒋兴浩1,2,孙锬锋1,2   

  1. (1.上海交通大学 电子信息与电气工程学院,上海 200240; 2.信息内容分析技术国家工程实验室,上海 200240)
  • 收稿日期:2016-10-18 出版日期:2017-11-15 发布日期:2017-11-15
  • 作者简介:曹晋其(1992—),男,硕士,主研方向为图形图像处理;蒋兴浩,教授、博士;孙锬锋,副教授、博士。
  • 基金资助:
    国家自然科学基金(61272439,61272249)。

Video Human Action Recognition Algorithm Based on Trained Image CNN Features

CAO Jinqi  1,JIANG Xinghao  1,2,SUN Tanfeng  1,2   

  1. (1.School of Electronic Information and Electrical Engineering,Shanghai Jiaotong University,Shanghai 200240,China; 2.National Engineering Laboratory for Information Content Analysis Technology,Shanghai 200240,China)
  • Received:2016-10-18 Online:2017-11-15 Published:2017-11-15

摘要: 为将卷积神经网络(CNN)应用到视频理解中,提出一种基于训练图CNN特征的识别算法。利用图像RGB数据识别视频人体动作,使用现有的CNN模型从图像中提取特征,并采用长短记忆单元的递归神经网络进行训练分类,研究CNN模型和隐层的选择、优化、特征矢量化和降维。实验结果表明,与使用图像RGB数据注意力模型的算法和组合长短期记忆模型算法相比,该算法具有更高的准确率。

关键词: 人体动作识别, 深度学习, 卷积神经网络, 递归神经网络, 记忆单元

Abstract: In order to apply Convolutional Neural Network (CNN) to video understanding,a recognition algorithm based on trained image CNN features is proposed.Image RGB data is employed to recognize human action in videos.Off-the-shelf CNN models are used to extract features from images,and classification is made by recurrent neural networks with Long Short-Term Memory (LSTM) unit.The research focuses on the choice of CNN architectures and layers,feature vectorization and dimentionality reduction.Experimental result shows that the algorithm has higher accuracy than attention model algorithm and composite LSTM algorithm using RGB data.

Key words: human action recognition, deep learning, Convolutional Neural Network(CNN), recurrent neural network, memory unit

中图分类号: