基于训练图CNN特征的视频人体动作识别算法

doi:10.3969/j.issn.1000-3428.2017.11.038

计算机工程

基于训练图CNN特征的视频人体动作识别算法

曹晋其¹,蒋兴浩^1,2,孙锬锋^1,2

(1.上海交通大学电子信息与电气工程学院,上海 200240; 2.信息内容分析技术国家工程实验室,上海 200240)

收稿日期:2016-10-18 出版日期:2017-11-15 发布日期:2017-11-15
作者简介:曹晋其(1992—),男,硕士,主研方向为图形图像处理;蒋兴浩,教授、博士;孙锬锋,副教授、博士。
基金资助:
国家自然科学基金(61272439,61272249)。

Video Human Action Recognition Algorithm Based on Trained Image CNN Features

CAO Jinqi ¹,JIANG Xinghao ^1,2,SUN Tanfeng ^1,2

(1.School of Electronic Information and Electrical Engineering,Shanghai Jiaotong University,Shanghai 200240,China; 2.National Engineering Laboratory for Information Content Analysis Technology,Shanghai 200240,China)

Received:2016-10-18 Online:2017-11-15 Published:2017-11-15

摘要/Abstract

摘要： 为将卷积神经网络(CNN)应用到视频理解中,提出一种基于训练图CNN特征的识别算法。利用图像RGB数据识别视频人体动作,使用现有的CNN模型从图像中提取特征,并采用长短记忆单元的递归神经网络进行训练分类,研究CNN模型和隐层的选择、优化、特征矢量化和降维。实验结果表明,与使用图像RGB数据注意力模型的算法和组合长短期记忆模型算法相比,该算法具有更高的准确率。

关键词: 人体动作识别, 深度学习, 卷积神经网络, 递归神经网络, 记忆单元

Abstract: In order to apply Convolutional Neural Network (CNN) to video understanding,a recognition algorithm based on trained image CNN features is proposed.Image RGB data is employed to recognize human action in videos.Off-the-shelf CNN models are used to extract features from images,and classification is made by recurrent neural networks with Long Short-Term Memory (LSTM) unit.The research focuses on the choice of CNN architectures and layers,feature vectorization and dimentionality reduction.Experimental result shows that the algorithm has higher accuracy than attention model algorithm and composite LSTM algorithm using RGB data.

Key words: human action recognition, deep learning, Convolutional Neural Network(CNN), recurrent neural network, memory unit

中图分类号:

TP391

曹晋其,蒋兴浩,孙锬锋. 基于训练图CNN特征的视频人体动作识别算法[J]. 计算机工程, doi: 10.3969/j.issn.1000-3428.2017.11.038.

CAO Jinqi,JIANG Xinghao,SUN Tanfeng. Video Human Action Recognition Algorithm Based on Trained Image CNN Features[J]. Computer Engineering, doi: 10.3969/j.issn.1000-3428.2017.11.038.

http://www.ecice06.com/CN/Y2017/V43/I11/234

参考文献

参考文献［1］WANG Heng,SCHMID C.Action Recognition with Improved Trajectories［C］//Proceedings of IEEE International Conference on Computer Vision.Washington D.C.,USA:IEEE Press,2013:3551-3558. ［2］程海粟,李庆武,仇春春,等.基于改进密集轨迹的人体行为识别算法［J］.计算机工程,2016,42(8):199-205. ［3］SIMONYAN K,ZISSERMAN A.Two-stream Convolutional Networks for Action Recognition in Videos［C］//Proceedings of Advances in Neural Information Processing Systems.Montreal,Canada:MIT Press,2014:568-576. ［4］SHARIF R A,AZIZPOUR H,SULLIVAN J,et al.CNN Features Off-the-shelf:An Astounding Baseline for Recognition［C］//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition Workshops.Columbus,USA:IEEE Press,2014:806-813. ［5］尹宝才,王文通,王立春.深度学习研究综述［J］.北京工业大学学报,2015(1):48-59. ［6］SHARMA S,KIROS R,SALAKHUTDINOV R.Action Recognition Using Visual Attention［EB/OL］.(2016-02-14).https://arxiv.org/pdf/1511.04119.pdf. ［7］ZHA S,LUISIER F,ANDREWS W,et al.Exploiting Image-trained CNN Architectures for Unconstrained Video Classification［C］//Proceedings of BMVC’05.Swansea,UK:BMVA Press,2015:1-13. ［8］SRIVASTAVA N,MANSIMOV E,SALAKHUTDINOV R.Unsupervised Learning of Video Representations Using LSTMs［C］//Proceedings of International Conference on Machine Learning (ICML).Lille,France:Microtome Publishing,2015:843-852. ［9］DENG Jia,DONG Wei,SOCHER R,et al.Imagenet:A Large-scale Hierarchical Image Database［C］//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2009:248-255. ［10］KUEHNE H,JHUANG H,GARROTE E,et al.HMDB:A Large Video Database for Human Motion Recog-nition［C］//Proceedings of 2011 IEEE International Conference on Computer Vision.Washington D.C.,USA:IEEE Press,2011:2556-2563. ［11］SOOMRO K,ZAMIR A R,SHAH M.UCF101:A Dataset of 101 Human Actions Classes from Videos in the Wild［EB/OL］.(2012-12-03).https://arxiv.org/pdf/1212.0402.pdf. ［12］KRIZHEVSKY A,SUTSKEVER I,HINTON G E.Imagenet Classification with Deep Convolutional Neural Networks［C］//Proceedings of Advances in Neural Information Processing Systems.Montreal,Canada:MIT Press,2012:1097-1105. ［13］SZEGEDY C,LIU W,JIA Y,et al.Going Deeper with Convolutions［C］//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2015:1-9. ［14］ZHANG Xiangyu,ZOU Jianhua,HE Kaiming,et al.Accelerating very Deep Convolutional Networks for Classification and Detection［J］.IEEE Transactions on Pattern Analysis and Machine Intelligence,2016,38(10):1943-1955. ［15］MAHENDRAN A,VEDALDI A.Understanding Deep Image Representations by Inverting Them［C］//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2015:5188-5196. 编辑刘冰

[1]	江雨燕, 陶承凤, 李平. 数据增强和自适应自步学习的深度子空间聚类算法[J]. 计算机工程, 2023, 49(8): 96-103, 110.
[2]	李泽水, 冀俊忠, 杨翠翠. 基于边权重信息深度网络嵌入的PPIN功能模块检测[J]. 计算机工程, 2023, 49(8): 69-76.
[3]	王可铮, 徐玉芬, 周尚波. 结合对比感知损失和融合注意力的图像去雾模型[J]. 计算机工程, 2023, 49(8): 207-214.
[4]	刘俊豪, 王美林, 谢兴, 宋烨兴, 许莉花. 基于改进YOLOv5的皮革瑕疵检测算法[J]. 计算机工程, 2023, 49(8): 240-249.
[5]	曹坪, 杨怀志, 薄一军, 尤嘉, 张淳杰, 李丹勇. 面向低质量裂缝图像的多知识蒸馏分类[J]. 计算机工程, 2023, 49(7): 204-213.
[6]	白明昌. 基于折叠路径聚合的属性网络节点嵌入方法[J]. 计算机工程, 2023, 49(7): 76-84.
[7]	闫兴亚, 匡娅茜, 白光睿, 李月. 基于深度学习的学生课堂行为识别方法[J]. 计算机工程, 2023, 49(7): 251-258.
[8]	李军侠, 王星驰, 殷梓, 石德硕. 边缘深度挖掘的弱监督显著性目标检测[J]. 计算机工程, 2023, 49(7): 169-178.
[9]	吴珊, 周凤. 基于改进SSD算法的小目标检测[J]. 计算机工程, 2023, 49(7): 179-188.
[10]	席建锐, 唐红梅, 梁春阳, 刘鑫. 基于改进隐函数的点云物体重建[J]. 计算机工程, 2023, 49(7): 214-222.
[11]	齐咏生, 杜晓旭, 朱俊峰, 高胜利, 刘利强. 基于增强型轻量深度网络的牧区牲畜高效检测[J]. 计算机工程, 2023, 49(7): 278-287.
[12]	谌雨章, 黄逸姿, 张钧涵. 基于多速率空洞卷积的多尺度水下小目标检测[J]. 计算机工程, 2023, 49(6): 257-264.
[13]	张博旭, 蒲智, 程曦. 基于提示学习的维吾尔语文本分类研究[J]. 计算机工程, 2023, 49(6): 292-299,313.
[14]	于海洋, 景鹏, 张文涛, 谢赛飞, 滑志华, 宋草原. 基于残差与注意力机制的道路裂缝检测U-Net改进模型[J]. 计算机工程, 2023, 49(6): 265-273.
[15]	代祖华, 刘园园, 狄世龙. 语义增强的图神经网络方面级文本情感分析[J]. 计算机工程, 2023, 49(6): 71-80.

选择文件类型/文献管理软件名称

选择包含的内容

基于训练图CNN特征的视频人体动作识别算法

Video Human Action Recognition Algorithm Based on Trained Image CNN Features

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价

模态框（Modal）标题

选择文件类型/文献管理软件名称

选择包含的内容

基于训练图CNN特征的视频人体动作识别算法

Video Human Action Recognition Algorithm Based on Trained Image CNN Features

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价