基于多列深度3D卷积神经网络的手势识别

doi:10.3969/j.issn.1000-3428.2017.08.041

计算机工程

基于多列深度3D卷积神经网络的手势识别

易生,梁华刚,茹锋

(长安大学电子与控制工程学院,西安710064)

收稿日期:2016-06-06 出版日期:2017-08-15 发布日期:2017-08-15
作者简介:易生(1992— ),男,硕士,主研方向为图像处理、模式识别;梁华刚,副教授;茹锋,教授。
基金资助:
国家自然科学基金青年基金( 61203374);陕西省自然科学基金国际合作项目(2014KW01-05)。

Hand Gesture Recognition Based on Multi-column Deep 3D Convolutional Neural Network

YI Sheng,LIANG Huagang,RU Feng

(School of Electronics and Control Engineering,Chang’an University,Xi’an 710064,China)

Received:2016-06-06 Online:2017-08-15 Published:2017-08-15

摘要/Abstract

摘要：

传统2D卷积神经网络对于视频连续帧图像的特征提取容易丢失目标时间轴上的运动信息,导致识别准确度较低。为此,提出一种基于多列深度3D卷积神经网络(3D CNN)的手势识别方法。采用3D卷积核对连续帧图像进行卷积操作,提取目标的时间和空间特征捕捉运动信息。为避免因单组3D CNN特征提取不充分而导致的误分类,训练多组具有较强分类能力的3D CNN结构组成多列深度3D CNN,该结构通过对多组3D CNN的输出结果进行权衡,将权重最大的类别判定为最终的输出结果。实验结果表明,将多列深度3D CNN应用于CHGDs数据集上进行手势识别,识别率达到95.09%,与单组3D CNN及传统2D CNN相比分别提高近7%,20%,对连续图像目标识别具有较好的识别能力。

关键词: 视频图像序列处理, 手势识别, 深度学习, 特征提取, 卷积神经网络, 运动目标识别

Abstract:

The feature extraction method adopted by traditional Convolutional Neural Network(CNN) for video image with continuous frames is east to lose movement information on the target time axis,resulting in low recognition accuracy.To solve this problem,a method based on multi-lolu deep 3D is proposed.The 3D convolution kernel is used to extract the temporal and spatial features to capture the object’s motion information.In order to avoid the error classification because of the insufficient feature information of single 3D CNN,the multi-column 3D CNN is consisted by multi-component 3D CNN that each of them has very strong classification ability.The output of this structure is weighed by the output of each of the 3D CNN,and the category which has the maximum weight is determined to be the final result.The structure of multi-column 3D CNNs is applied to the CHGD for hand gesture recognition.Experimental results show that the method achieves a recognition rate of 95.09%,and the recognition rate compared to a single 3D CNN increases by nearly 7%,it increases by nearly 20%compared to the traditional 2D CNN,it has very excellent recognition ability for the video image sequence.

Key words: video image sequence processing, hand gesture recognition, deep learning, feature extraction, Convolutional Neural Network(CNN), moving object recognition

中图分类号:

TP18

易生,梁华刚,茹锋. 基于多列深度3D卷积神经网络的手势识别[J]. 计算机工程, doi: 10.3969/j.issn.1000-3428.2017.08.041.

YI Sheng,LIANG Huagang,RU Feng. Hand Gesture Recognition Based on Multi-column Deep 3D Convolutional Neural Network[J]. Computer Engineering, doi: 10.3969/j.issn.1000-3428.2017.08.041.

http://www.ecice06.com/CN/Y2017/V43/I8/243

参考文献

参考文献［1］刘蓉,刘明.基于三轴加速度传感器的手势识别［J］.计算机工程,2011,37(24):141-143. ［2］ Li H Y,Chen J J,Li X,et al.A Gesture Recognition Algorithm Based on PCA and BP Neural Network［J］.Advanced Materials Research,2013,734-737:3053-3056. ［3］ Huang D Y,Hu W C,Chang S H.Vision-based Hand Gesture Recognition Using PCA+Gabor Filters and SVM［C］//Proceedings of International Conference on Intelligent Information Hiding and Multimedia Signal Processing.WashingtonD.C.,USA:IEEE Press,2009:1-4. ［4］黄振翔,彭波,吴娟,等.基于DTW与混合判别特征检测器的手势识别［J］.计算机工程,2014,40(5):216-218. ［5］Liu N,Lovell B C,Kootsookos P J,et al.Model Structure Selection & Training Algorithms for an HMM Gesture Recognition System［J］.Handwriting Recognition,2004,(6682):100-105. ［6］包加桐,宋爱国,郭晏,等.基于SURF特征跟踪的动态手势识别算法［J］.机器人,2011,33(4):482-489. ［7］Sgouropoulos K,Stergiopoulou E,Papamarkos N.A Dynamic Gesture and Posture Recognition System［J］.Journal of Intelligent & Robotic Systems,2014,76(2):283-296. ［8］刘阳,尚赵伟.基于Kinect骨架信息的交通警察手势识别［J］.计算机工程与应用,2015,51(3):157-161. ［9］Schmidhuber J.Deep Learning in Neural Networks:An Overview［J］.Journal of International Neural Network Society,2014,61(1):85-117. ［10］Chen X W,Lin X.Big Data Deep Learning:Challenges and Perspectives［J］.IEEE Access,2014(2):514-525. ［11］Ji S,Yang M,Yu K.3D Convolutional Neural Networks for Human Action Recognition［J］.IEEE Transactions on Pattern Analysis & Machine Intelligence,2013,35(1):221-231. ［12］Kim H J,Lee J S,Park J H.Dynamic Hand Gesture Recognition Using a CNN Model with 3D Receptive Fields［C］//Proceedings of International Conference on Neural Networks and Signal Processing.Washington D.C.,USA:IEEE Press,2008:14-19. ［13］Li Y,Sohel F,Bennamoun M,et al.Heterogeneous Multi-column Conv Nets with a Fusion Framework for Object Recognition［C］//Proceedings of IEEE Winter Conference on Applications of Computer Vision.［S.1.］:IEEE Computer Society,2015:773-780. ［14］Ciresan D,Meier U,Schmidhuber J.Multi-column Deep Neural Networks for Image Classification［C］//Proceedings of IEEE Conference on Computer Vision & Pattern Recognition.WashingtonD.C.,USA:IEEE Press,2012:3642-3649. ［15］Lecun Y,Boser B,Denker J,et al.Backpropagation Applied to Handwritten Zip Code Recognition［J］.Neural Computation,1989,1(4):541-551. ［16］Kim T K,Cipolla R.Canonical Correlation Analysis of Video Volume Tensors for Action Categorization and Detection［J］.IEEE Transactions on Pattern Analysis and Machine Intelligence,2009,31(8):1415-1428. 编辑索书志

[1]	江雨燕, 陶承凤, 李平. 数据增强和自适应自步学习的深度子空间聚类算法[J]. 计算机工程, 2023, 49(8): 96-103, 110.
[2]	李泽水, 冀俊忠, 杨翠翠. 基于边权重信息深度网络嵌入的PPIN功能模块检测[J]. 计算机工程, 2023, 49(8): 69-76.
[3]	王可铮, 徐玉芬, 周尚波. 结合对比感知损失和融合注意力的图像去雾模型[J]. 计算机工程, 2023, 49(8): 207-214.
[4]	刘俊豪, 王美林, 谢兴, 宋烨兴, 许莉花. 基于改进YOLOv5的皮革瑕疵检测算法[J]. 计算机工程, 2023, 49(8): 240-249.
[5]	马娜, 温廷新, 贾旭, 李晓会. 复杂光照条件下自适应的车脸重识别模型[J]. 计算机工程, 2023, 49(8): 275-282, 290.
[6]	曹坪, 杨怀志, 薄一军, 尤嘉, 张淳杰, 李丹勇. 面向低质量裂缝图像的多知识蒸馏分类[J]. 计算机工程, 2023, 49(7): 204-213.
[7]	白明昌. 基于折叠路径聚合的属性网络节点嵌入方法[J]. 计算机工程, 2023, 49(7): 76-84.
[8]	闫兴亚, 匡娅茜, 白光睿, 李月. 基于深度学习的学生课堂行为识别方法[J]. 计算机工程, 2023, 49(7): 251-258.
[9]	李军侠, 王星驰, 殷梓, 石德硕. 边缘深度挖掘的弱监督显著性目标检测[J]. 计算机工程, 2023, 49(7): 169-178.
[10]	吴珊, 周凤. 基于改进SSD算法的小目标检测[J]. 计算机工程, 2023, 49(7): 179-188.
[11]	席建锐, 唐红梅, 梁春阳, 刘鑫. 基于改进隐函数的点云物体重建[J]. 计算机工程, 2023, 49(7): 214-222.
[12]	齐咏生, 杜晓旭, 朱俊峰, 高胜利, 刘利强. 基于增强型轻量深度网络的牧区牲畜高效检测[J]. 计算机工程, 2023, 49(7): 278-287.
[13]	张博旭, 蒲智, 程曦. 基于提示学习的维吾尔语文本分类研究[J]. 计算机工程, 2023, 49(6): 292-299,313.
[14]	于海洋, 景鹏, 张文涛, 谢赛飞, 滑志华, 宋草原. 基于残差与注意力机制的道路裂缝检测U-Net改进模型[J]. 计算机工程, 2023, 49(6): 265-273.
[15]	代祖华, 刘园园, 狄世龙. 语义增强的图神经网络方面级文本情感分析[J]. 计算机工程, 2023, 49(6): 71-80.

选择文件类型/文献管理软件名称

选择包含的内容

基于多列深度3D卷积神经网络的手势识别

Hand Gesture Recognition Based on Multi-column Deep 3D Convolutional Neural Network

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价

模态框（Modal）标题

选择文件类型/文献管理软件名称

选择包含的内容

基于多列深度3D卷积神经网络的手势识别

Hand Gesture Recognition Based on Multi-column Deep 3D Convolutional Neural Network

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价