结合Bi-2DPCA与CNN的美式手语识别

doi:10.19678/j.issn.1000-3428.0059945

摘要/Abstract

摘要： 针对现有美式手语（ASL）识别算法准确率低和模型训练时间长的问题，提出一种结合双向二维主成分分析（Bi-2DPCA）与卷积神经网络（CNN）并基于贝叶斯优化的识别算法。利用Bi-2DPCA算法对原始图像做数据降维处理，提取行、列方向的特征图，使用卷积神经网络对特征图进行训练分类，同时采用贝叶斯优化算法对模型超参数进行自动调参。在24分类ASL数据集上的实验结果表明，该算法的识别准确率达到99.15%，训练时间相比传统CNN算法减少90.3%。

关键词: 美式手语识别, 双向二维主成分分析, 卷积神经网络, 贝叶斯优化, 自动调参

Abstract: The existing algorithms for American Sign Language(ASL) recognition are limited in the recognition accuracy, and require much time for model training.To address the problem, a Bayesian Optimization(BO)-based algorithm that combines Bidirectional Two-Dimensional Principal Component Analysis(Bi-2DPCA) and Convolutional Neural Network(CNN) is used to optimize model parameters.The Bi-2DPCA algorithm is used to reduce the dimensionality of the original image data, and extract the feature maps in the row and column directions.Then the convolutional neural network is used to train and classify the feature maps.Finally, the Bayesian optimization algorithm is used to adjust the model hyperparameters automatically.The experimental results On 24 classified ASL data sets show that the algorithm achieves a recognition accuracy of 99.15%, and reduces the running time by 90.3% compared with the traditional CNN algorithms.

Key words: American Sign Language(ASL) recognition, Bidirectional Two-Dimensional Principal Component Analysis(Bi-2DPCA), Convolutional Neural Network(CNN), Bayesian Optimization(BO), automatic tuning

中图分类号:

TP389.1

杨明羽, 叶春明. 结合Bi-2DPCA与CNN的美式手语识别[J]. 计算机工程, 2021, 47(12): 278-284.

YANG Mingyu, YE Chunming. American Sign Language Recognition Combining with Bi-2DPCA and CNN[J]. Computer Engineering, 2021, 47(12): 278-284.

https://www.ecice06.com/CN/Y2021/V47/I12/278

图/表 13

20211214182752

20211214182756

20211214182800

20211214182804

20211214182808

20211214182811

20211214182814

20211214182818

20211214182821

20211214182824

20211214182828

20211214182832

20211214182836

参考文献

[1] 郝子煜, 阿里甫·库尔班, 李晓红, 等.基于CapsNet的中国手指语识别[J].计算机应用研究, 2019, 36(10):3157-3159. HAO Z Y, KUERBAN A, LI X H, et al.Chinese finger language recognition using CapsNet[J].Application Research of Computers, 2019, 36(10):3157-3159.(in Chinese)
[2] CLEBESON C D S, JORGE L A S, RAQUEL F V.Dynamic gesture recognition by using CNNs and star RGB:a temporal information condensation[J].Neurocomputing, 2020, 400:238-254.
[3] TAO W J, MING C L, YIN Z Z.American sign language alphabet recognition using convolutional neural networks with multiview augmentation and inference fusion[J].Engineering Applications of Artificial Intelligence, 2018, 76:202-213.
[4] ASHA T, DIXIT S K.COHST and wavelet features based static ASL numbers recognition[J].Procedia Computer Science, 2016, 92:455-460.
[5] QUTAISHAT M, MOUSSA H, BAYAN T, et al.American Sign Language(ASL) recognition based on Hough transform and neural networks[J].Expert Systems with Applications, 2005, 32(1):24-37.
[6] ADITHYA V, RAJESH R.A deep convolutional neural network approach for static hand gesture recognition[J].Procedia Computer Science, 2020, 171:2353-2361.
[7] 柯鹏飞, 蔡茂国, 吴涛.基于改进卷积神经网络与集成学习的人脸识别算法[J].计算机工程, 2020, 46(2):262-267, 273. KE P F, CAI M G, WU T.Face recognition algorithm based on improved convolutional neural network and ensemble learning[J].Computer Engineering, 2020, 46(2):262-267, 273.(in Chinese)
[8] ZHANG Y F, SHI L, WU Y, et al.Gesture recognition based on deep deformable 3D convolutional neural networks[J].Pattern Recognition, 2020, 107:1-5.
[9] KRIZHEVSKY A, SUTSKEVER I, HINTON G E.Image net classification with deep convolutional neural networks[C]//Proceedings of International Conference on Neural Information Processing Systems.Cambridge, USA:MIT Press, 2012:1097-1105.
[10] 吴伟.基于SAE-PCA模型的ASL字母识别方法研究[D].厦门:厦门大学, 2014. WU W.Research on ASL letter recognition method based on SAE-PCA model[D].Xiamen:Xiamen University, 2014.(in Chinese)
[11] 钟健, 何韦颖, 谭汉松.基于PCA降维结合机器学习算法的人机交互手势识别研究[J].机床与液压, 2020, 48(6):181-186. ZHONG J, HE W Y, TAN S H.Research on human-computer interaction gesture recognition based on PCA dimensionality reduction and machine learning algorithm[J].Machine Tool & Hydraulics, 2020, 48(6):181-186.(in Chinese)
[12] YANG J, ZHANG D D, FRANGI A F, et al.Two-dimensional PCA:a new approach to appearance-based face representation and recognition[J].IEEE Transactions on Pattern Analysis and Machine Intelligence, 2004, 26(1):131-137.
[13] 胡娜, 马慧, 湛涛.融合LBP纹理特征与B2DPCA技术的手指静脉识别方法[J].智能系统学报, 2019, 14(3):533-540. HU N, MA H, ZHAN T.Finger vein recognition method combining LBP texture feature and B2DPCA technology[J].CAAI Transactions on Intelligent Systems, 2019, 14(3):533-540.(in Chinese)
[14] WANG Y L, ZHANG H X, ZHANG G W.cPSO-CNN:an efficient PSO-based algorithm for fine-tuning hyper-parameters of convolutional neural networks[J].Swarm and Evolutionary Computation, 2019, 49:114-123.
[15] 马芳武, 韩丽, 吴量, 等.基于遗传与粒子群算法的隔振平台减振性能优化[J].吉林大学学报(工学版), 2020, 50(5):1608-1616. MA F W, HAN L, WU L, et al.Damping optimization of heavy-loaded anti-vibration platform based on genetic algorithm and particle swarm algorithm[J].Journal of Jilin University(Engineering and Technology Edition), 2020, 50(5):1608-1616.
[16] 王晨阳, 段倩倩, 周凯, 等.基于遗传算法优化卷积长短记忆混合神经网络模型的光伏发电功率预测[J].物理学报, 2020, 69(10):149-155. WANG C Y, DUAN Q Q, ZHOU K, et al.A hybrid model for photovoltaic power prediction of both convolutional and long short-term memory neural networks optimized by genetic algorithm[J].Acta Physica Sinica, 2020, 69(10):149-155.(in Chinese)
[17] 曾宇, 户文成.贝叶斯优化卷积神经网络公共场所异常声识别[J].应用声学, 2020, 39(3):409-416. ZENG Y, HU W C.Recognition of abnormal sound in public places based on Bayesian optimal convolutional neural network[J].Journal of Applied Acoustics, 2020, 39(3):409-416.(in Chinese)
[18] 崔佳旭, 杨博.贝叶斯优化方法和应用综述[J].软件学报, 2018, 29(10):3068-3090. CUI J X, YANG B.Survey on Bayesian optimization methodology and applications[J].Journal of Software, 2018, 29(10):3068-3090.(in Chinese)
[19] WU J, CHEN X Y, ZHANG H, et al.Hyperparameter optimization for machine learning models based on Bayesian optimization[J].Journal of Electronic Science and Technology, 2019, 17(1):26-40.
[20] 李文宽, 刘培玉, 朱振方, 等.基于卷积神经网络和贝叶斯分类器的句子分类模型[J].计算机应用研究, 2020, 37(2):333-336, 341. LI W K, LIU P Y, ZHU Z F, et al.Sentence classification model based on convolution neural network and Bayesian classifier[J].Application Research of Computers, 2020, 37(2):333-336, 341.(in Chinese)
[21] 生龙, 马建飞, 杨瑞欣, 等.基于特征交换的CNN图像分类算法研究[J].计算机工程, 2020, 46(9):268-273. SHENG L, MA J F, YANG R X, et al.Research on CNN image classification algorithm based on feature exchange[J].Computer Engineering, 2020, 46(9):268-273.(in Chinese)
[22] 冯玉芳, 殷宏, 卢厚清, 等.基于改进全卷积神经网络的红外与可见光图像融合方法[J].计算机工程, 2020, 46(8):243-249, 257. FENG Y F, YIN H, LU H Q, et al.Infrared and visible light image fusion method based on improved fully convolutional neural network[J].Computer Engineering, 2020, 46(8):243-249, 257.(in Chinese)
[23] MATTEO P, LUIGI C, GIUSEPPE P.A light CNN for detecting COVID-19 from CT scans of the chest[J].Pattern Recognition Letters, 2020, 140:95-100.

选择文件类型/文献管理软件名称

选择包含的内容