作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2021, Vol. 47 ›› Issue (12): 278-284. doi: 10.19678/j.issn.1000-3428.0059945

• 图形图像处理 • 上一篇    下一篇

结合Bi-2DPCA与CNN的美式手语识别

杨明羽, 叶春明   

  1. 上海理工大学 管理学院, 上海 200093
  • 收稿日期:2020-11-09 修回日期:2020-12-24 发布日期:2020-12-30
  • 作者简介:杨明羽(1997-),男,硕士研究生,主研方向为图像识别、人工智能、智能算法;叶春明,教授、博士生导师。
  • 基金资助:
    国家自然科学基金(7184003);上海市科委“科技创新行动计划”软科学重点项目(20692104300);上海理工大学科技发展基金(2018KJFZ043)。

American Sign Language Recognition Combining with Bi-2DPCA and CNN

YANG Mingyu, YE Chunming   

  1. School of Business, University of Shanghai for Science and Technology, Shanghai 200093, China
  • Received:2020-11-09 Revised:2020-12-24 Published:2020-12-30

摘要: 针对现有美式手语(ASL)识别算法准确率低和模型训练时间长的问题,提出一种结合双向二维主成分分析(Bi-2DPCA)与卷积神经网络(CNN)并基于贝叶斯优化的识别算法。利用Bi-2DPCA算法对原始图像做数据降维处理,提取行、列方向的特征图,使用卷积神经网络对特征图进行训练分类,同时采用贝叶斯优化算法对模型超参数进行自动调参。在24分类ASL数据集上的实验结果表明,该算法的识别准确率达到99.15%,训练时间相比传统CNN算法减少90.3%。

关键词: 美式手语识别, 双向二维主成分分析, 卷积神经网络, 贝叶斯优化, 自动调参

Abstract: The existing algorithms for American Sign Language(ASL) recognition are limited in the recognition accuracy, and require much time for model training.To address the problem, a Bayesian Optimization(BO)-based algorithm that combines Bidirectional Two-Dimensional Principal Component Analysis(Bi-2DPCA) and Convolutional Neural Network(CNN) is used to optimize model parameters.The Bi-2DPCA algorithm is used to reduce the dimensionality of the original image data, and extract the feature maps in the row and column directions.Then the convolutional neural network is used to train and classify the feature maps.Finally, the Bayesian optimization algorithm is used to adjust the model hyperparameters automatically.The experimental results On 24 classified ASL data sets show that the algorithm achieves a recognition accuracy of 99.15%, and reduces the running time by 90.3% compared with the traditional CNN algorithms.

Key words: American Sign Language(ASL) recognition, Bidirectional Two-Dimensional Principal Component Analysis(Bi-2DPCA), Convolutional Neural Network(CNN), Bayesian Optimization(BO), automatic tuning

中图分类号: