作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2021, Vol. 47 ›› Issue (10): 242-251. doi: 10.19678/j.issn.1000-3428.0059166

• 开发研究与工程应用 • 上一篇    下一篇

基于遮挡感知卷积神经网络的面部表情识别模型

王军1, 赵凯2, 程勇1   

  1. 1. 南京信息工程大学 科技产业处, 南京 210044;
    2. 南京信息工程大学 计算机与软件学院, 南京 210044
  • 收稿日期:2020-08-04 修回日期:2020-10-13 发布日期:2020-10-28
  • 作者简介:王军(1970-),男,教授、博士,主研方向为机器学习、神经网络、图像处理;赵凯,硕士研究生;程勇,高级工程师,博士。
  • 基金资助:
    国家自然科学基金(41875184,61373064);江苏省“六大人才高峰”创新团队项目(TD-XYDXX-004);赛尔网络下一代互联网技术创新项目(NGII20170610,NGII20171204);江苏省农业气象重点实验室开放基金(KYQ1309)。

Facial Expression Recognition Model Based on Convolutional Neural Network with Occlusion Perception

WANG Jun1, ZHAO Kai2, CHENG Yong1   

  1. 1. Technology Industry Department, Nanjing University of Information Science & Technology, Nanjing 210044, China;
    2. School of Computer & Software, Nanjing University of Information Science & Technology, Nanjing 210044, China
  • Received:2020-08-04 Revised:2020-10-13 Published:2020-10-28

摘要: 针对面部遮挡情况下表情特征难以提取的问题,提出一种双通道遮挡感知神经网络模型。设计区域遮挡判定单元并集成到VGG16网络中形成遮挡感知神经网络,提取面部图像中未遮挡区域及遮挡较少区域的表情特征。运用迁移学习算法对卷积层参数进行预训练,减轻训练数据样本不足带来的过拟合问题。通过优化残差网络提取全脸表情相关特征,在此基础上加权融合遮挡感知神经网络和残差网络的输出以识别表情。在CK+、RAF-DB、SFEW这3个公开数据库上进行对比实验,结果表明,该模型平均准确率分别达到97.33%、86%、61.06%,与OPCNN、ResNet、VGG16等传统卷积神经网络模型相比,有效提高了面部遮挡情况下的表情识别精度。

关键词: 卷积神经网络, 面部表情识别, 迁移学习, 特征融合, 残差网络

Abstract: To reduce the difficulty in extracting features of an occluded face, a dual-channel Convolutional Neural Network (CNN) model with occlusion perception is proposed.The model is constructed by integrating newly designed occlusiondecision units into VGG16 network, which aims at extractingexpression-related features of the areas that are less occluded.The model employs the transfer learning algorithm to pre-train the parameters of the convolutional layer, which means to alleviate the over-fittingproblem.At the meantime, the expression-related features of the whole facial image are extracted by the modified residual network.Finally, the outputs of theperceptive neural network and residual network arefused in a weighted manner.The experimental results show that the proposed model achieves an accuracy of 97.33% on CK+, 86% on RAF-DB, and 61.06%on SFEW.Compared with traditional OPCNN, ResNet, and VGG16 models, the proposed model exhibits a significant improvement in the accuracy of recognizing the expression of an occluded face.

Key words: Convolutional Neural Network(CNN), facial expression recognition, transfer learning, feature fusion, residual network

中图分类号: