作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2021, Vol. 47 ›› Issue (10): 207-213. doi: 10.19678/j.issn.1000-3428.0059225

• 图形图像处理 • 上一篇    下一篇

一种端到端的人脸对齐方法

康智慧1, 王全玉1, 王战军2   

  1. 1. 北京理工大学 计算机科学与技术学院, 北京 100081;
    2. 北京理工大学 人文与社会科学学院, 北京 100081
  • 收稿日期:2020-08-11 修回日期:2020-09-15 发布日期:2020-09-22
  • 作者简介:康智慧(1993-),女,硕士研究生,主研方向为人机交互、深度学习;王全玉,副教授、博士;王战军,教授、博士。
  • 基金资助:
    国家自然科学基金(71834001)。

An End-to-End Face Alignment Method

KANG Zhihui1, WANG Quanyu1, WANG Zhanjun2   

  1. 1. School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China;
    2. School of Humanities and Social Sciences, Beijing Institute of Technology, Beijing 100081, China
  • Received:2020-08-11 Revised:2020-09-15 Published:2020-09-22

摘要: 现有的人脸对齐方法多数是非端到端的,中间过程需要大量的人工干预,导致人脸关键点检测的稳定性较差。为此,提出一种端到端的基于深度学习的人脸对齐方法。基于MobileNets系列网络的子模块,使用类VGG结构的方式进行搭建,将整张图片作为输入,采用基于深度可分离卷积模块进行特征提取,并运用改进的倒残差结构避免网络训练过程的梯度消失,减少特征损失。在此基础上将眼间距离作为正规化方法,在300W人脸数据集上进行测试,结果表明,与CDM、DRMF等方法相比,该方法在保证较优精度的同时,具有良好的实时性。

关键词: 人脸对齐, 人脸特征点, 特征提取, 深度可分离卷积, 倒残差结构

Abstract: Most of the existing face alignment methods are not end-to-end, and require frequent manual intervention, which leads to a reduction in their stability.To address the problem, an end-to-end face alignment method based on deep learning is proposed.The network required by this method is constructed based on the sub-modules of the MobileNet series in a structure similar to VGG.Taking the entire image as the input, the depth-wise separable convolution module is used for feature extraction, and the method employs an improved inverted residual structure to avoid the disappearance of gradients in the network training process while reducing the loss of features.The distance between eyes is taken as the basis for normalization.The designed network is tested on the 300W face dataset and compared with CDM, DRMF methods. The experimental results show that the proposed algorithm displays excellent accuracy and real-time performance.

Key words: face alignment, facial landmark, feature extraction, depth-wise separable convolution, inverted residual structure

中图分类号: