作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2022, Vol. 48 ›› Issue (6): 304-313. doi: 10.19678/j.issn.1000-3428.0062018

• 开发研究与工程应用 • 上一篇    下一篇

基于条件对抗自动编码器的跨年龄人脸合成

程志康1,2, 孙锐1,2, 孙琦景1,2, 张旭东1,2   

  1. 1. 合肥工业大学 计算机与信息学院, 合肥 230009;
    2. 工业安全与应急技术安徽省重点实验室, 合肥 230009
  • 收稿日期:2021-07-08 修回日期:2021-09-03 发布日期:2022-06-11
  • 作者简介:程志康(1997—),男,硕士研究生,主研方向为计算机视觉、机器学习;孙锐(通信作者),教授、博士;孙琦景,硕士研究生;张旭东,教授、博士。
  • 基金资助:
    国家自然科学基金(61471154,61876057);安徽省重点研发计划科技强警专项(202004d07020012)。

Cross-age Face Synthesis Based on Conditional Adversarial Autoencoder

CHENG Zhikang1,2, SUN Rui1,2, SUN Qijing1,2, ZHANG Xudong1,2   

  1. 1. School of Computer and Information, Hefei University of Technology, Hefei 230009, China;
    2. Anhui Province Key Laboratory of Industry Safety and Emergency Technology, Hefei 230009, China
  • Received:2021-07-08 Revised:2021-09-03 Published:2022-06-11

摘要: 跨年龄人脸合成是指通过已知特定年龄的人脸图像合成其他年龄段的人脸图像,在动漫娱乐、公共安全、刑事侦查等领域有广泛的应用。针对跨年龄人脸合成图像容易产生器官变形扭曲、人脸局部特征保持效果不佳等问题,提出一种基于条件对抗自动编码器的合成方法。通过在解码器结构中引入通道关注和空间关注模块,分别从通道域和空间域提取重要信息,使模型在训练过程中忽略背景等无关信息,聚焦人脸图像变化的区域,有效解决合成图像器官扭曲变形等问题。此外,设计一种多尺度特征损失网络,从多个尺度更深层次地约束人脸图像的局部结构特征,从而保持人脸合成过程中局部特征结构的稳定性。在UTKFace跨年龄人脸数据集上的实验结果表明,与CAAE方法相比,该方法有效避免了人脸器官变形扭曲问题,能够更好地保持人脸局部结构特征,具有较佳的人脸合成效果和细节保持能力。

关键词: 跨年龄人脸合成, 条件对抗自动编码器, 通道关注模块, 空间关注模块, 多尺度特征损失网络

Abstract: Cross-age face synthesis involves synthesizing facial images of other age groups from facial images of known specific age groups.It has a wide range of applications in the fields of animation entertainment, public safety, criminal investigation, and so on.To solve the problems of organ distortion and poor local feature preservation in cross-age face image synthesis, a cross-age face image synthesis method based on a Conditional Adversarial AutoEncoder (CAAE) is proposed.By introducing both channel and spatial attention into the decoder structure, more important parts are taken from the channel and spatial domains, respectively, so that the model ignores irrelevant information such as the background in the training process, focuses on the changing area of the face image, and effectively avoids the distortion and deformation of organs in synthetic images.In addition, a multi-scale feature loss network is designed to constrain the local structural features of face images from multiple scales to maintain the stability of the local feature structure in the face synthesis process.The experimental results from the UTKFace cross-age face dataset show that compared with the CAAE method, this approach effectively prevents the deformation and distortion of facial organs, can better maintain the local structural features of the face, and has a better face synthesis effect and detail retention ability.

Key words: cross-age face synthesis, Conditional Adversarial AutoEncoder(CAAE), channel attention module, spatial attention module, multi-scale feature loss network

中图分类号: