Cross-age Face Synthesis Based on Conditional Adversarial Autoencoder

Computer Engineering, 2022, Vol. 48, Issue (6): 304-313. doi: 10.19678/j.issn.1000-3428.0062018

CHENG Zhikang^1,2, SUN Rui^1,2, SUN Qijing^1,2, ZHANG Xudong^1,2

1. School of Computer and Information, Hefei University of Technology, Hefei 230009, China;
2. Anhui Province Key Laboratory of Industry Safety and Emergency Technology, Hefei 230009, China

Received: 2021-07-08 Revised: 2021-09-03 Published: 2022-06-11
About the authors: CHENG Zhikang (born 1997), male, master's student, research interests: computer vision and machine learning; SUN Rui (corresponding author), professor, Ph.D.; SUN Qijing, master's student; ZHANG Xudong, professor, Ph.D.
Funding: National Natural Science Foundation of China (61471154, 61876057); Anhui Provincial Key Research and Development Program, Special Project on Science and Technology for Policing (202004d07020012).

Abstract

Cross-age face synthesis generates face images of a person at other ages from a face image at a known age. It has a wide range of applications in animation and entertainment, public safety, and criminal investigation. Cross-age synthesis tends to produce distorted facial organs and to preserve local facial features poorly. To address these problems, a synthesis method based on a Conditional Adversarial AutoEncoder (CAAE) is proposed. Channel attention and spatial attention modules are introduced into the decoder to extract important information from the channel domain and the spatial domain, respectively, so that during training the model ignores irrelevant information such as the background and focuses on the regions of the face image that change, which effectively avoids organ distortion and deformation in the synthesized images. In addition, a multi-scale feature loss network is designed to constrain the local structural features of face images at multiple scales and at a deeper level, keeping the local feature structure stable during synthesis. Experimental results on the UTKFace cross-age face dataset show that, compared with the CAAE method, the proposed method effectively avoids deformation and distortion of facial organs, better preserves the local structural features of the face, and achieves better synthesis quality and detail retention.

Keywords: cross-age face synthesis, Conditional Adversarial AutoEncoder (CAAE), channel attention module, spatial attention module, multi-scale feature loss network
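
The abstract names channel attention, spatial attention, and a multi-scale feature loss, but this page gives no layer-level detail. The NumPy sketch below is only a rough illustration of those three ideas and is not the authors' implementation. It assumes a design in which channel weights come from pooled per-channel statistics passed through a small shared MLP, and the spatial mask comes from channel-wise mean and max maps. The weights `w1`, `w2`, `w_avg`, `w_max`, the function names, and the average-pooling pyramid that stands in for a learned feature network are all hypothetical.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def channel_attention(feat, w1, w2):
    """Reweight the channels of feat (C, H, W).

    w1 (C//r, C) and w2 (C, C//r) form a shared two-layer MLP;
    both are hypothetical weights, not taken from the paper.
    """
    avg = feat.mean(axis=(1, 2))  # per-channel average, shape (C,)
    mx = feat.max(axis=(1, 2))    # per-channel maximum, shape (C,)

    def mlp(v):
        return w2 @ np.maximum(w1 @ v, 0.0)  # linear -> ReLU -> linear

    weights = sigmoid(mlp(avg) + mlp(mx))    # one weight in (0, 1) per channel
    return feat * weights[:, None, None], weights


def spatial_attention(feat, w_avg=1.0, w_max=1.0, bias=0.0):
    """Reweight the spatial positions of feat (C, H, W).

    Simplification: a per-pixel linear mix of the channel-wise mean and
    max maps replaces the convolution a real module would likely use.
    """
    avg_map = feat.mean(axis=0)  # (H, W)
    max_map = feat.max(axis=0)   # (H, W)
    mask = sigmoid(w_avg * avg_map + w_max * max_map + bias)
    return feat * mask[None, :, :], mask


def avg_pool2(x):
    """2x2 average pooling of x (C, H, W); H and W must be even."""
    c, h, w = x.shape
    return x.reshape(c, h // 2, 2, w // 2, 2).mean(axis=(2, 4))


def multi_scale_feature_loss(x, y, num_scales=3):
    """Mean squared difference between x and y, averaged over several scales.

    Assumption: a pooling pyramid stands in for the learned
    multi-scale feature network mentioned in the abstract.
    """
    total = 0.0
    for _ in range(num_scales):
        total += float(np.mean((x - y) ** 2))
        x, y = avg_pool2(x), avg_pool2(y)
    return total / num_scales


# Toy usage on random data standing in for a decoder feature map.
rng = np.random.default_rng(0)
feat = rng.standard_normal((8, 16, 16))
w1 = rng.standard_normal((2, 8)) * 0.1
w2 = rng.standard_normal((8, 2)) * 0.1
refined, channel_weights = channel_attention(feat, w1, w2)
refined, mask = spatial_attention(refined)
print(refined.shape, channel_weights.shape, mask.shape)
print(multi_scale_feature_loss(feat, refined))
```

Applying the channel weights before the spatial mask is one common ordering; the abstract does not say which order, if any, the proposed decoder uses.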

CLC number: TP391

CHENG Zhikang, SUN Rui, SUN Qijing, ZHANG Xudong. Cross-age Face Synthesis Based on Conditional Adversarial Autoencoder[J]. Computer Engineering, 2022, 48(6): 304-313.

http://www.ecice06.com/CN/Y2022/V48/I6/304

Figures/Tables: 15
