Speech-driven Lip Movement Synthesize System  Based on IOHMM

doi:10.3969/j.issn.1000-3428.2009.18.099

Computer Engineering ›› 2009, Vol. 35 ›› Issue (18): 283-285. doi: 10.3969/j.issn.1000-3428.2009.18.099

• Developmental Research • Previous Articles

Speech-driven Lip Movement Synthesize System Based on IOHMM

MA E-e, LIU Ying, WANG Cheng-ru

(Department of Information Science and Engineering, Yanshan University, Qinhuangdao 066004)

Received:1900-01-01 Revised:1900-01-01 Online:2009-09-20 Published:2009-09-20

基于IOHMM的语音驱动唇动合成系统

马娥娥，刘颖，王成儒

(燕山大学信息科学与工程学院，秦皇岛 066004)

Abstract

Abstract: This paper processes speech feature extraction based on wavelet packet analysis aiming at speech-driven lip movement synthesize. It uses feature difference and multi-frames speech based on association relationship of lip frames to express dynamic characteristic for speech, utilizes Principal Component Analysis(PCA) to reduce dimensions of the input speech. It introduces speech-visual mapping models based on Input-Output Hidden Markov Model(IOHMM) to obtain speech-driven lip movement synthesize system. Experiment indicates that speech features are more robust than traditional Mel-frequency cepstrum coefficient, can synthesize coherent and natural lip sequences.

Key words: visual speech, wavelet packet analysis, Principal Component Analysis(PCA)

摘要： 针对语音驱动的唇动合成系统进行基于小波包分析的语音特征提取，采用特征差分和口形帧前后关联的多帧语音表征语音的动态特性，利用主成分分析降低输入语音的特征维数。采用基于输入输出隐马尔可夫模型(IOHMM)的音视频映射模型构建语音驱动唇动合成系统，实验表明提取的语音参数比传统Mel倒谱系数鲁棒性更好，合成的口形序列更连贯、自然。

关键词: 可视语音, 小波包分析, 主成分分析

CLC Number:

TP391.42

MA E-e; LIU Ying; WANG Cheng-ru. Speech-driven Lip Movement Synthesize System Based on IOHMM[J]. Computer Engineering, 2009, 35(18): 283-285.

马娥娥;刘颖;王成儒. 基于IOHMM的语音驱动唇动合成系统[J]. 计算机工程, 2009, 35(18): 283-285.

/ / Recommend / Download Citations

URL: http://www.ecice06.com/EN/10.3969/j.issn.1000-3428.2009.18.099

http://www.ecice06.com/EN/Y2009/V35/I18/283

[1]	YAN Xin, ZHU Yonghao, TU Naiwei, WU Shuwen, WANG Yuhong. Prediction of Coal and Gas Outburst in Mine Working Face Based on PCA and Weighted Bayesian [J]. Computer Engineering, 2021, 47(8): 315-320.
[2]	HAO Zhanjun, ZHANG Daiyang, DANG Xiaochao, DUAN Yu. Non-contact Human Motion Recognition Method Based on Channel State Information [J]. Computer Engineering, 2021, 47(6): 172-181.
[3]	HU Tao, DIAN Songyi, JIANG Ronghua. Hardware Trojan Detection Based on Long Short-Term Memory Neural Network [J]. Computer Engineering, 2020, 46(7): 110-115.
[4]	DANG Xiaochao, DENG Qiyan, HAO Zhanjun. Indoor Personnel Detection Method Based on 30° Angle Concentric Circular Sampling [J]. Computer Engineering, 2020, 46(4): 198-205.
[5]	YAN Yujuan,LI Hua,ZHAO Jumin,LI Deng’ao,LIU Jia. Fall detection system based on CRFID and pattern recognition [J]. Computer Engineering, 2019, 45(6): 297-302,309.
[6]	DI Ruitong,WANG Hong,FANG Youli. Fake Reviews Identification Method Fusing Time Series and Multi-scale Features [J]. Computer Engineering, 2019, 45(3): 278-285,292.
[7]	YANG Chenchen,MA Chunmei,ZHU Jinqi. Study of Fall Behavior Identification Algorithm Based on Smartphone [J]. Computer Engineering, 2019, 45(2): 178-183.
[8]	WANG Lin, ZHAO Junli, DUAN Fuqing, ZHOU Mingquan. Survey on Craniofacial Reconstruction Method [J]. Computer Engineering, 2019, 45(12): 8-18.
[9]	DANG Xiaochao,SI Xiong,HAO Zhanjun,HUANG Yaning. A Passive Indoor Fingerprint Localization Algorithm Based on Channel State Information [J]. Computer Engineering, 2018, 44(7): 114-120.
[10]	ZHANG Xiaoming,WANG Zhijun,LIANG Liping. A Stacking Algorithm for Convolution Neural Network [J]. Computer Engineering, 2018, 44(4): 243-247.
[11]	YANG Hanfang,ZHOU Xiangdong. Cross Domain Image Classification Based on Deep Sparse Discrimination [J]. Computer Engineering, 2018, 44(4): 310-316.
[12]	LIU Yachong,TANG Zhiling. Classification and Identification Method of Communication Radiation Source Feature Based on Softmax Regression [J]. Computer Engineering, 2018, 44(2): 98-102.
[13]	ZHANG Huiyi,HOU Yaozu,TAO Tao. A Collaborative Filtering Algorithm Based on Two-stage Joint Hashing [J]. Computer Engineering, 2018, 44(12): 316-320.
[14]	ZHAO Pengbiao,LIU Ge,LUO Lei,ZHOU Rui. Construction Method of Indoor Digital Floor Plan Based on Smartphone [J]. Computer Engineering, 2018, 44(11): 271-275,281.
[15]	MIAO Jiajia,SHEN Lei,GUO Jingjing. Blind Decoding Method of Asynchronous WCDMA Signals Under Array Antenna [J]. Computer Engineering, 2018, 44(10): 107-111.

Please choose a citation manager

Content to export

Speech-driven Lip Movement Synthesize System Based on IOHMM

基于IOHMM的语音驱动唇动合成系统

Knowledge

Cited

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics

Comments

模态框（Modal）标题

Please choose a citation manager

Content to export

Speech-driven Lip Movement Synthesize System Based on IOHMM

基于IOHMM的语音驱动唇动合成系统

Knowledge

Cited

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics

Comments