Computer Engineering ›› 2009, Vol. 35 ›› Issue (18): 283-285. doi: 10.3969/j.issn.1000-3428.2009.18.099

• Development Research and Design Technology •

Speech-driven Lip Movement Synthesize System Based on IOHMM

MA E-e, LIU Ying, WANG Cheng-ru

  1. (Department of Information Science and Engineering, Yanshan University, Qinhuangdao 066004)
  • Received: 1900-01-01 Revised: 1900-01-01 Online: 2009-09-20 Published: 2009-09-20

Abstract: For a speech-driven lip movement synthesis system, this paper extracts speech features using wavelet packet analysis. Feature differences and multiple speech frames associated with the preceding and following lip-shape frames are used to represent the dynamic characteristics of speech, and Principal Component Analysis (PCA) is used to reduce the dimensionality of the input speech features. An audio-visual mapping model based on the Input-Output Hidden Markov Model (IOHMM) is adopted to build the speech-driven lip movement synthesis system. Experiments show that the extracted speech parameters are more robust than traditional Mel-frequency cepstral coefficients and that the synthesized lip-shape sequences are more coherent and natural.

Key words: visual speech, wavelet packet analysis, Principal Component Analysis (PCA)
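The feature-preparation steps named in the abstract (wavelet packet features, feature differences, multi-frame context, PCA) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the Haar wavelet, the decomposition depth of 3, the central-difference delta, the context width `k`, and the number of retained components are all assumptions chosen for illustration, and the function names are hypothetical. The IOHMM mapping stage is not shown.

```python
import numpy as np


def haar_wavelet_packet_energies(frame, depth=3):
    """Log energy of each sub-band of a full Haar wavelet packet tree.

    Assumption: Haar filters and depth 3; the abstract does not name either.
    len(frame) must be divisible by 2**depth.
    """
    nodes = [np.asarray(frame, dtype=float)]
    for _ in range(depth):
        next_nodes = []
        for node in nodes:
            even, odd = node[0::2], node[1::2]
            next_nodes.append((even + odd) / np.sqrt(2))  # low-pass branch
            next_nodes.append((even - odd) / np.sqrt(2))  # high-pass branch
        nodes = next_nodes
    energies = np.array([np.sum(n ** 2) for n in nodes])
    return np.log(energies + 1e-10)  # small offset avoids log(0)


def add_deltas(feats):
    """Append central-difference (delta) features to a (T, D) matrix."""
    padded = np.pad(feats, ((1, 1), (0, 0)), mode="edge")
    delta = (padded[2:] - padded[:-2]) / 2.0
    return np.hstack([feats, delta])


def stack_context(feats, k=1):
    """Concatenate each frame with its k previous and k following frames."""
    padded = np.pad(feats, ((k, k), (0, 0)), mode="edge")
    num_frames = feats.shape[0]
    return np.hstack([padded[i:i + num_frames] for i in range(2 * k + 1)])


def pca_reduce(feats, n_components):
    """Project mean-centered features onto the top principal components."""
    centered = feats - feats.mean(axis=0)
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:n_components].T


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    frames = rng.standard_normal((20, 64))  # 20 synthetic frames of 64 samples
    base = np.array([haar_wavelet_packet_energies(f) for f in frames])
    reduced = pca_reduce(stack_context(add_deltas(base), k=1), n_components=10)
    print(reduced.shape)
```

In this sketch each frame yields 8 sub-band features, the deltas double that to 16, a one-frame context on each side triples it to 48, and PCA then reduces the result to the chosen number of components. The reduced vectors would serve as the input sequence to the audio-visual mapping model.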

CLC Number: