作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2010, Vol. 36 ›› Issue (8): 7-9. doi: 10.3969/j.issn.1000-3428.2010.08.003

• 博士论文 • 上一篇    下一篇

基于乘积HMM的双模态语音识别方法

赵 晖,顾亚强,唐朝京   

  1. (国防科技大学电子科学与工程学院,长沙 410073)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2010-04-20 发布日期:2010-04-20

Bimodal Speech Recognition Approach Based on Product HMM

ZHAO Hui, GU Ya-qiang, TANG Chao-jing   

  1. (College of Electronic Science and Engineering, National University of Defense Technology, Changsha 410073)
  • Received:1900-01-01 Revised:1900-01-01 Online:2010-04-20 Published:2010-04-20

摘要: 针对噪声环境中的语音识别,提出一种用于双模态语音识别的乘积隐马尔可夫模型(HMM)。在独立训练音频HMM和视频HMM的基础上,建立二维训练模型,表征音频流和视频流之间的异步特性。引入权重系数,根据不同噪声环境自适应调整音频流与视频流的权重。实验结果证明,与其他双模态语音识别方法相比,该方法的识别性能更高。

关键词: 双模态语音识别, 乘积隐马尔可夫模型, 异步特性, 权重系数

Abstract: Aiming at speech recognition in noise environment, this paper proposes a product Hidden Markov Model(HMM) used for bimodal speech recognition. A two-dimension training model is built based on dependently trained audio-HMM and visual-HMM, which reflects the asynchronous characteristic between audio stream and video stream. Weight coefficient is introduced to automatically adjust weight of video stream and audio stream according to different noise environment. Experimental results show that compared with other bimodal speech recognition approaches, this approach exhibits better performance on speech recognition.

Key words: bimodal speech recognition, product Hidden Markov Model(HMM), asynchronous characteristic, weight coefficient

中图分类号: