作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2006, Vol. 32 ›› Issue (20): 203-204. doi: 10.3969/j.issn.1000-3428.2006.20.075

• 人工智能及识别技术 • 上一篇    下一篇

应用MAP方差估计的话者自适应训练方法

黄盈椿1,王欢良2,冯 涛2   

  1. (1. 中国科学院电子学研究所,北京 100080;2. 哈尔滨工业大学计算机科学与技术学院,哈尔滨 150001)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2006-10-20 发布日期:2006-10-20

Speaker Adaptive Training of Appling MAP Estimation for Covariance

HUANG Yingchun1, WANG Huanliang2, FENG Tao2   

  1. (1. Institute of Electronics, Chinese Academy of Sciences, Beijing 100080; 2. School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001)
  • Received:1900-01-01 Revised:1900-01-01 Online:2006-10-20 Published:2006-10-20

摘要: 近年来话者自适应训练(SAT)方法日益受到重视。然而在实际中此方法通常因为部分方差的估计失误而导致识别性能下降。该文提出了一种应用最大后验概率(MAP)估计方差的全新SAT方法,它能够根据后验概率动态地调整模型的方差,从而解决上述问题。在Switchboard数据库上的实验显示,新方法能够显著地提高识别性能,并且有效地提升系统的稳定性。

关键词: 语音识别, 话者自适应, 话者自适应训练, MAP

Abstract: Recently there has been a growing interest in speaker adaptive training(SAT). However, errors can often arise when estimating covariance matrices in the original SAT framework due to the lack of observations in some Gauss components. This paper presents a novel approach which applies maximum a posteriori (MAP) covariance-estimating into original SAT. Experimental results in Switchboard corpus demonstrate that the proposed method can deliver significant reductions in word error rate (WER) and raise the robustness of SAT process.

Key words: Speech recognition, Speaker adaptation, Speaker adaptive training(SAT), Maximum a posteriori(MAP)