作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2011, Vol. 37 ›› Issue (16): 158-160. doi: 10.3969/j.issn.1000-3428.2011.16.054

• 人工智能及识别技术 • 上一篇    下一篇

基于词图重估的语音解码参数优化方法

尹明明,屈 丹,李弼程,黄山奇   

  1. (解放军信息工程大学信息工程学院,郑州 450002)
  • 收稿日期:2010-12-30 出版日期:2011-08-20 发布日期:2011-08-20
  • 作者简介:尹明明(1986-),男,硕士研究生,主研方向:语音识别;屈 丹,副教授、博士;李弼程,教授、博士生导师;黄山奇,硕士研究生
  • 基金资助:

    国家“863”计划基金资助项目(2006AA01z146)

Speech Decoding Parameters Optimization Method Based on Word Graph Rescoring

YIN Ming-ming, QU Dan, LI Bi-cheng, HUANG Shan-qi   

  1. (Institute of Information Engineering, PLA Information Engineering University, Zhengzhou 450002, China)
  • Received:2010-12-30 Online:2011-08-20 Published:2011-08-20

摘要: 在大词汇连续语音识别系统中,语言模型权值和插入代价等语音解码参数对系统的识别率有较大的影响,而在实际应用中常通过实验手动调整其值寻求最佳识别结果。为此,提出一种利用二元文法进行词图重估的方法,自动优化语音解码参数。在重估的参数空间搜索过程中采用线性搜索与模拟退火搜索相结合的方法,使优化参数具有全局最优和对初值稳定性强的优点。实验结果表明,相比凭经验设置的参数,该方法估计出的参数值能大幅降低识别词错误率,与经典的N-best优化相比,其优化速度有较大提升。

关键词: 词图重估, 语言模型权值, 解码, 插入代价

Abstract: In Large Vocabulary Continuous Speech Recognition(LVCSR) system, speech decoding parameters——Language Model(LM) weight and insertion cost can greatly affects the recognition performance. But in practice, they are usually hand-tuned through experiment to obtain best recognition performance. This paper proposes the rescoring based method that uses bi-gram LM to optimize the parameters automatically, meanwhile the method of combine line search and Simulated Annealing(SA) search in parameters search space of rescoring, which is globally optimal and is insensitive to initial value of parameter. Experimental results show that the method can dramatically reduce the word error compared with empirical parameter setting method, and gains much faster optimization speed than classical N-best optimization.

Key words: word graph rescoring, Language Model(LM) weight, decoding, insertion cost

中图分类号: