作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2012, Vol. 38 ›› Issue (7): 287-289. doi: 10.3969/j.issn.1000-3428.2012.07.094

• 开发研究与设计技术 • 上一篇    下一篇

基于相邻帧特征相似性的快速关键词检出方法

袁 浩,李海洋,郑铁然,韩纪庆   

  1. (哈尔滨工业大学计算机科学与技术学院,哈尔滨 150001)
  • 收稿日期:2011-07-24 出版日期:2012-04-05 发布日期:2012-04-05
  • 作者简介:袁 浩(1986-),男,硕士研究生,主研方向:连续语音识别;李海洋,博士研究生;郑铁然,副教授;韩纪庆,教授、 博士生导师
  • 基金资助:
    国家“973”计划基金资助项目(2007CB311100)

Rapid Keyword Spotting Method Based on Feature Similarity of Adjacent Frames

YUAN Hao, LI Hai-yang, ZHENG Tie-ran, HAN Ji-qing   

  1. (School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China)
  • Received:2011-07-24 Online:2012-04-05 Published:2012-04-05

摘要: 针对关键词检出系统中计算观察概率效率较低的问题,在最近邻近似方法的基础上,提出一种基于相邻帧特征相似性的方法。依据相邻帧之间的高相似性,利用产生前一帧特征矢量的若干个最大的混合分量,有效预测当前帧所使用的最大高斯混合分量,从而快速计算观察概率。实验结果表明,与基线系统相比,该方法在保持识别性能的前提下,识别时间可降低29.3%。

关键词: 语音识别, 关键词检出, GMM估算, 在线垃圾模型, 相邻帧特征相似性, 隐马尔科夫模型

Abstract: Aiming at the problem that the Keyword Spotting(KWS) system can not calculate the state likelihood effectively. Based on nearest-neighbor approximation, a new technique named Feature Similarity of Adjacent Frames(FSAF) is proposed. Depending on the high similarity of adjacent frames, it uses some of the maximum mixtures of the previous frame to predict the maximum mixture used by the current frame to calculate the likelihood quickly. Experimental result shows the technique reduces the recognition time by 29.3% over baseline system, but the identification performance is kept, it can be applied to actual projects.

Key words: speech recognition, Keyword Spotting(KWS), GMM approximation, on-line garbage model, Feature Similarity of Adjacent Frames(FSAF), Hidden Markov Model(HMM)

中图分类号: