作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2011, Vol. 37 ›› Issue (2): 197-199. doi: 10.3969/j.issn.1000-3428.2011.02.068

• 人工智能及识别技术 • 上一篇    下一篇

基于DDBHMM的维吾尔语音声学识别

王飞飞1a,吾守尔•斯拉木1a,那斯尔江•吐尔逊1b,2   

  1. (1. 新疆大学a. 信息科学与工程学院;b. 数学与系统科学学院,乌鲁木齐 830046;2. 西安交通大学电子与信息工程学院,西安 710049)
  • 出版日期:2011-01-20 发布日期:2011-01-25
  • 作者简介:王飞飞(1984-),女,硕士研究生,主研方向:语音信息处理;吾守尔?斯拉木,教授、博士生导师;那斯尓江?吐尔逊,副教授、博士研究生
  • 基金资助:
    国家自然科学基金资助项目(60762006, 60863008);国家语委基金资助重点项目(MZ115-75)

Uyghur Speech Acoustics Recognition Based on DDBHMM

WANG Fei-fei 1a, Wushour Silamu 1a, Nasirjan Tursun 1b,2   

  1. (1a. Information Science and Engineering College; 1b. Mathematics and Systems Science College, Xinjiang University, Urumqi 830046, China; 2. Electronic and Information Engineering College, Xi’an Jiaotong University, Xi’an 710049, China)
  • Online:2011-01-20 Published:2011-01-25

摘要: 在维吾尔语连续语音识别试验的声学层建模基础上,引用DDBHMM模型将上下文相关的三音子作为基本识别单元,并提出一种状态绑定的思想,对状态进行优化。为得到更充分的训练模型,提高识别效率,对语料库进行扩充,在多组对比试验的基础上,分析扩充前后对声学层识别速度、准确率等各个方面的影响。

关键词: 语料库, 维吾尔语, DDBHMM模型理论, 三音子

Abstract: DDBHMM(Duration Distribution Based HMM) is adopted as the acoustic model for Uyghur continuous speech recognition, and the context-dependent triphone model is selected as the best recognition unit, the Uyghur speech recognition system is optimised by using the state-binding method. In order to make the models be trained more sufficiently to improve the recognition performance, the corpus is enlarged, the emphasis is on analysis of the effect that the speech database’s enlargement brings to the recognition rate and accuracy and so on based on several groups of contrasted experiments.

Key words: corpus, Uyghur, DDBHMM model theory, triphone

中图分类号: