Abstract:
DDBHMM(Duration Distribution Based HMM) is adopted as the acoustic model for Uyghur continuous speech recognition, and the context-dependent triphone model is selected as the best recognition unit, the Uyghur speech recognition system is optimised by using the state-binding method. In order to make the models be trained more sufficiently to improve the recognition performance, the corpus is enlarged, the emphasis is on analysis of the effect that the speech database’s enlargement brings to the recognition rate and accuracy and so on based on several groups of contrasted experiments.
Key words:
corpus,
Uyghur,
DDBHMM model theory,
triphone
摘要: 在维吾尔语连续语音识别试验的声学层建模基础上,引用DDBHMM模型将上下文相关的三音子作为基本识别单元,并提出一种状态绑定的思想,对状态进行优化。为得到更充分的训练模型,提高识别效率,对语料库进行扩充,在多组对比试验的基础上,分析扩充前后对声学层识别速度、准确率等各个方面的影响。
关键词:
语料库,
维吾尔语,
DDBHMM模型理论,
三音子
CLC Number:
WANG Fei-Fei, WU Shou-Er-?Shi-La-Mu, NA Shi-Er-Jiang-?Tu-Er-Xun. Uyghur Speech Acoustics Recognition Based on DDBHMM[J]. Computer Engineering, 2011, 37(2): 197-199.
王飞飞, 吾守尔?斯拉木, 那斯尔江?吐尔逊. 基于DDBHMM的维吾尔语音声学识别[J]. 计算机工程, 2011, 37(2): 197-199.