Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering ›› 2009, Vol. 35 ›› Issue (14): 221-223. doi: 10.3969/j.issn.1000-3428.2009.14.077

• Artificial Intelligence and Recognition Technology • Previous Articles     Next Articles

Speaker Identification Algorithm of Semi-Supervised Learning Mechanism

LI Yan-ping, TANG Zhen-min, DING Hui, ZHANG Yan   

  1. (Lab of Pattern Recognition and Intelligence System, Nanjing University of Science and Technology, Nanjing 210094)
  • Received:1900-01-01 Revised:1900-01-01 Online:2009-07-20 Published:2009-07-20

半监督学习机制下的说话人辨认算法

李燕萍,唐振民,丁 辉,张 燕   

  1. (南京理工大学模式识别与智能系统实验室,南京 210094)

Abstract: Aiming at the limitation of generalization ability when the speaker identification system falls short of training data, this paper proposes a new method of complex Gaussian Mixture Model(GMM) based on Semi-Supervised Learning(SSL). It uses complex GMM to model the feature distribution of all speakers and the training period adopted SSL based on iterative algorithm. Experimental shows that this new method can take full advantage of unlabeled data on the self-adaptive updating, improve the performance, get higher recognition rate compared with traditional GMM, and effectively enhance the generalization ability of system.

Key words: speaker identification, Semi-Supervised Learning(SSL), complex Gaussian Mixture Model(GMM), EM algorithm

摘要: 针对说话人辨认中训练语音有限时系统泛化能力差的问题,提出一种基于半监督学习的复合高斯混合模型算法。通过复合高斯混合模型对所有说话人的特征分布统一建模,基于半监督学习机制下的EM算法对学习样本进行学习。实验证明,该算法能够充分利用未标记样本对系统进行有效的自适应更新,改善系统的性能,获得比传统高斯混合模型更高的识别率,提高系统的泛化能力。

关键词: 说话人辨认, 半监督学习, 复合高斯混合模型, EM算法

CLC Number: