作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2011, Vol. 37 ›› Issue (11): 192-194. doi: 10.3969/j.issn.1000-3428.2011.11.066

• 人工智能及识别技术 • 上一篇    下一篇

基于Bark子波变换的MFCC特征提取

尹许梅,何选森   

  1. (湖南大学计算机与通信学院,长沙 410082)
  • 收稿日期:2010-10-26 出版日期:2011-06-05 发布日期:2011-06-05
  • 作者简介:尹许梅(1985-),女,硕士研究生,主研方向:说话人识别;何选森,副教授
  • 基金资助:
    湖南省湘潭市科技计划基金资助项目(ZJ20071008)

MFCC Feature Extraction Based on Bark Wavelet Transform

YIN Xu-mei, HE Xuan-sen   

  1. (College of Computer and Communication, Hunan University, Changsha 410082, China)
  • Received:2010-10-26 Online:2011-06-05 Published:2011-06-05

摘要: 为提高低信噪比环境下语音的鲁棒性,提出一种改进的Mel频率倒谱系数(MFCC)特征提取方法。在传统MFCC特征提取的基础上,引入更适应人耳听觉系统的Bark子波变换,在快速傅里叶变换之前对语音进行预处理,并在MFCC提取方法中代替离散余弦变换;在语音预处理阶段,利用改进的Lanczos窗函数抑制旁瓣以提高语音鲁棒性。实验表明,与传统MFCC方法相比,在噪声环境下,改进方法具有更高的说话人识别率。

关键词: 说话人识别, Mel频率倒谱系数, Bark子波, 窗函数

Abstract: In order to improve the quality of speech in low Signal Noise Ratio(SNR), an improved Mel Frequency Cepstral Coefficient(MFCC) feature extraction method is proposed. On the basis of the traditional MFCC feature extraction, the improved method introduces Bark Wavelet Transform(BWT) for more suitable to human ear’s auditory system, it is used to make preprocessing before Fast Fourier Transform(FFT), on the other hand, it is used to instead of Discrete Cosine Transform(DCT) in MFCC. In the pre-processing stage Lanczos window function is adopted to restrain the side lobe and to improve the robustness. Experimental results show that compared with the traditional MFCC, the improved method can improve the speaker identification accuracy in the noisy environment.

Key words: speaker recognition, Mel Frequency Cepstral Coefficient(MFCC), Bark wavelet, window function

中图分类号: