Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering ›› 2009, Vol. 35 ›› Issue (5): 265-267. doi: 10.3969/j.issn.1000-3428.2009.05.091

• Developmental Research • Previous Articles     Next Articles

Design and Implementation of DCT Structure in MFCC

KONG Wei-gong, ZHANG Guo-jie, ZHANG Xiao-jun   

  1. (School of Information Engineering, PLA Information Engineering University, Zhengzhou 450002)
  • Received:1900-01-01 Revised:1900-01-01 Online:2009-03-05 Published:2009-03-05

MFCC中DCT结构的设计与实现

孔维功,张国杰,张效军   

  1. (解放军信息工程大学信息工程学院,郑州 450002)

Abstract: This paper presents an implementation structure based on Distributed Arithmetic(DA) according to DCT character in MFCC, which optimizes DA by using ROM reduction and offset binary coder, and reduces the size of ROM table from 2N to (N/K)2K-1. The results of simulation and FPGA test show this kind of design is correct, which meets the requirement of real-time and precision in MFCC computation for speaker recognition.

Key words: speaker recognition, Mel-Frequency Cepstral Coefficients(MFCC), discrete cosine transform, distributed arithmetic

摘要: 根据MFCC中DCT的特点,设计一种基于DA算法的实现结构,采用先分解ROM再偏移二进制编码的方法对DA算法进行优化,将ROM表的大小由2N减小到(N/K)2K-1。通过仿真与FPGA测试,验证了该设计的正确性,能够满足说话人识别中MFCC参数提取的实时性要求和精度要求。

关键词: 说话人识别, 美尔频率倒谱系数, 离散余弦变换, 分布式算法

CLC Number: