作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2009, Vol. 35 ›› Issue (5): 265-267.

• 开发研究与设计技术 • 上一篇    下一篇

MFCC中DCT结构的设计与实现

孔维功,张国杰,张效军   

  1. (解放军信息工程大学信息工程学院,郑州 450002)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2009-03-05 发布日期:2009-03-05

Design and Implementation of DCT Structure in MFCC

KONG Wei-gong, ZHANG Guo-jie, ZHANG Xiao-jun   

  1. (School of Information Engineering, PLA Information Engineering University, Zhengzhou 450002)
  • Received:1900-01-01 Revised:1900-01-01 Online:2009-03-05 Published:2009-03-05

摘要: 根据MFCC中DCT的特点,设计一种基于DA算法的实现结构,采用先分解ROM再偏移二进制编码的方法对DA算法进行优化,将ROM表的大小由2N减小到(N/K)2K-1。通过仿真与FPGA测试,验证了该设计的正确性,能够满足说话人识别中MFCC参数提取的实时性要求和精度要求。

关键词: 说话人识别, 美尔频率倒谱系数, 离散余弦变换, 分布式算法

Abstract: This paper presents an implementation structure based on Distributed Arithmetic(DA) according to DCT character in MFCC, which optimizes DA by using ROM reduction and offset binary coder, and reduces the size of ROM table from 2N to (N/K)2K-1. The results of simulation and FPGA test show this kind of design is correct, which meets the requirement of real-time and precision in MFCC computation for speaker recognition.

Key words: speaker recognition, Mel-Frequency Cepstral Coefficients(MFCC), discrete cosine transform, distributed arithmetic

中图分类号: