Design and Implementation of DCT Structure in MFCC

doi:10.3969/j.issn.1000-3428.2009.05.091

Computer Engineering ›› 2009, Vol. 35 ›› Issue (5): 265-267. doi: 10.3969/j.issn.1000-3428.2009.05.091

• Developmental Research • Previous Articles Next Articles

Design and Implementation of DCT Structure in MFCC

KONG Wei-gong, ZHANG Guo-jie, ZHANG Xiao-jun

(School of Information Engineering, PLA Information Engineering University, Zhengzhou 450002)

Received:1900-01-01 Revised:1900-01-01 Online:2009-03-05 Published:2009-03-05

MFCC中DCT结构的设计与实现

孔维功，张国杰，张效军

(解放军信息工程大学信息工程学院，郑州 450002)

Abstract

Abstract: This paper presents an implementation structure based on Distributed Arithmetic(DA) according to DCT character in MFCC, which optimizes DA by using ROM reduction and offset binary coder, and reduces the size of ROM table from 2N to (N/K)2K-1. The results of simulation and FPGA test show this kind of design is correct, which meets the requirement of real-time and precision in MFCC computation for speaker recognition.

Key words: speaker recognition, Mel-Frequency Cepstral Coefficients(MFCC), discrete cosine transform, distributed arithmetic

摘要： 根据MFCC中DCT的特点，设计一种基于DA算法的实现结构，采用先分解ROM再偏移二进制编码的方法对DA算法进行优化，将ROM表的大小由2N减小到(N/K)2K-1。通过仿真与FPGA测试，验证了该设计的正确性，能够满足说话人识别中MFCC参数提取的实时性要求和精度要求。

关键词: 说话人识别, 美尔频率倒谱系数, 离散余弦变换, 分布式算法

CLC Number:

TP311.12

KONG Wei-gong; ZHANG Guo-jie; ZHANG Xiao-jun. Design and Implementation of DCT Structure in MFCC[J]. Computer Engineering, 2009, 35(5): 265-267.

孔维功;张国杰;张效军. MFCC中DCT结构的设计与实现[J]. 计算机工程, 2009, 35(5): 265-267.

/ / Recommend / Download Citations

URL: http://www.ecice06.com/EN/10.3969/j.issn.1000-3428.2009.05.091

http://www.ecice06.com/EN/Y2009/V35/I5/265

[1]	SONG Yukai, XIE Jiang. Lightweight Speech Emotion Recognition Model Based on Multi-Task Learning [J]. Computer Engineering, 2023, 49(5): 122-128.
[2]	CAO Shuxin, FENG Tengteng, GE Fengpei, LIANG Chunyan. Speaker Recognition Based on Scale Correlation-Bidirectional Long Short-Term Memory Network Model [J]. Computer Engineering, 2023, 49(4): 289-296.
[3]	FU Pengcheng, YANG Guan, LIU Xiaoming, LIU Yang, ZHANG Ziming, CHENG Xi. Visual Question Answering Model Based on Spatial Relation and Frequency Feature [J]. Computer Engineering, 2022, 48(9): 96-104.
[4]	WANG Zhongmin, LIU Ge, SONG Hui. Speech Emotion Recognition Method Based on Multiple Kernel Learning Feature Fusion [J]. Computer Engineering, 2019, 45(8): 248-254.
[5]	LIU Yufu,LANG Wenhui,JIA Guangshuai. Realization and Performance Analysis of Matrix Multiplication on HXDSP Platform [J]. Computer Engineering, 2019, 45(4): 25-29.
[6]	QI Xiangming, ZHANG Jing, TAN Xinqi. Zero-Watermarking Algorithm with Strong Robustness Based on Low-Frequency Singular Mean Value [J]. Computer Engineering, 2019, 45(12): 214-221.
[7]	WANG Kuikui,YU Zhenming. Local Blur Detection Based on DCT Zero Coefficients and Local Structure Tensor [J]. Computer Engineering, 2017, 43(6): 207-211,218.
[8]	LI Yong,WEI Dang,WANG Liuyu. Emotional Speech Synthesis Method Based on PSOLA and DCT [J]. Computer Engineering, 2017, 43(12): 278-282,291.
[9]	ZHANG Xiaodan,LI Chunlai. Image Steganographic Algorithm Based on Discrete Cosine Transforms and Co-occurrence Matrix Feature [J]. Computer Engineering, 2015, 41(8): 127-131.
[10]	WANG Ying,YUAN Kaiguo,XI Minchao. Removable Digital Video Watermark Algorithm Based on Discrete Cosine Transform [J]. Computer Engineering, 2015, 41(5): 169-174.
[11]	LI Ruizhen,ZHANG Xiaoxu,MA De,HUANG Kai,YAN Xiaolang. A Flexible and Configurable Architecture of Software and Hardwarem for JPEG Codec [J]. Computer Engineering, 2014, 40(11): 266-272.
[12]	TONG Wei, ZHAO Xu-dong, WANG Shi-lin, LI Sheng-hong. Image Splicing Detection Based on Entropy and Multi-step Markov Feature [J]. Computer Engineering, 2014, 40(1): 236-238,245.
[13]	NIE Xiu-Shan, DONG Fei, SUN Jian-De. Video Fingerprinting Algorithm Based on Kurtosis Image [J]. Computer Engineering, 2013, 39(2): 141-144.
[14]	DU Xiao-qing, YU Feng-qin. Speaker Recognition Based on Vocal Mechanism and Human Ear Perceptual Characteristic [J]. Computer Engineering, 2013, 39(11): 197-199,204.
[15]	XIANG Yao-jie, YANG Jun-an, LI Jin-hui, LU Jun. An Improved Mel-frequency Filter for Speaker Recognition [J]. Computer Engineering, 2013, 39(11): 214-217,222.

Please choose a citation manager

Content to export

Design and Implementation of DCT Structure in MFCC

MFCC中DCT结构的设计与实现

PDF

Knowledge

Cited

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics

Comments

模态框（Modal）标题

Please choose a citation manager

Content to export

Design and Implementation of DCT Structure in MFCC

MFCC中DCT结构的设计与实现

PDF

Knowledge

Cited

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics

Comments