基于信息融合的短语音说话人识别方法研究

doi:10.3969/j.issn.1000-3428.2011.02.058

计算机工程 ›› 2011, Vol. 37 ›› Issue (2): 169-171. doi: 10.3969/j.issn.1000-3428.2011.02.058

基于信息融合的短语音说话人识别方法研究

周萍，唐李珍

（桂林电子科技大学电子工程与自动化学院，广西桂林 541004）

出版日期:2011-01-20 发布日期:2011-01-25
作者简介:周萍(1961－)，女，教授，主研方向：语音信号处理，智能控制；唐李珍，硕士研究生
基金资助:
广西壮族自治区教育厅科研基金资助项目(200808MS 008)

Research on Speaker Recognition Method of Little Speech Data Based on Information Fusion

ZHOU Ping, TANG Li-zhen

（School of Electronic Engineering and Automation, Guilin University of Electronic Technology, Guilin 541004, China）

Online:2011-01-20 Published:2011-01-25

摘要/Abstract

摘要： 针对短训练语音的说话人识别系统，提出一种基于决策层融合的识别算法。识别时运用经验模式分解法对语音信号进行处理，对获取的固有模态函数分量提取语音特征序列，分别进行匹配，通过决策层融合算法，将所得的匹配结果与传统独立识别结果相结合，最终输出识别结果。利用信号分解的方法，实现待测语音信号的重复识别，同时采用决策层融合算法优化识别结果，从而在短训练语音情况下，使系统的识别率得到保障。实验结果表明，该算法在短训练语音识别系统中的识别效果优于传统方法。

关键词: 短语音, 说话人识别, 美尔频率倒谱系数, 经验模式分解, 决策层融合

Abstract: An algorithm of decision-fusion for the speaker recognition systems is presented that uses little speech data for training models. In the phase of recognition, it decomposes the speech signal by using empirical mode decomposition processing method, extracting the speech features and repeating speech recognition based on some gained intrinsic mode function components. Meanwhile, by using this algorithm, it can fuse the results of repeat recognition and original system, and get the final output result. With the method of signal processing, repeating recognition can be implemented the original results and the accuracy rate of the recognition system based on little speech data is guaranteed. It proves the algorithm is advanced in recognition systems based on little speech data than the traditional ones by simulation experiments.

Key words: little speech data, speaker recognition, Mel Frequency Cepstrum Coefficient(MFCC), Empirical Mode Decomposition(EMD), decision level fusion

中图分类号:

TN912.3

周萍, 唐李珍. 基于信息融合的短语音说话人识别方法研究[J]. 计算机工程, 2011, 37(2): 169-171.

ZHOU Ping, TANG Li-Zhen. Research on Speaker Recognition Method of Little Speech Data Based on Information Fusion[J]. Computer Engineering, 2011, 37(2): 169-171.

http://www.ecice06.com/CN/Y2011/V37/I2/169

[1]	曹书鑫, 冯藤藤, 葛凤培, 梁春燕. 基于尺度相关‐双向长短期记忆网络模型的说话人识别[J]. 计算机工程, 2023, 49(4): 289-296.
[2]	肖佳林，赵聿晴，王英. 基于HMM与SVM的语音活动检测[J]. 计算机工程, 2014, 40(1): 203-208.
[3]	李敏, 吴斌, 刘恒. 巡逻式多电子哨兵目标识别的数据融合方法[J]. 计算机工程, 2013, 39(3): 182-186.
[4]	熊志伟, 全海燕, 周荣强. 基于Bessel函数展开的ICA语音增强[J]. 计算机工程, 2013, 39(3): 311-315.
[5]	杜晓青，于凤芹. 基于发声机理与人耳感知特性的说话人识别[J]. 计算机工程, 2013, 39(11): 197-199,204.
[6]	项要杰，杨俊安，李晋徽，陆俊. 一种适用于说话人识别的改进Mel滤波器[J]. 计算机工程, 2013, 39(11): 214-217,222.
[7]	汤丽平, 刘剑. 基于近似熵的心肌猝死预警诊断[J]. 计算机工程, 2012, 38(9): 202-204,207.
[8]	张烨, 田雯, 刘盛鹏. 基于集合经验模式分解的火灾时间序列预测[J]. 计算机工程, 2012, 38(24): 152-155.
[9]	秦春香, 黄浩. 发音特征在维汉语音识别中的应用[J]. 计算机工程, 2012, 38(23): 177-180.
[10]	崇元, 徐晓刚. 基于BEMD与NMF的多源遥感图像融合[J]. 计算机工程, 2012, 38(23): 224-226,230.
[11]	胡峰松, 曹孝玉. 基于Gammatone滤波器组的听觉特征提取[J]. 计算机工程, 2012, 38(21): 168-170,174.
[12]	徐晨, 曹辉, 赵晓. 基于SVM的说话人识别参数选择方法[J]. 计算机工程, 2012, 38(21): 175-177.
[13]	武宁, 肖星星, 冯瑞. 家用机器人的说话人识别系统[J]. 计算机工程, 2012, 38(2): 207-209.
[14]	张浩, 吴敏. 校园网络流量自相似性分析与研究[J]. 计算机工程, 2012, 38(08): 73-75.
[15]	张学锋, 王芳, 夏萍. 融合LPC与MFCC的特征参数[J]. 计算机工程, 2011, 37(4): 216-217.

选择文件类型/文献管理软件名称

选择包含的内容

基于信息融合的短语音说话人识别方法研究

Research on Speaker Recognition Method of Little Speech Data Based on Information Fusion

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价

模态框（Modal）标题

选择文件类型/文献管理软件名称

选择包含的内容

基于信息融合的短语音说话人识别方法研究

Research on Speaker Recognition Method of Little Speech Data Based on Information Fusion

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价