Computer Engineering ›› 2018, Vol. 44 ›› Issue (9): 269-273. doi: 10.19678/j.issn.1000-3428.0047419

• Multimedia Technology and Application •

Generalized Cross-correlation Time Delay Estimation Algorithm Based on Frequency Division in Reverberation Environment

HU Zhangfang, YUE Congcong, ZHANG Yi, LUO Yuan, LIU Yu

  1. Chongqing Information Accessibility and Service Robot Engineering Technology Research Center, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
  • Received: 2017-06-01 Online: 2018-09-15 Published: 2018-09-15
  • About the authors: HU Zhangfang (born 1969), female, associate professor; main research interests are audio information processing, image processing, and human-computer interaction. YUE Congcong (corresponding author), master's student. ZHANG Yi and LUO Yuan, professors. LIU Yu, master's student.
  • Funding:

    Chongqing Science and Technology Program project (cstc2015jcyjBX0066).

Abstract:

Reverberation in real indoor environments degrades the accuracy of sound source localization. To address this problem, a sound source localization algorithm based on frequency-division generalized cross-correlation time delay estimation is proposed. The binaural signals are passed through a Gammatone filter to obtain subband signals, and each subband signal is dereverberated by minimum-phase decomposition. After inverse transformation from the cepstral domain back to the time domain, the cross-correlation is computed for each subband. The subband cross-correlation functions are then summed, and the lag at which the sum reaches its maximum gives the interaural time difference, from which the azimuth is obtained through a geometric model. Simulation results show that, compared with the PHAT-GCC sound source localization method, the proposed algorithm reduces the localization error caused by reverberation and improves the stability of sound source localization.

Key words: sound source localization, binaural cross-correlation, time delay estimation, Gammatone filter, azimuth
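
The processing chain named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It covers only the frequency division, the summation of subband cross-correlations, and the mapping from time difference to azimuth. The minimum-phase dereverberation step is omitted because the abstract does not specify it. The function names, the truncated FIR Gammatone approximation with ERB bandwidths, the 0.18 m ear spacing, the 343 m/s speed of sound, and the far-field model sin(theta) = c * ITD / d are all assumptions made for illustration.

```python
import numpy as np

SPEED_OF_SOUND = 343.0  # m/s, assumed value


def erb(fc):
    """Equivalent rectangular bandwidth in Hz (Glasberg and Moore formula)."""
    return 24.7 * (4.37 * fc / 1000.0 + 1.0)


def gammatone_ir(fc, fs, duration=0.025, order=4):
    """Truncated impulse response of a gammatone filter centred at fc (Hz)."""
    t = np.arange(int(duration * fs)) / fs
    envelope = t ** (order - 1) * np.exp(-2 * np.pi * 1.019 * erb(fc) * t)
    g = envelope * np.cos(2 * np.pi * fc * t)
    return g / np.sqrt(np.sum(g ** 2))  # normalise to unit energy


def xcorr(a, b, max_lag):
    """Linear cross-correlation sum_n a[n + k] * b[n] for k in [-max_lag, max_lag].

    max_lag must be at least 1.
    """
    nfft = 1 << (len(a) + len(b) - 1).bit_length()  # enough zero padding to avoid wrap-around
    spec = np.fft.rfft(a, nfft) * np.conj(np.fft.rfft(b, nfft))
    cc = np.fft.irfft(spec, nfft)
    return np.concatenate((cc[-max_lag:], cc[:max_lag + 1]))


def subband_itd(left, right, fs, centre_freqs, max_lag):
    """ITD in seconds from summed subband cross-correlations (positive: left lags right)."""
    total = np.zeros(2 * max_lag + 1)
    for fc in centre_freqs:
        h = gammatone_ir(fc, fs)
        l, r = np.convolve(left, h), np.convolve(right, h)
        # The per-subband dereverberation described in the abstract would go here.
        norm = np.sqrt(np.sum(l ** 2) * np.sum(r ** 2)) + 1e-12
        total += xcorr(l, r, max_lag) / norm
    lags = np.arange(-max_lag, max_lag + 1)
    return lags[np.argmax(total)] / fs


def azimuth_deg(itd, ear_distance=0.18):
    """Far-field geometric model (an assumption): sin(theta) = c * itd / ear_distance."""
    ratio = np.clip(SPEED_OF_SOUND * itd / ear_distance, -1.0, 1.0)
    return np.degrees(np.arcsin(ratio))


# Synthetic anechoic test: the left channel is the right channel delayed by 5 samples.
fs = 16000
src = np.random.default_rng(0).standard_normal(4000)
delay = 5
right = np.concatenate((src, np.zeros(delay)))
left = np.concatenate((np.zeros(delay), src))
itd = subband_itd(left, right, fs, np.linspace(200, 3000, 8), max_lag=9)
print(itd * fs, azimuth_deg(itd))
```

Each subband correlation is normalised by the subband energies before summation so that no single band dominates the sum. This weighting is a design choice of the sketch, not something stated in the abstract.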

CLC Number: