计算机工程

• 多媒体技术及应用 • 上一篇    下一篇

基于GPU加速的音频检索技术

金国平1,余宗桥1,郭延文1,蒋 和2   

  1. (1. 南京大学计算机科学与技术系,南京 210046;2. 南京师范大学数学科学学院,南京 210046)
  • 收稿日期:2013-04-15 出版日期:2014-05-15 发布日期:2014-05-14
  • 作者简介:金国平(1988-),男,硕士研究生,主研方向:多媒体信息检索;余宗桥,硕士研究生;郭延文(通讯作者),副教授、博士、CCF会员;蒋 和,本科生。
  • 基金项目:
    国家自然科学基金资助项目(61073098, 61021062);国家“973”计划基金资助项目(2010CB327903);江苏省自然科学基金资助项目(BK2009081)。

Audio Retrieval Technology Based on GPU Acceleration

JIN Guo-ping 1, YU Zong-qiao 1, GUO Yan-wen 1, JIANG He 2   

  1. (1. Department of Computer Science and Technology, Nanjing University, Nanjing 210046, China; 2. School of Mathematical Sciences, Nanjing Normal University, Nanjing 210046, China)
  • Received:2013-04-15 Online:2014-05-15 Published:2014-05-14

摘要: 由于数字音频数据量极大的特点,采用传统音频检索方法会导致等待时间过长。为加快音频检索时间,提出一种基于GPU加速的数字音频检索方法。利用数字音频的特征将连续的音频划分成等长的多个短时音频段,采用GPU加速算法计算每个短时音频段的特征值,将各段的特征值构成特征矩阵。使用后缀数组的变形算法找出2个特征值序列的公共特征段落集合,并将公共特征段落集合进行精化和整体匹配,从而得出检索结果。实验结果表明,该检索方法的准确率可以达到95%以上,与已有方法相比,可以大幅度地提高检索速度,加速比可以达到10倍以上。

关键词: 音频检索, GPU加速, 后缀数组, 音频特征, 特征值序列, 整体匹配

Abstract: As digital audio has a feature of great data volume, traditional audio retrieval method results are used in intolerable response time. In order to speed up audio retrieval, this paper proposes a GPU acceleration audio retrieval method. The audio is divided into multiple short audio segments based on the features, and the characteristic matrix is constituted by the eigenvalues which is calculated from each short audio segment using the GPU acceleration algorithm. The suffix array deformation algorithm is used to find the common set from the two eigenvalues sequence. The common set is refined and overall matched to get the retrieval result. Experimental results show that the retrieval accuracy is over 95% and compared with existing algorithms, this method can significantly improve the retrieval speed and speedup can be achieved in more than 10 times.

Key words: audio retrieval, GPU acceleration, suffix array, audio characteristic, eigenvalue sequence, overall match

中图分类号: