
Computer Engineering ›› 2011, Vol. 37 ›› Issue (19): 160-162. doi: 10.3969/j.issn.1000-3428.2011.19.052

• Artificial Intelligence and Recognition Technology •

Two-stage Specific Audio Retrieval Method Based on K-L Distance

QI Xiao-qian, CHEN Hong-chang, HUANG Hai

  1. (School of Information Engineering, PLA Information Engineering University, Zhengzhou 450002, China)
  • Received: 2011-03-30 Online: 2011-10-05 Published: 2011-10-05
  • About the authors: QI Xiao-qian (b. 1984), female, master's degree, main research interest: specific audio retrieval; CHEN Hong-chang, professor; HUANG Hai, Ph.D.
  • Funding:
    National High Technology Research and Development Program of China ("863" Program) (2008AA011002)

Abstract: Audio files are large, and their data are correlated to some degree. Exploiting these characteristics, this paper proposes a two-stage specific audio retrieval method based on the K-L distance. In the first stage, a histogram retrieval method with a variable threshold quickly screens out the audio files that are highly similar to the query. In the second stage, the K-L distance between feature matrices is used to compare the remaining candidates precisely. Experimental results show that the method achieves a retrieval accuracy of about 90%.

Key words: specific audio retrieval, Zero Crossing Rate (ZCR), histogram, Mel Frequency Cepstral Coefficient (MFCC), K-L distance
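
The two stages named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and several choices are assumptions made only for illustration: stage 1 compares normalized ZCR histograms by histogram intersection against a fixed `threshold` (the paper uses a variable threshold whose rule is not given here), and stage 2 fits a diagonal Gaussian to each feature matrix (for example MFCC frames computed elsewhere) and uses the symmetric K-L distance between the two Gaussians. All function and parameter names are hypothetical.

```python
import numpy as np


def frame_zcr(signal, frame_len=256):
    """Zero crossing rate of each non-overlapping frame (values in [0, 1])."""
    n = len(signal) // frame_len
    frames = np.asarray(signal[: n * frame_len]).reshape(n, frame_len)
    negative = frames < 0
    # A crossing occurs wherever adjacent samples differ in sign.
    return np.mean(negative[:, 1:] != negative[:, :-1], axis=1)


def zcr_histogram(zcr, bins=16):
    """Normalized histogram of per-frame ZCR values."""
    hist, _ = np.histogram(zcr, bins=bins, range=(0.0, 1.0))
    return hist / max(hist.sum(), 1)


def histogram_similarity(h1, h2):
    """Histogram intersection: 1.0 for identical normalized histograms."""
    return float(np.minimum(h1, h2).sum())


def symmetric_kl(X, Y, eps=1e-8):
    """Symmetric K-L distance between diagonal Gaussians fitted to two
    feature matrices of shape (frames, dims). Assumption: diagonal Gaussians."""
    mu_x, mu_y = X.mean(axis=0), Y.mean(axis=0)
    var_x, var_y = X.var(axis=0) + eps, Y.var(axis=0) + eps
    d2 = (mu_x - mu_y) ** 2
    terms = var_x / var_y + var_y / var_x - 2.0 + d2 * (1.0 / var_x + 1.0 / var_y)
    return float(0.5 * np.sum(terms))


def two_stage_search(query_zcr, query_feat, stream_zcr, stream_feat, threshold=0.8):
    """Slide a query-length window over the stream.

    Stage 1 discards windows whose ZCR histogram similarity is below
    `threshold` (fixed here, unlike the paper's variable threshold).
    Stage 2 ranks the surviving windows by symmetric K-L distance.
    Returns (start_frame, distance) of the best window, or None.
    """
    w = len(query_zcr)
    q_hist = zcr_histogram(query_zcr)
    best = None
    for start in range(len(stream_zcr) - w + 1):
        window_hist = zcr_histogram(stream_zcr[start:start + w])
        if histogram_similarity(q_hist, window_hist) < threshold:
            continue  # cheap rejection before the costlier comparison
        dist = symmetric_kl(query_feat, stream_feat[start:start + w])
        if best is None or dist < best[1]:
            best = (start, dist)
    return best
```

The cheap histogram test runs on every window, while the K-L comparison runs only on windows that pass it, which is the motivation the abstract gives for splitting retrieval into a coarse and a fine stage.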

CLC Number: