作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2008, Vol. 34 ›› Issue (11): 211-213. doi: 10.3969/j.issn.1000-3428.2008.11.076

• 人工智能及识别技术 • 上一篇    下一篇

基于分形特征的音频检索

李 坚1,毛先领2,文贵华2   

  1. (1. 华南理工大学计算机应用工程研究所,广州 510641;2. 华南理工大学计算机学院,广州 510641)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2008-06-05 发布日期:2008-06-05

Fractal Feature-based Audio Retrieval

LI Jian1, MAO Xian-ling2, WEN Gui-hua2   

  1. (1. Computer Engineering Institute, South China University of Technology, Guangzhou 510641; 2. Computer College, South China University of Technology, Guangzhou 510641)
  • Received:1900-01-01 Revised:1900-01-01 Online:2008-06-05 Published:2008-06-05

摘要: 提出利用分形几何抽取音频特征的全局化音频检索,将其学习阶段计算音频数据库中每个音频的分维作为特征向量,保存在音频特征数据库中,并建立索引。其检索阶段则首先计算查询音频的分维,然后从音频数据库中快速找出分维最相似的若干音频对象。分维刻画了音频的内在属性如自相似性,使其具有片段检索对匹配的起点不敏感、抗噪音、检索速度快等优点。用FRACTAL, MFCC和SOLAR 3种方法对数据集分别检索,实验结果表明基于分维的音频检索在性能和时间复杂度上有显著优势。

关键词: 音频检索, 分形, 音频特征

Abstract: The fractal geometry-based feature extraction is proposed for audio retrieval system. During the learning process, the system computes the fractal dimension as the feature vector for each audio in audio database and then saves it in the feature vector database. In the retrieval process, the fractal dimension for the query audio is firstly extracted, by which the most similar audios from the audio database are retrieved. The fractal dimension is intrinsic for each audio such as self-similarity so as to make it not sensitive to noise and position of the audio fragment to be retrieved from the long audio. It also retrieves the audios quickly. Compared with FRACTAL, MFCC and SOLAR, the experimental results validate that the proposed approach advances in performance and time complexity.

Key words: audio retrieval, fractal, audio feature

中图分类号: