作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程

• 人工智能及识别技术 • 上一篇    下一篇

基于二进制哈希与空间金字塔的视觉词袋模型生成方法

彭天强 1,栗芳 2   

  1. (1.河南工程学院 计算机学院,郑州 451191; 2.河南图像识别工程技术中心,郑州 450002)
  • 收稿日期:2016-01-14 出版日期:2016-12-15 发布日期:2016-12-15
  • 作者简介:彭天强(1978—),男,副教授,主研方向为图像识别与检索、大数据挖掘;栗芳,硕士研究生。
  • 基金资助:
    国家自然科学基金青年科学基金“基于位置敏感哈希的图像语义检索技术研究”(61301232);河南工程学院博士基金。

Generating Method of Visual Word Bag Model Based on Binary Hashing and Space Pyramid

PENG Tianqiang  1,LI Fang  2   

  1. (1.School of Computer Science,Henan University of Engineering,Zhengzhou 451191,China;2.Henan Image Recognition Engineering Center,Zhengzhou 450002,China)
  • Received:2016-01-14 Online:2016-12-15 Published:2016-12-15

摘要: 构建视觉词典是视觉词袋模型中的关键步骤,目前大多数视觉词典是基于k-means及其改进算法聚类生成。但由于k-means聚类的局限性以及样本空间结构的复杂性与高维性,该方式构建的视觉词典存在区分性较差、构建时间过长、不包含空间信息等问题。为此,提出一种改进的视觉词袋模型生成方法,以缩短视觉词典的构建时间。提取图像的局部特征点,构成局部特征点描述集。学习二进制哈希函数,将局部特征点映射为视觉单词,并对视觉词进行过滤,生成二进制哈希码的视觉词典。利用生成的视觉词典,结合空间金字塔匹配模型生成新的视觉词典模型,将图像表示为空间金字塔直方图向量,并应用于图像分类和检索。实验结果表明,该模型具有较高的分类精度和检索性能。

关键词: 二进制哈希, 空间金字塔匹配模型, 视觉词袋模型, 图像分类, 图像检索

Abstract: Constructing visual vocabulary in the Bag of Visual Word(BoVW) model is a critical step,most visual vocabulary is generated by the k-means algorithm or its improved algorithm.Because of the limitation of the k-means algorithm and the complexity and the high-dimensionality of the sample space,visual vocabulary generated by these methods have the problem of low discriminative long running time and without space information.For these problems,a BoVW model is proposed based on binary Hashing and space pyramid,which can shorten the visual vocabulary generation time sharply.It extracts the local feature points from the images,learns binary Hashing functions,which map the local feature points into visual words,filters the visual words and generates the visual vocabulary whose visual word is binary hash code.The new BoVW model is composed with the visual vocabulary and Spatial Pyramid Matching(SPM) model,which represents the images by the histogram vector of space pyramid and is applied in image classification and retrieval.Experimental results on the common datasets show that the model has higher classification accuracy and retrieval performance.

Key words: binary Hashing, Spatial Pyramid Matching(SPM) model, Bag of Visual Word(BoVW) model, image classification, image retrieval

中图分类号: