作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2012, Vol. 38 ›› Issue (22): 276-278. doi: 10.3969/j.issn.1000-3428.2012.22.069

• 开发研究与设计技术 • 上一篇    下一篇

基于改进视觉词袋模型的图像标注方法

霍 华,赵 刚   

  1. (河南科技大学电子信息工程学院,河南 洛阳 471003)
  • 收稿日期:2012-02-07 修回日期:2012-03-27 出版日期:2012-11-20 发布日期:2012-11-17
  • 作者简介:霍 华(1968-),男,副教授、博士后,主研方向:智能信息处理,光纤通道技术,嵌入式系统;赵 刚,硕士研究生
  • 基金资助:

    国家自然科学基金资助项目(60743008);河南省国际科技合作计划基金资助项目(104300510063)

Image Annotation Method Based on Improved BoVW Model

HUO Hua, ZHAO Gang   

  1. (Electronic Information Engineering College, Henan University of Science and Technology, Luoyang 471003, China)
  • Received:2012-02-07 Revised:2012-03-27 Online:2012-11-20 Published:2012-11-17

摘要: 针对传统视觉词袋模型对图像尺度变化较为敏感的缺点,提出一种基于改进视觉词袋模型的图像标注方法。该方法引入图像的多尺度空间信息,对图像进行多尺度变换并构建多尺度视觉词汇表,将图像表示为不同尺度特征,结合多核学习的方法优化各尺度特征的相应权重,获取特征表示。实验结果验证了该方法的有效性,其标注准确率比传统BoVW模型提高17.8%~25.7%。

关键词: 图像标注, 视觉词袋模型, 多尺度空间, 多尺度视觉词, 多核学习, 权重优化

Abstract: Aiming at overcoming the traditional Bag of Visual Word(BoVW) model’s sensitivity to image scale’s variation, this paper proposes an image annotation method based on improved BoVW model. It incorporates with multiple spaces information and transfers original images into multiple scale spaces and constructs multiple scale vocabularies. Images are represented as a family of feature histograms with different scale. Multiple kernel learning is introduced to optimize the histograms weights of different scale in order to acquire discriminative classifying power. Experimental results prove the validity of the method, it outperforms BoVW on image annotation precision ranged from 17.8% to 25.7%.

Key words: image annotation, Bag of Visual Word(BoVW) model, multiple scale space, multiple scale visual word, multiple kernel learning, weight optimization

中图分类号: