
Computer Engineering ›› 2011, Vol. 37 ›› Issue (19): 204-206,209. doi: 10.3969/j.issn.1000-3428.2011.19.067

• Graphics and Image Processing •

Visual Words Ambiguity Analysis in BOW Model

LIU Yang-wen, HUO Hong, FANG Tao

  1. (Institute of Image Processing and Pattern Recognition, Shanghai Jiaotong University, Shanghai 200240, China)
  • Received: 2011-04-26 Online: 2011-10-05 Published: 2011-10-05
  • About the authors: LIU Yang-wen (born 1985), male, master's student, main research interests: image classification and image processing; HUO Hong, lecturer, Ph.D.; FANG Tao, professor, doctoral supervisor
  • Supported by:
    National "973" Program of China (2006CB701303); National Natural Science Foundation of China (41071256)

Abstract: In the traditional Bag of Words (BOW) model, visual words are obtained by unsupervised clustering of the feature vectors of image patches, which ignores the semantic information and semantic properties of the visual words. To address this problem, this paper proposes a visual word ambiguity analysis method based on text categorization. An initial visual vocabulary (codebook) is generated with the traditional BOW model. Three text categorization techniques, namely document frequency, the χ² statistic, and information gain, are then used to analyze the semantic properties of the words and to remove ambiguous words that carry little class information. The histogram of the remaining visual words is used for image classification with a Support Vector Machine (SVM) classifier. Experimental results show that the method achieves relatively high classification accuracy.

Key words: image classification, visual words, text classification, Support Vector Machine (SVM), Bag of Words (BOW) model

CLC number:
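
The word-filtering step summarized in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's exact formulas, so the code below assumes the standard text categorization definitions of document frequency, the χ² statistic (maximum over classes), and information gain, and treats each image as a "document" represented by the set of visual-word ids it contains. The names `word_scores` and `keep_top`, the toy data, and the top-k selection rule are all hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def entropy(counts, total):
    """Shannon entropy (bits) of a class-count distribution; 0 if empty."""
    if not total:
        return 0.0
    return -sum((k / total) * math.log2(k / total) for k in counts if k > 0)


def word_scores(docs, labels, vocab_size):
    """Score every visual word with DF, chi-square and information gain.

    docs   -- list of sets; docs[i] holds the word ids present in image i
    labels -- list of class labels, one per image
    Assumed standard definitions, not necessarily the paper's.
    """
    n = len(docs)
    class_count = Counter(labels)
    df = [0] * vocab_size                      # images containing word w
    df_cls = [Counter() for _ in range(vocab_size)]  # same, split by class
    for words, cls in zip(docs, labels):
        for w in words:
            df[w] += 1
            df_cls[w][cls] += 1

    h_class = entropy(class_count.values(), n)
    chi2, ig = [], []
    for w in range(vocab_size):
        best = 0.0
        for cls, n_cls in class_count.items():
            a = df_cls[w][cls]        # in class, word present
            b = df[w] - a             # other classes, word present
            c = n_cls - a             # in class, word absent
            d = n - df[w] - c         # other classes, word absent
            denom = (a + c) * (b + d) * (a + b) * (c + d)
            if denom:
                best = max(best, n * (a * d - c * b) ** 2 / denom)
        chi2.append(best)

        present = [df_cls[w][cls] for cls in class_count]
        absent = [class_count[cls] - df_cls[w][cls] for cls in class_count]
        cond = (df[w] / n) * entropy(present, df[w]) \
            + ((n - df[w]) / n) * entropy(absent, n - df[w])
        ig.append(h_class - cond)     # H(C) - H(C | word present/absent)
    return df, chi2, ig


def keep_top(scores, k):
    """Return the ids of the k highest-scoring words, in ascending id order."""
    ranked = sorted(range(len(scores)), key=lambda w: -scores[w])
    return sorted(ranked[:k])


if __name__ == "__main__":
    # Toy data: word 2 appears in every image, so it carries no class information.
    docs = [{0, 2}, {0, 2}, {1, 2}, {1, 2}]
    labels = ["a", "a", "b", "b"]
    df, chi2, ig = word_scores(docs, labels, 3)
    print(df, chi2, ig, keep_top(chi2, 2))
```

In this toy data, word 2 occurs in all four images, so its χ² and information gain scores are zero and it is dropped when the top two words are kept. In a full pipeline, the histograms built over the retained words would then be passed to an SVM classifier, as the abstract describes.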