作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2013, Vol. 39 ›› Issue (3): 258-263. doi: 10.3969/j.issn.1000-3428.2013.03.051

• 图形图像处理 • 上一篇    下一篇

基于语义学习的图像多模态检索

李志欣1,施智平2,陈宏朝1,吴璟莉1   

  1. (1. 广西师范大学计算机科学与信息工程学院,广西 桂林 541004;2. 首都师范大学信息工程学院,北京 100048)
  • 收稿日期:2012-03-08 出版日期:2013-03-15 发布日期:2013-03-13
  • 作者简介:李志欣(1971-),男,副教授、博士,主研方向:图像理解,模式识别,机器学习;施智平,副研究员、博士; 陈宏朝,副教授、硕士;吴璟莉,副教授、博士
  • 基金资助:
    国家自然科学基金资助项目(61165009, 60903141);广西自然科学基金资助项目(2012GXNSFAA053219, 2011GXN SFB018068);“八桂学者”工程专项基金资助项目

Multi-modal Image Retrieval Based on Semantic Learning

LI Zhi-xin 1, SHI Zhi-ping 2, CHEN Hong-chao 1, WU Jing-li 1   

  1. (1. College of Computer Science and Information Technology, Guangxi Normal University, Guilin 541004, China; 2. College of Information Engineering, Capital Normal University, Beijing 100048, China)
  • Received:2012-03-08 Online:2013-03-15 Published:2013-03-13

摘要: 针对语义鸿沟问题,在语义学习的基础上设计图像的多模态检索系统。该系统结合3种查询方式进行图像检索。基于视觉特征的查询通过特征提取与相似度匹配进行排位。基于标签的查询建立在图像自动标注的基础上,但在语义空间之外的泛化能力较差。基于语义图例的查询能够在很大程度上克服这个缺陷,通过在显式或隐式的语义空间上进行查询,使检索结果更符合人类感知。实验结果表明,与基于纹理特征的图像检索相比,基于语义图例的检索具有更高的精度及召回率。

关键词: 图像多模态检索, 图像自动标注, 概率主题建模, 概率潜在语义分析, 语义鸿沟, 语义学习, 语义多项式

Abstract: In order to bridge the semantic gap, a multi-modal image retrieval system is proposed based on semantic learning. The system combines three query modes to retrieval images. The paradigm of query by visual feature ranks images by feature extraction and similarity matching; The paradigm of query by label is based on automatic image annotation, but its generalization ability is not good outside the semantic space; The paradigm of Query by Semantic Example(QBSE) can overcome the problem to a great extent. It makes the retrieval more agreeable with human perception by executing the query in either explicit or implicit semantic space. Experimental results show that the paradigm of query by semantic example has higher precision and recall rate than image retrieval based on the texture feature.

Key words: multi-modal image retrieval, automatic image annotation, probabilistic topic modeling, probabilistic latent semantic analysis, semantic gap, semantic learning, semantic multinomial

中图分类号: