作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2011, Vol. 37 ›› Issue (19): 291-292. doi: 10.3969/j.issn.1000-3428.2011.19.096

• 开发研究与设计技术 • 上一篇    

基于Google与KL距离的概念相关度算法

连 宇,彭进业,谢红梅,冯晓毅   

  1. (西北工业大学电子信息学院,西安 710129)
  • 收稿日期:2011-03-24 出版日期:2011-10-05 发布日期:2011-10-05
  • 作者简介:连 宇(1985-),男,硕士研究生,主研方向:图像处理,模式识别;彭进业,教授、博士生导师;谢红梅,副教授;冯晓毅,教授、博士生导师
  • 基金资助:
    国家自然科学基金资助项目(61075014, 60875016);教育部博士点基金资助项目(20096102110025)

Concepts Similarity Algorithm Based on Google and KL Distance

LIAN Yu, PENG Jin-ye, XIE Hong-mei, FENG Xiao-yi   

  1. (School of Electronic and Information, Northwestern Polytechnical University, Xi’an 710129, China)
  • Received:2011-03-24 Online:2011-10-05 Published:2011-10-05

摘要: WordNet在计算概念相关度时存在词汇量小、难以及时扩展更新以及同义、近义、一词多义等问题。为此,提出一种结合文本信息和图像视觉信息的概念相关度方法。利用Google距离和KL距离分别计算基于词语同现频率的概念相关度和基于视觉特征的概念相关度,并结合两者得到概念的总体相关度。实验结果验证了该方法的有效性。

关键词: 概念相关度, WordNet网络, Google距离, KL距离, 视觉语言建模

Abstract: For WordNet has its limitation of the vocabulary and promptly expansion, the problem of the synonym, near-synonym and polysemy, this paper proposes a new concept similarity measurement algorithm, which combines text and image vision information. Google distance is used to calculate the semantic distance between the concepts based on co-occurrence of the words. And the visual similarity is calculated. The whole similarity between the concepts is measured by combining them. Experimental results verify the validity of this algorithm.

Key words: concepts similarity, WordNet, Google distance, Kullback-Leibler(KL) distance, vision language modeling

中图分类号: