
Computer Engineering ›› 2011, Vol. 37 ›› Issue (22): 42-44. doi: 10.3969/j.issn.1000-3428.2011.22.011

• Networks and Communications •

Concept Semantic Similarity Algorithm in WordNet Based on Information Content

WANG Yan-na, ZHOU Zi-li, HE Yan   

  1. (College of Physics and Engineering, Qufu Normal University, Qufu 273165, China)
  • Received: 2011-05-11   Online: 2011-11-18   Published: 2011-11-20

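  • About the authors: WANG Yan-na (b. 1976), female, lecturer, M.S.; main research interests: semantic similarity algorithms, intelligent information processing, image processing. ZHOU Zi-li, associate professor, Ph.D. HE Yan, M.S. candidate.
  • Supported by: Shandong Province Outstanding Young and Middle-aged Scientists Research Award Fund (BS2010DX012)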

Abstract: A new algorithm for calculating the semantic similarity of concepts in WordNet based on Information Content (IC) is presented. The algorithm considers both the IC values of the concepts and the distance between the two concepts in WordNet's is_a taxonomy tree, which improves its performance. A new method for calculating the IC value of a concept is also given; it takes into account the number of child nodes of the concept and the concept's depth in the WordNet taxonomy tree, which makes the IC value more accurate. A comparison with five other semantic similarity algorithms shows that the proposed algorithm yields more accurate similarity values.

Key words: Information Content (IC), WordNet ontology, semantic similarity, child node, taxonomy tree
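
The abstract does not state the paper's formulas, so the following is only a minimal sketch of the general idea: an intrinsic IC value that grows as a concept has fewer nodes beneath it and sits deeper in the is_a tree. The toy taxonomy, the weighting parameter `k`, the exact form of `ic`, and the use of all descendants rather than direct children are assumptions made for illustration. The similarity shown is Lin's well-known IC-based measure, used here as a stand-in and not the measure proposed in the paper.

```python
import math

# Toy is_a taxonomy (hypothetical, not real WordNet data): concept -> parent.
PARENT = {
    "entity": None,
    "animal": "entity",
    "plant": "entity",
    "dog": "animal",
    "cat": "animal",
    "tree": "plant",
}

def ancestors(c):
    """Return the path from c up to the root, starting with c itself."""
    path = [c]
    while PARENT[path[-1]] is not None:
        path.append(PARENT[path[-1]])
    return path

def depth(c):
    """Number of nodes on the path from the root to c (the root has depth 1)."""
    return len(ancestors(c))

def descendant_count(c):
    """Number of concepts strictly below c in the taxonomy."""
    return sum(1 for n in PARENT if n != c and c in ancestors(n))

MAX_NODES = len(PARENT)
MAX_DEPTH = max(depth(c) for c in PARENT)

def ic(c, k=0.5):
    """Assumed IC form: weighted mix of a descendant-count term and a depth term."""
    hypo_term = 1.0 - math.log(descendant_count(c) + 1) / math.log(MAX_NODES)
    depth_term = math.log(depth(c)) / math.log(MAX_DEPTH)
    return k * hypo_term + (1 - k) * depth_term

def lowest_common_subsumer(a, b):
    """Deepest concept that is an ancestor of (or equal to) both a and b."""
    b_path = set(ancestors(b))
    return next(n for n in ancestors(a) if n in b_path)

def lin_similarity(a, b):
    """Lin's measure: 2 * IC(lcs) / (IC(a) + IC(b)); stand-in, not the paper's."""
    denom = ic(a) + ic(b)
    if denom == 0.0:
        return 1.0 if a == b else 0.0
    return 2.0 * ic(lowest_common_subsumer(a, b)) / denom

if __name__ == "__main__":
    for pair in [("dog", "cat"), ("dog", "tree")]:
        print(pair, round(lin_similarity(*pair), 4))
```

In this sketch the root gets an IC of 0 and the deepest leaves get an IC of 1, so sibling concepts under a specific parent score higher than concepts whose only shared ancestor is the root.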
