作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2010, Vol. 36 ›› Issue (19): 42-43. doi: 10.3969/j.issn.1000-3428.2010.19.014

• 软件技术与数据库 • 上一篇    下一篇

基于Wikipedia的语义相关度计算

刘 军,姚天昉   

  1. (上海交通大学计算机科学与工程系,上海 200240)
  • 出版日期:2010-10-05 发布日期:2010-09-27
  • 作者简介:刘 军(1981-),男,硕士研究生,主研方向:自然语言处理,意见挖掘;姚天昉,副教授、博士
  • 基金资助:

    国家自然科学基金资助项目(60773087)

Semantic Relevancy Computing Based on Wikipedia

LIU Jun, YAO Tian-fang   

  1. (Department of Computer Science and Engineering, Shanghai Jiaotong University, Shanghai 200240, China)
  • Online:2010-10-05 Published:2010-09-27

摘要:

在意见挖掘中,为实现特殊领域知识的语义相关度计算,提出基于Wikipedia的语义相关度计算方法。在构建Wikipedia类别树的基础上,通过Wikipedia类别向量表示Wikipedia中的词汇,形成一部包含各种领域知识的Wikipedia词典,利用该词典计算语义相关度。实验结果表明,该方法的斯皮尔曼等级相关系数可达到0.77。

关键词: 语义相关度, 领域知识, Wikipedia类别树, 意见挖掘

Abstract:

n order to compute semantic relevancy for the specific domain knowledge in opinion mining, this paper proposes a semantic relevancy computing method based on Wikipedia. On the basis of constructing a category tree from Wikipedia, it represents the vast words in Wikipedia by using the category and the result in a Wikipedia dictionary which contains rich domain-specific knowledge, and then computes semantic relevancy by using the dictionary. Experimental results show Spearman rank correlation coefficient of this method can reach 0.77.

Key words: semantic relevancy, domain knowledge, Wikipedia category tree, opinion mining

中图分类号: