基于维基百科的语义相似度计算方法

doi:10.3969/j.issn.1000-3428.2011.07.065

计算机工程 ›› 2011, Vol. 37 ›› Issue (7): 193-195. doi: 10.3969/j.issn.1000-3428.2011.07.065

基于维基百科的语义相似度计算方法

盛志超，陶晓鹏

(复旦大学计算机科学技术学院，上海 200433)

出版日期:2011-04-05 发布日期:2011-03-31
作者简介:盛志超(1984－)，男，硕士研究生，主研方向：语义比较，文本分类；陶晓鹏，副教授

Semantic Similarity Computing Method Based on Wikipedia

SHENG Zhi-chao, TAO Xiao-peng

(School of Computer Science, Fudan University, Shanghai 200433, China)

Online:2011-04-05 Published:2011-03-31

摘要/Abstract

摘要： 针对目前语义计算准确率低、可理解性差的问题，提出一种基于维基百科的语义相似度计算方法。不同于利用分类信息计算词的语义相似度，该方法利用页面的链接信息，通过模仿人类联想的方式计算不同词之间的相似度，所得到的结果较容易被理解，并结合词语的语义类别提高计算结果的准确率。和现有算法的对比实验证明了该方法的优越性。

关键词: 页面网, 类别网, 维基百科, 人脑思维

Abstract: Aiming at the low accuracy and poor intelligibility of current algorithms for semantic analysis, a semantic similarity computing method based on Wikipedia is proposed. Different from computing word’s semantic similarity by category information, this method uses link information to calculate the similarity of different words in a way like human thinking. Result can be easily understood and the accuracy rate can be increased with semantic category. Experiment compared with current algorithms proves its advantage.

Key words: PageNet, CategoryNet, Wikipedia, human thinking

中图分类号:

TP391

盛志超, 陶晓鹏. 基于维基百科的语义相似度计算方法[J]. 计算机工程, 2011, 37(7): 193-195.

CHENG Zhi-Chao, DAO Xiao-Feng. Semantic Similarity Computing Method Based on Wikipedia[J]. Computer Engineering, 2011, 37(7): 193-195.

https://www.ecice06.com/CN/Y2011/V37/I7/193

[1]	江惠珍, 孙艳春, 黄罡. 基于知识图谱的GitHub层次化学习和检索服务[J]. 计算机工程, 2024, 50(5): 16-25.
[2]	荆琪,段利国,李爱萍,赵谦. 基于维基百科的短文本相关度计算[J]. 计算机工程, 2018, 44(2): 197-202.
[3]	李雁群,何云琪,钱龙华,周国栋. 基于维基百科的中文嵌套命名实体识别语料库自动构建[J]. 计算机工程, 2018, 44(11): 76-82.
[4]	王东, 牛军钰. 基于多角度关联模型的实体检索方法[J]. 计算机工程, 2013, 39(1): 71-75.
[5]	陈燕, 龙建勋. 基于明确语义分析的自动文摘算法[J]. 计算机工程, 2011, 37(3): 183-185.
[6]	史天艺;李明禄. 基于维基百科的自动词义消歧方法[J]. 计算机工程, 2009, 35(18): 62-65.

选择文件类型/文献管理软件名称

选择包含的内容

基于维基百科的语义相似度计算方法

Semantic Similarity Computing Method Based on Wikipedia

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 6

编辑推荐

Metrics

本文评价

模态框（Modal）标题

选择文件类型/文献管理软件名称

选择包含的内容

基于维基百科的语义相似度计算方法

Semantic Similarity Computing Method Based on Wikipedia

PDF

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 6

编辑推荐

Metrics

本文评价