Semantic Similarity Computing Method Based on Wikipedia

doi:10.3969/j.issn.1000-3428.2011.07.065

Computer Engineering ›› 2011, Vol. 37 ›› Issue (7): 193-195.

• Networks and Communications • Previous Articles Next Articles

Semantic Similarity Computing Method Based on Wikipedia

SHENG Zhi-chao, TAO Xiao-peng

(School of Computer Science, Fudan University, Shanghai 200433, China)

Online:2011-04-05 Published:2011-03-31

基于维基百科的语义相似度计算方法

盛志超，陶晓鹏

(复旦大学计算机科学技术学院，上海 200433)

作者简介:盛志超(1984－)，男，硕士研究生，主研方向：语义比较，文本分类；陶晓鹏，副教授

Abstract

Abstract: Aiming at the low accuracy and poor intelligibility of current algorithms for semantic analysis, a semantic similarity computing method based on Wikipedia is proposed. Different from computing word’s semantic similarity by category information, this method uses link information to calculate the similarity of different words in a way like human thinking. Result can be easily understood and the accuracy rate can be increased with semantic category. Experiment compared with current algorithms proves its advantage.

Key words: PageNet, CategoryNet, Wikipedia, human thinking

摘要： 针对目前语义计算准确率低、可理解性差的问题，提出一种基于维基百科的语义相似度计算方法。不同于利用分类信息计算词的语义相似度，该方法利用页面的链接信息，通过模仿人类联想的方式计算不同词之间的相似度，所得到的结果较容易被理解，并结合词语的语义类别提高计算结果的准确率。和现有算法的对比实验证明了该方法的优越性。

关键词: 页面网, 类别网, 维基百科, 人脑思维

CLC Number:

TP391

CHENG Zhi-Chao, DAO Xiao-Feng. Semantic Similarity Computing Method Based on Wikipedia[J]. Computer Engineering, 2011, 37(7): 193-195.

盛志超, 陶晓鹏. 基于维基百科的语义相似度计算方法[J]. 计算机工程, 2011, 37(7): 193-195.

/ Recommend / Download Citations

URL:

https://www.ecice06.com/EN/Y2011/V37/I7/193

[1]	JIANG Huizhen, SUN Yanchun, HUANG Gang. GitHub Hierarchical Learning and Retrieval Service Based on Knowledge Graphs [J]. Computer Engineering, 2024, 50(5): 16-25.
[2]	JING Qi,DUAN Liguo,LI Aiping,ZHAO Qian. Short Text Correlation Calculation Based on Wikipedia [J]. Computer Engineering, 2018, 44(2): 197-202.
[3]	LI Yanqun,HE Yunqi,QIAN Longhua,ZHOU Guodong. Automatic Construction of Chinese Nested Named Entity Recognition Corpus Based on Wikipedia [J]. Computer Engineering, 2018, 44(11): 76-82.
[4]	WU Shun-yao, SHAO Feng-jing, WANG Jin-long, SUN Ren-cheng, WANG Ying. Document Clustering Fused with Semantic Resources and Key Words [J]. Computer Engineering, 2014, 40(4): 223-227.
[5]	WANG Dong, NIU Jun-Yu. Entity Retrieval Method Based on Multi-perspective Association Model [J]. Computer Engineering, 2013, 39(1): 71-75.
[6]	CHEN Yan, LONG Jian-Xun. Automatic Abstraction Algorithm Based on Explicit Semantic Analysis [J]. Computer Engineering, 2011, 37(3): 183-185.
[7]	LIU Jun, TAO Tian-Fang. Semantic Relevancy Computing Based on Wikipedia [J]. Computer Engineering, 2010, 36(19): 42-43.
[8]	SHI Tian-yi; LI Ming-lu. Automatic Word Sense Disambiguation Method Based on Wikipedia [J]. Computer Engineering, 2009, 35(18): 62-65.

Please choose a citation manager

Content to export

Semantic Similarity Computing Method Based on Wikipedia

基于维基百科的语义相似度计算方法

PDF

Knowledge

Cited

Abstract

Cite this article

share this article

References

Related Articles 8

Recommended Articles

Metrics

Comments

模态框（Modal）标题

Please choose a citation manager

Content to export

Semantic Similarity Computing Method Based on Wikipedia

基于维基百科的语义相似度计算方法

PDF

Knowledge

Cited

Abstract

Cite this article

share this article

References

Related Articles 8

Recommended Articles

Metrics

Comments