作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2010, Vol. 36 ›› Issue (12): 55-57. doi: 10.3969/j.issn.1000-3428.2010.12.019

• 软件技术与数据库 • 上一篇    下一篇

改进的基于概念相似度的文本检索

吕 刚1,2,郑 诚1   

  1. (1. 安徽大学计算智能与信号处理教育部重点实验室,合肥 230039;2. 合肥学院网络与智能信息处理重点实验室,合肥 230601)
  • 出版日期:2010-06-20 发布日期:2010-06-20
  • 作者简介:吕 刚(1978-),男,讲师、硕士研究生,主研方向:数据挖掘;郑 诚,副教授、博士
  • 基金资助:

    安徽省自然科学基金资助项目(050420204);安徽省高校自然科学研究基金资助项目(2006kj055B)

Modified Text Retrieval Based on Concept Similarity

LV Gang 1,2, ZHENG Cheng1   

  1. (1. Key Laboratory of Intelligent Computing & Signal Processing, Ministry of Education, Anhui University, Hefei 230039;2. Key Laboratory of Network and Intelligent Information Processing, Hefei University, Hefei 230601)
  • Online:2010-06-20 Published:2010-06-20

摘要:

为提高信息检索的查全率和查准率,提出改进的本体语义相似度计算方法,利用本体中概念语义相似度对检索结果文档的分值进行重新计算,过滤掉与原始查询相关度较小的文档。给出定义查询扩展中的迭代参数,减少进行扩展的次数,提高查询效率。利用开源工具Jena, Lucene进行文本语义检索测试,验证该方法的可行性和有效性。

关键词: 语义检, 本体, 语义相似度, 查询扩展, 文档分值

Abstract:

To enhance information retrieval recall and precision, this paper proposes an improved method of calculating ontology semantic similarity. To filter out the document which has smaller related degree with origin query, the scores of search results document are re-calculated by use of ontology semantic similarity. Put forward a definition of the iterative query expansion parameters, reducing the number of expansion and improve the efficiency of query. By using open source tools Jena, Lucene for text semantic retrieval test, the proposed method is verified feasibility and effectiveness.

Key words: information retrieval, ontology, semantic similarity, query expansion, document scores

中图分类号: