作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2008, Vol. 34 ›› Issue (23): 193-195. doi: 10.3969/j.issn.1000-3428.2008.23.069

• 人工智能及识别技术 • 上一篇    下一篇

基于概率粗糙集模型的信息检索

黄治国1,朱承学2,薛 凡1,王加阳3   

  1. (1. 黄淮学院国际学院,驻马店 463000;2. 湖南第一师范学院信息技术系,长沙 410002;3. 中南大学信息科学与工程学院,长沙 410083)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2008-12-05 发布日期:2008-12-05

Information Retrieval Based on Probability Rough Set Model

HUANG Zhi-guo1, ZHU Cheng-xue2, XUE Fan1, WANG Jia-yang3   

  1. (1. International College, Huanghuai University, Zhumadian 463000; 2. Department of Information and Technology, Hunan First Normal College, Changsha 410002; 3.School of Information Science and Engineering, Central South University, Changsha 410083)
  • Received:1900-01-01 Revised:1900-01-01 Online:2008-12-05 Published:2008-12-05

摘要: 针对经典粗糙集模型难以分类标引空间以及体现类间关联的缺陷,将条件概率关系结合粗糙集理论引入信息检索,提出一种基于概率粗糙集的信息检索模型。定义标引词空间的条件概率关系,自动挖掘概念相似类形成概念空间。定义文档与查询、文档与文档间语义贴近度的计算方法。根据贴近度实现检索匹配结果的排序输出。仿真实例表明了该方法的可行性和有效性。

关键词: 粗糙集, 信息检索, 条件概率关系, 语义贴近度

Abstract: Aiming at the disadvantage of classical rough set theory on identifying the conceptually similar terms and the relationships between classes, this paper proposes a novel information retrieval model based on conditional probability relation and rough set. Conception space is formed by defining conditional probability relation in index words space to mine conception similar class automatically. A method is designed to calculate semantic distance between a document and a query, as well as documents. And the ordered outputs of retrieval result are acquired. The simulation instance shows that this algorithm is feasible and effective in practice.

Key words: rough set, information retrieval, conditional probability relation, semantic distance

中图分类号: