摘要: 介绍了一种基于主题的分布式信息检索方法,并对算法的有效性进行了深入的分析。该文通过文本聚类方法,把文档按照主题的方式来划分,经过实验发现查询答案明显地汇聚在少数的文档集合中。由此表明,基于主题的分布式信息检索方法比传统分布式信息检索方法在检索效果上有了显著的提高。
关键词: 分布式信息检索;文本聚类;K 平均聚类
Abstract: This paper introduces a topic-based distributed information retrieval method and analyzes in depth why the algorithm is effective. Documents are partitioned by topic using text clustering, and experiments show that the answers to a query concentrate in a small number of document collections. This indicates that topic-based distributed information retrieval significantly improves retrieval effectiveness over traditional distributed information retrieval methods.
Key words: Distributed information retrieval; Text clustering; K-means clustering
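The idea summarized in the abstract, partitioning documents into topical collections with K-means and then checking how strongly the answers to a query concentrate in a few collections, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the toy documents, the term-frequency weighting, the cosine-similarity variant of K-means, the fixed seed documents, and the helper names `tf_vector`, `kmeans`, and `concentration`.

```python
import math
from collections import Counter


def tf_vector(text):
    """Bag-of-words term-frequency vector, L2-normalised (illustrative weighting)."""
    counts = Counter(text.lower().split())
    norm = math.sqrt(sum(c * c for c in counts.values()))
    return {term: c / norm for term, c in counts.items()}


def cosine(u, v):
    """Dot product of two sparse vectors; equals cosine similarity for unit vectors."""
    return sum(w * v.get(term, 0.0) for term, w in u.items())


def centroid(vectors):
    """Normalised sum of the member vectors of one collection."""
    total = Counter()
    for vec in vectors:
        for term, w in vec.items():
            total[term] += w
    norm = math.sqrt(sum(w * w for w in total.values())) or 1.0
    return {term: w / norm for term, w in total.items()}


def kmeans(vectors, seeds, iterations=10):
    """K-means with cosine similarity. `seeds` are indices of the documents
    used as initial centroids (a deterministic choice, assumed for the demo)."""
    centroids = [vectors[i] for i in seeds]
    assignment = []
    for _ in range(iterations):
        new_assignment = [
            max(range(len(centroids)), key=lambda c: cosine(vec, centroids[c]))
            for vec in vectors
        ]
        if new_assignment == assignment:  # converged
            break
        assignment = new_assignment
        for c in range(len(centroids)):
            members = [vec for vec, a in zip(vectors, assignment) if a == c]
            if members:  # keep the old centroid if a collection is empty
                centroids[c] = centroid(members)
    return assignment


def concentration(assignment, relevant, top_n=1):
    """Fraction of the relevant documents held by the `top_n` collections
    that contain the most relevant documents."""
    per_collection = Counter(assignment[i] for i in relevant)
    covered = sum(n for _, n in per_collection.most_common(top_n))
    return covered / len(relevant)


# Toy corpus (hypothetical): two topics, finance and football.
docs = [
    "stock market shares trading",
    "market shares fall stock prices",
    "football match goal team",
    "team wins football league match",
    "stock prices rise market",
    "goal scored football team",
]
vectors = [tf_vector(d) for d in docs]
assignment = kmeans(vectors, seeds=[0, 2])

# Suppose documents 0, 1 and 4 answer a finance query.
share = concentration(assignment, relevant=[0, 1, 4], top_n=1)
print(assignment, share)
```

In a distributed setting, a high concentration value suggests that a broker could forward the query to only the few collections most likely to hold the answers instead of searching all of them.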
张 刚,周昭涛,王 斌. 基于主题的分布式信息检索技术研究[J]. 计算机工程, 2006, 32(12): 80-81,84.
ZHANG Gang, ZHOU Zhaotao, WANG Bin. Research on Topic Based Distributed Information Retrieval Technology[J]. Computer Engineering, 2006, 32(12): 80-81,84.