Computer Engineering ›› 2008, Vol. 34 ›› Issue (22): 70-72. doi: 10.3969/j.issn.1000-3428.2008.22.024

• Software Technology and Database •

Research and Design of Topic-specific Meta-search Engine Based on Bogus Crawler

MA Yi-ping, ZHUANG Yi, YE Yan-feng, ZHANG Xia   

  1. (Department of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 210016)
  • Online: 2008-11-20 Published: 2008-11-20

Abstract: To improve the precision and recall of search, a topic-specific meta-search engine is designed together with a crawler-like component called a bogus crawler. The bogus crawler collects information by calling general-purpose search engines, so its recall is higher than that of a general-purpose search engine. A feedback mechanism that draws on the user's query history brings the search results closer to what the user wants. The topic-specific strategy and an improved document-similarity algorithm raise classification accuracy, search precision, and search coverage, while reducing system response time and lowering the performance demands on the server.

Key words: meta-search, topic-specific, search engine, bogus crawler
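
The abstract does not give the paper's algorithms, so the following is only a minimal sketch of the general idea it describes: a crawler-like component that queries several general-purpose search engines, merges their results, and filters them by similarity to a topic. Everything in it is an assumption made for illustration: the function names (`pseudo_crawl`, `rank_by_topic`), the stub engines `engine_a` and `engine_b`, the example URLs, and the use of plain term-frequency cosine similarity in place of the improved document-similarity algorithm that the paper mentions but does not specify here.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def pseudo_crawl(query, engines):
    """Send the query to every engine and merge results, dropping duplicate URLs."""
    seen = {}
    for engine in engines:
        for url, snippet in engine(query):
            seen.setdefault(url, snippet)  # keep the first snippet seen per URL
    return list(seen.items())


def rank_by_topic(results, topic_text, threshold=0.0):
    """Score snippets against the topic text; keep those strictly above threshold."""
    topic_vec = Counter(tokenize(topic_text))
    scored = [(cosine_similarity(Counter(tokenize(s)), topic_vec), url)
              for url, s in results]
    kept = [(score, url) for score, url in scored if score > threshold]
    return [url for score, url in sorted(kept, key=lambda p: -p[0])]


# Hypothetical stand-ins for calls to general-purpose search engines.
def engine_a(query):
    return [("http://a.example/1", "python web crawler tutorial"),
            ("http://shared.example/x", "search engine crawler design")]


def engine_b(query):
    return [("http://shared.example/x", "search engine crawler design"),
            ("http://b.example/2", "cooking recipes for dinner")]


merged = pseudo_crawl("crawler", [engine_a, engine_b])
ranked = rank_by_topic(merged, "search engine crawler")
print(ranked)
```

In this toy run the URL returned by both engines is kept once, and the off-topic cooking result is dropped because its similarity to the topic text is zero. A real system would replace the stubs with actual engine queries and would add the query-history feedback that the abstract describes.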

CLC Number: