作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2006, Vol. 32 ›› Issue (4): 94-96,99.

• 软件技术与数据库 • 上一篇    下一篇

基于 Lucene 的全文检索系统研究与开发

郎小伟,王申康   

  1. 浙江大学人工智能所,杭州 310027
  • 出版日期:2006-02-20 发布日期:2006-02-20

Research and Development of Full Text Search Engine Based on Lucene

LANG Xiaowei, WANG Shenkang   

  1. Artificial Intelligence Institute, Zhejiang University, Hangzhou 310027
  • Online:2006-02-20 Published:2006-02-20

摘要: 提出了一种基于Jakarta Lucene 的全文检索系统模型。该模型相对于Google 的站内检索,以及传统的数据库检索都有较为明显的优势。其关键字的拆分比对技术、信息检索的速度以及最终结果的排序都有独到之处。能够保证检索的前100 条记录最符合检索者的需要。

关键词: 索引;段;记录;域;关键字

Abstract: The paper proposes a system model for full text search engine based on Jakarta Lucene. This model provides more apparent advantages comparing to Google in-site and the original database search engine. Its division and comparison technology of keyword, the speed rate to index information and the target sorting results have their own special features

Key words: Index; Segment; Document; Field; Keyword