摘要: 传统FTP搜索引擎对检索结果优化程度不够,会降低检索质量。在FTP用户查询日志的统计分析基础上,采用双字节倒排索引、检索结果自动分类以及查询自动纠错等技术设计了一种高性能的智能化FTP搜索引擎。试验表明该方案能够有效地提高FTP文件检索效率与质量,平均检索响应时间低于500 ms,检索准确率为92.5%。
关键词:
FTP搜索引擎,
倒排索引,
自动分类,
自动纠错
Abstract: The quality of query results in the traditional FTP search engines is low because it is not optimized. To solve the problem, this paper builds a high performance and intelligentized FTP search engine——KFSE, based on the analysis of FTP user query logs. The double bytes inverted index, automatic classification of query results, and automatic rectify mistake for users are adopted in the system. Validity of the scheme is proved in the real system and it can improve the query efficiency and quality for the FTP search engine. The average of response time is lower than 500 ms and the precision is 92.5%.
Key words:
FTP search engine,
inverted index,
automatic classification,
automatic rectify mistake
中图分类号:
胡 亮;傅泽田;张小栓;赵 明;郭立力;宫薇薇. K-FTP搜索引擎的核心技术[J]. 计算机工程, 2008, 34(13): 19-20,2.
HU Liang; FU Ze-tian; ZHANG Xiao-shuan; ZHAO Ming; GUO Li-li; GONG Wei-wei. Kernel Technology of K-FTP Search Engine[J]. Computer Engineering, 2008, 34(13): 19-20,2.