作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2008, Vol. 34 ›› Issue (21): 45-47. doi: 10.3969/j.issn.1000-3428.2008.21.017

• 软件技术与数据库 • 上一篇    下一篇

维、哈、柯全文搜索引擎检索器的关键技术

吐尔地•托合提,维尼拉•木沙江,艾斯卡尔•艾木都拉   

  1. (新疆大学信息科学与工程学院,乌鲁木齐 830046)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2008-11-05 发布日期:2008-11-05

Key Techniques of Uyghur, Kazak, Kyrgyz Full-text Search Engine Retrieval Server

TURDI Tohti, WINIRA Musajan, ASKAR Hamdulla   

  1. (Information Science and Technology Institute, Xinjiang University, Urumqi 830046)
  • Received:1900-01-01 Revised:1900-01-01 Online:2008-11-05 Published:2008-11-05

摘要: 研究维、哈、柯全文搜索引擎检索器的关键问题,提出有效的解决方法,包括在用户计算机没有安装本地输入法和字库的情况下输入维、哈、柯文检索词并正常显示搜索结果,针对具有高拼写错误率的维、哈、柯文检索词进行检错、纠错处理,返回给用户正确而全面的搜索结果等。实验结果表明,该方法为用户提供方便的同时明显提高了维、哈、柯文搜索引擎的查全率和查准率。

关键词: 在线处理, 检错, 纠错, 词根切分, 同化处理

Abstract: This paper studies the key problems of Uyghur, Kazak, Kyrgyz full-text search engine retrieval server and proposes an effective solution, including inputting Uyghur, Kazak, Kyrgyz keywords and shows normal search results without installing the local input method in users’ computers, detecting and correcting Uyghur, Kazak, Kyrgyz keywords with high spelling mistake rate, returning the correct and comprehensive search results to users, etc. Experimental results indicat that the solution provides convenience for users and obviously improves the precision and recall of Uyghur, Kazak, Kyrgyz search engine.

Key words: online processing, error detection, error correction, root segmentation, assimilation processing

中图分类号: