Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering ›› 2007, Vol. 33 ›› Issue (24): 217-219. doi: 10.3969/j.issn.1000-3428.2007.24.076

• Artificial Intelligence and Recognition Technology • Previous Articles     Next Articles

Method of Multi-text Fusion and Extraction Based on Flexible Intervals

HUANG Wen-tao, XU Ling-yu, LI Yan, WU Zao-liang   

  1. School of Computer Engineering and Science, Shanghai University, Shanghai 200072
  • Received:1900-01-01 Revised:1900-01-01 Online:2007-12-20 Published:2007-12-20

基于柔性区间的多文本融合提取方法

黄文涛,徐凌宇,李 严,吴早亮   

  1. 上海大学计算机工程与科学学院,上海 200072

Abstract: Instead of traditional method of inputting keywords according to human experience, this paper analyzes a new information retrieval method. When searching special area information, it fuses sample documents, extracts common keywords, and adjusts the sample documents to control the search space and result sum. The method avoids the warp being at the traditional method inducing by the subjectivity and unilateralism of the human experience to a certain extent, can reach the excellent harmony between the recall ration and the result sum, achieving the best cost-performance.

Key words: text filtering, samples fusion, similarity computation, intervals control

摘要: 探索了一种新的检索方式,代替了依赖人工经验输入关键字检索的传统方法。在检索特定领域信息时,通过相关样本集融合,提取出关键词集,通过调节样本集实现关键词集的柔性控制,以调控搜索空间与结果取向。该方法在一定程度上避免了人类经验的主观性、片面性和关键词任选偏差,可以使查全率与结果数量达到最佳协调,实现最优性价比。

关键词: 文本过滤, 样本融合, 相似度计算, 区间调节

CLC Number: