Author Login Chief Editor Login Reviewer Login Editor Login Remote Office

Computer Engineering ›› 2006, Vol. 32 ›› Issue (4): 30-32,121.

• Degree Paper • Previous Articles     Next Articles

Research and Implementation About Automatic Abstract System Based on Text Clustering

GUO Qinglin1, 2, FAN Xiaozhong2, LIU Changan1   

  1. 1. Department of Computer Science, North China Electric Power University, Beijing 102206;2. Department of Computer Science, Beijing Institute of Technology, Beijing 100081
  • Online:2006-02-20 Published:2006-02-20

基于文本聚类的自动文摘系统的研究与实现

郭庆琳 1,2,樊孝忠2,柳长安1   

  1. 1. 华北电力大学(北京)计算机系,北京 102206;2. 北京理工大学计算机系,北京 100081

Abstract: This dissertation brings forward the method of automatic abstracting’s realization based on text clustering and natural language understanding. This method can overcome the shortage of the automatic abstracting’s generic realization, and improve greatly the quality of automatic abstracting. Especially, this method uses text clustering, it can not only improve greatly the quality of single document’s automatic abstracting, but also realize multi- document’s automatic abstracting. For a specific plastic domain, an automatic abstracting system TCAAC is implemented.

Key words: Automatic abstract; Text cluster; Natural language understanding

摘要: 针对当前自动文摘方法的不足,提出了基于文本聚类和自然语言理解的自动文摘实现方法。可以克服常规自动文摘方法的不足,使文摘的质量和效果得到大大的提高。将文本聚类引入自动文摘中,不但使单文档的文摘质量得到提高,而且能够实现多文档的自动文摘,这是现有的自动文摘技术所没有涉及的。实现了面向“塑料”行业的基于文本聚类和自然语言理解的自动文摘系统TCAAS。

关键词: 自动文摘;文本聚类;自然语言理解