作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2008, Vol. 34 ›› Issue (1): 221-223. doi: 10.3969/j.issn.1000-3428.2008.01.076

• 人工智能及识别技术 • 上一篇    下一篇

自动文本摘要方法

江开忠1,2,李子成1,顾君忠1   

  1. (1. 华东师范大学信息学院计算机科学与技术系,上海 200062;2. 上海工程技术大学基础教学学院,上海 201620)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2008-01-05 发布日期:2008-01-05

Method for Automatic Text Summarization

JIANG Kai-zhong1,2, LI Zi-cheng1, GU Jun-zhong1   

  1. (1. Department of Computer Science and Technology, East China Normal University, Shanghai 200062; 2. College of Fundamental Studies, Shanghai University of Engineering Science, Shanghai 201620)
  • Received:1900-01-01 Revised:1900-01-01 Online:2008-01-05 Published:2008-01-05

摘要: 自动文本摘要是继信息检索之后信息或知识获取的一个重要步骤,对高质量的文档文摘十分重要。该文提出以句子为基本抽取单位,以位置和标题关键词为句子的加权特征,对句子基于潜语义聚类,提出语义结构的摘要方法。同时给出了较为客观和有效的摘要评价方法。实验表明了该方法的有效性。

关键词: 自动文本摘要, 语义结构, 摘要评价

Abstract: Automatic abstraction is a key step for information extraction or knowledge acquisition in the aftermath of Information Index. The high quality summarization is significant at the time of massive information. This paper proposes a method by extracting a set of sentences from the text as the abstract based on the semantic structure, which is formed by clustering all the sentences of the document. By employing this semantic structure, summarization quality can be improved. The paper also presents a set of formula to evaluate the automatic abstracting impersonally and efficaciously. Evaluation and experimental results show that the algorithm is effective.

Key words: automatic text summarization, semantic structure, summarization evaluation

中图分类号: