
Computer Engineering ›› 2008, Vol. 34 ›› Issue (22): 49-51. doi: 10.3969/j.issn.1000-3428.2008.22.017

• Software Technology and Database •

Automatic Summarization Method Based on Extracting Sentences from Local Topics

XU Chao1, WANG Meng2, HE Ting-ting3, ZHANG Yong3

  1. Faculty of Software, Fujian Normal University, Fuzhou 350007; 2. Department of Computer Engineering, Guangxi University of Technology, Liuzhou 545006; 3. Department of Computer Science, Huazhong Normal University, Wuhan 430079
  • Received: 1900-01-01 Revised: 1900-01-01 Online: 2008-11-20 Published: 2008-11-20

Abstract: Automatic summarization is an important task in natural language processing. This paper proposes a method for automatic summarization of Chinese text based on extracting key sentences from local topics. The document is first segmented by topic using a hierarchical segmentation method, and a certain number of sentences are then extracted from each local topic unit to form the summary. Because the semantic structure of the document is analyzed in advance, the method avoids redundant summary sentences and the tendency to overlook topics that occupy only a small part of the text. Experimental results show that the method is effective.

Key words: automatic summarization, topic segmentation, local topic unit
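
The two-stage idea in the abstract (segment the document into local topic units, then take sentences from every unit) can be illustrated with a minimal sketch. The abstract does not specify the paper's hierarchical segmentation procedure or its sentence-scoring function, so this sketch substitutes two simpler techniques as assumptions: a topic boundary is placed wherever the cosine similarity of adjacent sentences falls below a threshold, and sentences are ranked by similarity to their unit's term centroid. The names `bag`, `cosine`, `segment`, `summarize`, `threshold`, and `per_unit` are hypothetical, and whitespace tokenization stands in for a real Chinese word segmenter.

```python
import math
from collections import Counter


def bag(sentence):
    """Bag-of-words vector. Whitespace splitting is a placeholder for a
    Chinese word segmenter (assumption, not from the paper)."""
    return Counter(sentence.lower().split())


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def segment(sentences, threshold=0.1):
    """Group sentence indices into topic units. A new unit starts when
    adjacent sentences are less similar than `threshold` (a simplified
    stand-in for the paper's hierarchical segmentation)."""
    if not sentences:
        return []
    units = [[0]]
    for i in range(1, len(sentences)):
        if cosine(bag(sentences[i - 1]), bag(sentences[i])) < threshold:
            units.append([i])
        else:
            units[-1].append(i)
    return units


def summarize(sentences, per_unit=1, threshold=0.1):
    """Pick the `per_unit` sentences closest to each unit's centroid, so
    every local topic contributes, then restore document order."""
    picked = []
    for unit in segment(sentences, threshold):
        centroid = Counter()
        for i in unit:
            centroid.update(bag(sentences[i]))
        ranked = sorted(
            unit,
            key=lambda i: cosine(bag(sentences[i]), centroid),
            reverse=True,
        )
        picked.extend(ranked[:per_unit])
    return [sentences[i] for i in sorted(picked)]


if __name__ == "__main__":
    doc = [
        "cats chase mice",
        "cats chase mice at night",
        "stocks fell sharply",
        "stocks fell",
    ]
    for line in summarize(doc):
        print(line)
```

Selecting a fixed number of sentences per unit, rather than ranking all sentences globally, is what keeps a short topic from being crowded out by a longer one, which matches the coverage goal stated in the abstract.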
