摘要: 自动文本摘要是继信息检索之后信息或知识获取的一个重要步骤,对高质量的文档文摘十分重要。该文提出以句子为基本抽取单位,以位置和标题关键词为句子的加权特征,对句子基于潜语义聚类,提出语义结构的摘要方法。同时给出了较为客观和有效的摘要评价方法。实验表明了该方法的有效性。
关键词:
自动文本摘要,
语义结构,
摘要评价
Abstract: Automatic abstraction is a key step for information extraction or knowledge acquisition in the aftermath of Information Index. The high quality summarization is significant at the time of massive information. This paper proposes a method by extracting a set of sentences from the text as the abstract based on the semantic structure, which is formed by clustering all the sentences of the document. By employing this semantic structure, summarization quality can be improved. The paper also presents a set of formula to evaluate the automatic abstracting impersonally and efficaciously. Evaluation and experimental results show that the algorithm is effective.
Key words:
automatic text summarization,
semantic structure,
summarization evaluation
中图分类号:
江开忠;李子成;顾君忠. 自动文本摘要方法[J]. 计算机工程, 2008, 34(1): 221-223.
JIANG Kai-zhong; LI Zi-cheng; GU Jun-zhong. Method for Automatic Text Summarization[J]. Computer Engineering, 2008, 34(1): 221-223.