作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2009, Vol. 35 ›› Issue (20): 184-186. doi: 10.3969/j.issn.1000-3428.2009.20.065

• 人工智能及识别技术 • 上一篇    下一篇

基于WFC和MI的主题句提取方法

薛扣英1,原 盛1,张心严2   

  1. (1. 西安交通大学电子与信息工程学院,西安 710049;2. 西安交通大学软件学院,西安 710049)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2009-10-20 发布日期:2009-10-20

Topic Sentence Extraction Method Based on Weight Fuzzy Clustering and Mutual Information

XUE Kou-ying1, YUAN Sheng1, ZHANG Xin-yan2   

  1. (1. School of Electronics and Information Engineering, Xi’an Jiaotong University, Xi’an 710049; 2. School of Software, Xi’an Jiaotong University, Xi’an 710049)
  • Received:1900-01-01 Revised:1900-01-01 Online:2009-10-20 Published:2009-10-20

摘要: 提出一种基于加权模糊聚类(WFC)和互信息(MI)的主题句提取方法,使主题句尽可能全面覆盖全文主题的同时,缩减自身的冗余,以提高摘要效率,采用加权模糊聚类的方法对文本句子进行分类,对在同一类中的句子使用比较互信息的方法进行排名处理,从而获得高质量的摘要。实验结果表明,与传统聚类方法比较,该方法的正确率提高约15%,可以达到约70%的精确度,并在阅读摘要时能够基本正确地获取文本信息。

关键词: 主题句, 加权模糊聚类, 互信息

Abstract: A topic sentence extraction method based on Weight Fuzzy Clustering(WFC) and Mutual Information(MI) is proposed, which is to cover more topics and lower the redundant information of the text. The abstract efficiency is promoted. Using WFC method, the sentences is classified. Sentences in each cluster is ranked by MI values. High qualified abstact is obtained. Experimental results show that, compared with former clustering method, this method can improve the precision by nearly 15%, and has about 70% accuracy. It can get text information correctly.

Key words: topic sentence, Weight Fuzzy Clustering(WFC), Mutual Information(MI)

中图分类号: