计算机工程 ›› 2011, Vol. 37 ›› Issue (18): 164-166.doi: 10.3969/j.issn.1000-3428.2011.18.054

• 人工智能及识别技术 • 上一篇    下一篇

基于缩略语分析的中文报道关系识别研究

王凤玲   

  1. (菏泽学院计算机与信息工程系,山东 菏泽 274015)
  • 收稿日期:2011-04-25 出版日期:2011-09-20 发布日期:2011-09-20
  • 作者简介:王凤玲(1978-),女,讲师、硕士,主研方向:自然语言处理

Research of Chinese Report Link Recognition Based on Abbreviation Analysis

WANG Feng-ling   

  1. (Department of Computer & Information Engineering, Heze University, Heze 274015, China)
  • Received:2011-04-25 Online:2011-09-20 Published:2011-09-20

摘要: 分析中文缩略语的构词方式,定义2个词之间的词形相似度,提出一种基于最长字符串匹配的相似度计算方法,探讨该方法在中文报道关系识别系统中的应用。实验结果表明,该相似度计算方法能够改善中文报道关系识别系统的性能,使系统的归一化检测开销降低12.96%,取得较好的识别效果。

关键词: 报道关系识别, 话题检测与跟踪, 缩略语, 归一化检测开销, 相似度计算方法

Abstract: This paper analyzes the formation of the Chinese abbreviations, defines the morphology similarity between two words, and proposes the story similarity computation method based on the longest string matching. It explores the usage of this similarity computation method in the Chinese report link recognition system. Experimental results show this method performes well, reduces the normalized detection cost by 12.96%, and greatly improves the performance of the story link recognition system.

Key words: report link recognition, topic detection and tracking, abbreviation, normalized detection cost, similarity computation method

中图分类号: