作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2009, Vol. 35 ›› Issue (3): 199-201,. doi: 10.3969/j.issn.1000-3428.2009.03.067

• 人工智能及识别技术 • 上一篇    下一篇

基于支持向量机的英语名词短语指代消解

李艳翠,杨 勇,周国栋,朱巧明   

  1. (1. 苏州大学计算机科学与技术学院,苏州 215006;2. 江苏省计算机信息处理技术重点实验室,苏州 215006)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2009-02-05 发布日期:2009-02-05

Anaphora Resolution of Noun Phrase Based on SVM

LI Yan-cui, YANG Yong, ZHOU Guo-dong, ZHU Qiao-ming   

  1. (1. School of Computer Science and Technology, Soochow University, Suzhou 215006; 2. Jiangsu Provincial Key Lab for Computer Information Processing Technology, Suzhou 215006)
  • Received:1900-01-01 Revised:1900-01-01 Online:2009-02-05 Published:2009-02-05

摘要: 提出一种基于支持向量机(SVM)的英语名词短语的指代消解方法,并给出具体实现系统。实验采用了几个常用的基本特征,在MUC-6公开语料上测试得到的F值为68.6,优于同类型的其他原型系统。分析SVM中不同核函数对分类结果的影响以及不同的特征对指代消解的作用。实验结果表明,同位语、别名和字符串匹配3个特征对指代消解非常重要,距离作为特征使用时对指代消解没有帮助,但可在训练样例生成时作为限制条件来使用。

关键词: 指代消解, 支持向量机, 核函数

Abstract: This paper proposes an anaphora resolution of noun phrases based on Support Vector Machine(SVM). Evaluation on the MUC-6 corpus using several widely used features shows that the system achieves the F-measure of 68.6% and outperforms other similar systems. Further analysis shows that appositive, name alias and full string matching contributes most for anaphora resolution. It also shows that the distance between the antecedent candidate and the anaphor is very useful in constraining the instance generation, although including it as a feature does not help for anaphora resolution.

Key words: anaphora resolution, Support Vector Machine(SVM), kernel function

中图分类号: