作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2009, Vol. 35 ›› Issue (20): 199-201. doi: 10.3969/j.issn.1000-3428.2009.20.071

• 人工智能及识别技术 • 上一篇    下一篇

混合的汉语基本名词短语识别方法

胡乃全1,朱巧明1,2,周国栋1,2   

  1. (1. 苏州大学计算机科学与技术学院,苏州 215006;2. 江苏省计算机信息处理技术重点实验室,苏州 215006)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2009-10-20 发布日期:2009-10-20

Hybrid Method to Chinese Base Noun Phrase Recognition

HU Nai-quan1, ZHU Qiao-ming1,2, ZHOU Guo-dong1,2   

  1. (1. School of Computer Science and Technology, Soochow University, Suzhou 215006;2. Jiangsu Provincial Key Lab for Computer Information Processing Technology, Suzhou 215006)
  • Received:1900-01-01 Revised:1900-01-01 Online:2009-10-20 Published:2009-10-20

摘要: 提出一种混合的汉语基本名词短语(BaseNP)识别模型,包括采用语法规则、统计方法和组合分类器方法。利用BaseNP词的信息、词性信息及上下文句法信息,构建组合分类器,提高判断的准确性。在中文树库(CTB5.0)上进行实验,F值达到了90.09%,证明该方法能有效地识别BaseNP。

关键词: 基本名词短语, 规则模板, 组合分类器

Abstract: This paper proposes a hybrid method to recognize Chinese Base Noun Phrase(BaseNP), including the use of grammer rules, statistical approach and classification combination. It utilizes words information, part of speech information and context syntax information of BaseNP, generates a combination classification and improves the precision. Experimental results on CTB5.0 show that the F-score is 90.09%, it proves that the method is an effective approach to Chinese BaseNP recognition.

Key words: Base Noun Phrase(BaseNP), rule templates, combined classifier

中图分类号: