Abstract: This paper proposes a hybrid model for recognizing Chinese base noun phrases (BaseNP) that combines grammar rules, statistical methods, and a combined classifier. The combined classifier is built from the word information, part-of-speech information, and contextual syntactic information of BaseNPs in order to improve the accuracy of its decisions. In experiments on the Chinese Treebank (CTB5.0), the F-score reaches 90.09%, which shows that the method recognizes BaseNPs effectively.
Key words: base noun phrase (BaseNP), rule templates, combined classifier
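The abstract does not say how the individual classifiers are combined or how the F-score is computed, so the following is only a minimal sketch under stated assumptions: BaseNP boundaries are encoded as per-token B/I/O tags, the classifier outputs are combined by simple majority vote, and the F-score is the chunk-level harmonic mean of precision and recall. The function names (`majority_vote`, `extract_chunks`, `f_score`) are hypothetical and not taken from the paper.

```python
from collections import Counter


def majority_vote(tag_sequences):
    """Combine per-token B/I/O tags from several classifiers by majority vote.

    Assumption: all sequences cover the same tokens. On a tie, the tag
    seen first (i.e. from the earliest classifier) wins.
    """
    combined = []
    for votes in zip(*tag_sequences):
        combined.append(Counter(votes).most_common(1)[0][0])
    return combined


def extract_chunks(tags):
    """Return the set of (start, end) spans, end exclusive, encoded by B/I/O tags."""
    chunks, start = set(), None
    for i, tag in enumerate(tags):
        if tag != "I" and start is not None:  # B or O closes an open chunk
            chunks.add((start, i))
            start = None
        if tag == "B" or (tag == "I" and start is None):
            start = i  # a stray I is treated as a chunk start
    if start is not None:
        chunks.add((start, len(tags)))
    return chunks


def f_score(gold_tags, pred_tags):
    """Chunk-level F-score: a predicted chunk counts only if both boundaries match."""
    gold, pred = extract_chunks(gold_tags), extract_chunks(pred_tags)
    correct = len(gold & pred)
    precision = correct / len(pred) if pred else 0.0
    recall = correct / len(gold) if gold else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical outputs of three classifiers for a four-token sentence.
gold = ["B", "I", "O", "B"]
c1 = ["B", "I", "O", "B"]
c2 = ["B", "O", "O", "B"]
c3 = ["B", "I", "O", "O"]
voted = majority_vote([c1, c2, c3])
```

In this toy case each of `c2` and `c3` makes one boundary error, yet the voted sequence matches `gold`, which illustrates why combining classifiers can raise the F-score above that of any single member.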
HU Nai-quan, ZHU Qiao-ming, ZHOU Guo-dong. Hybrid Method to Chinese Base Noun Phrase Recognition[J]. Computer Engineering, 2009, 35(20): 199-201.