Author Login Editor-in-Chief Peer Review Editor Work Office Work

Computer Engineering ›› 2010, Vol. 36 ›› Issue (24): 192-194. doi: 10.3969/j.issn.1000-3428.2010.24.069

• Networks and Communications • Previous Articles     Next Articles

Chinese Entity Relation Extraction Based on Conditional Random Fields Model

ZHOU Jing   

  1. (Dept. of Computer Management, Nanjing Technical Vocational College, Nanjing 210019, China)
  • Online:2010-12-20 Published:2010-12-14

基于条件随机域模型的中文实体关系抽取

周 晶   

  1. (南京高等职业技术学校计算机管理系,南京 210019)
  • 作者简介:周 晶(1983-),女,助教、硕士研究生,主研方向:信息抽取,自然语言处理

Abstract: To solve disorder among information items and lack of information item in the field of information extraction, this paper proposes a solution to deal with chunks labeling and Entity Relation Extraction(ERE) based on the conditional random fields model. This paper defines the representation of Chinese chunk and entity relation, and uses label dataset of “People’s Daily” as sample dataset to train an optimized model for the entity extraction. Experimental results show this method has better extraction performance.

Key words: information extraction, chunks labeling, entity relation extraction, Conditional Random Fields(CRFs) model

摘要: 针对信息抽取领域中存在的抽取结果难以满足需要的问题,给出基于条件随机域模型的方法,以解决组块标注和实体关系抽取问题。通过定义中文组块和实体关系的标注方式,选择比较通用的《人民日报》语料,训练出效率较高的二阶模板来抽取文本中的实体关系。实验结果表明,该方法可以获得更好的抽取效果。

关键词: 信息抽取, 组块标注, 实体关系抽取, 条件随机域模型

CLC Number: