作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2010, Vol. 36 ›› Issue (24): 192-194. doi: 10.3969/j.issn.1000-3428.2010.24.069

• 人工智能及识别技术 • 上一篇    下一篇

基于条件随机域模型的中文实体关系抽取

周 晶   

  1. (南京高等职业技术学校计算机管理系,南京 210019)
  • 出版日期:2010-12-20 发布日期:2010-12-14
  • 作者简介:周 晶(1983-),女,助教、硕士研究生,主研方向:信息抽取,自然语言处理

Chinese Entity Relation Extraction Based on Conditional Random Fields Model

ZHOU Jing   

  1. (Dept. of Computer Management, Nanjing Technical Vocational College, Nanjing 210019, China)
  • Online:2010-12-20 Published:2010-12-14

摘要: 针对信息抽取领域中存在的抽取结果难以满足需要的问题,给出基于条件随机域模型的方法,以解决组块标注和实体关系抽取问题。通过定义中文组块和实体关系的标注方式,选择比较通用的《人民日报》语料,训练出效率较高的二阶模板来抽取文本中的实体关系。实验结果表明,该方法可以获得更好的抽取效果。

关键词: 信息抽取, 组块标注, 实体关系抽取, 条件随机域模型

Abstract: To solve disorder among information items and lack of information item in the field of information extraction, this paper proposes a solution to deal with chunks labeling and Entity Relation Extraction(ERE) based on the conditional random fields model. This paper defines the representation of Chinese chunk and entity relation, and uses label dataset of “People’s Daily” as sample dataset to train an optimized model for the entity extraction. Experimental results show this method has better extraction performance.

Key words: information extraction, chunks labeling, entity relation extraction, Conditional Random Fields(CRFs) model

中图分类号: