作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2009, Vol. 35 ›› Issue (22): 197-199. doi: 10.3969/j.issn.1000-3428.2009.22.067

• 人工智能及识别技术 • 上一篇    下一篇

基于条件随机域的生物命名实体识别

彭春艳,张 晖,包玲玉,陈昌平   

  1. (西南科技大学计算机科学与技术学院,绵阳 621000)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2009-11-20 发布日期:2009-11-20

Biological Named Entity Recognition Based on Conditional Random Fields

PENG Chun-yan, ZHANG Hui, BAO Ling-yu, CHEN Chang-ping   

  1. (School of Computer Science and Technology, Southwest University of Science and Technology, Mianyang 621000)
  • Received:1900-01-01 Revised:1900-01-01 Online:2009-11-20 Published:2009-11-20

摘要: 提出一种基于条件随机域模型的生物命名实体识别方法,结合单词构词特性以及距离依赖特性,在JNLPBA的GENIAV3.02数据上进行实验,测试结果表明,引入距离依赖后,系统的识别性能比只利用单特性的条件随机域方法提高2.54%,可获得较好的识别效果,提高了系统的识别效率。

关键词: 生物命名实体识别, 条件随机域, 隐马尔科夫模型

Abstract: A biological named entity recognition method based on Conditional Random Fields(CRF) is proposed, which combines the word characteristics and the distance between words. Experiments are carried out with GENIAV3.02 datasets given by JNLPBA. Experimental results show that, after introducing words distance characteristics, the proposed method can achieve a performance improvement of 2.54% compared to simple conditional random fields, therefore achieving a better recognition result and improve the efficiency of systems.

Key words: biological named entity recognition, Conditional Random Fields(CRF), Hidden Markov Models(HMM)

中图分类号: