作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2012, Vol. 38 ›› Issue (24): 191-195. doi: 10.3969/j.issn.1000-3428.2012.24.045

• 人工智能及识别技术 • 上一篇    下一篇

基于CRFs和跨事件的事件识别研究

侯立斌 1,2,李培峰 1,2,朱巧明 1,2   

  1. (1. 苏州大学计算机科学与技术学院,江苏 苏州 215006;2. 江苏省计算机信息处理技术重点实验室,江苏 苏州 215006)
  • 收稿日期:2012-02-20 修回日期:2012-03-14 出版日期:2012-12-20 发布日期:2012-12-18
  • 作者简介:侯立斌(1986-),男,硕士研究生,主研方向:自然语言处理;李培峰,副教授;朱巧明,教授、博士生导师
  • 基金资助:
    国家自然科学基金资助项目(60970056, 61070123);江苏省自然科学基金资助项目(BK2008160);高等学校博士学科点专项科研基金资助项目(20093201110006)

Study of Event Recognition Based on CRFs and Cross-event

HOU Li-bin 1,2, LI Pei-feng 1,2, ZHU Qiao-ming 1,2   

  1. (1. School of Computer Science and Technology, Soochow University, Suzhou 215006, China; 2. Key Lab of Computer Information Processing Technology of Jiangsu Province, Suzhou 215006, China)
  • Received:2012-02-20 Revised:2012-03-14 Online:2012-12-20 Published:2012-12-18

摘要: 事件检测与类型识别是事件抽取的基础,具体实施分为触发词检测和事件类型识别2个阶段。分别对2个阶段进行研究,在前一阶段,针对词形特征过拟和问题,提出利用LDA模型对词语聚类的方法,考虑到中文自动分词与标注的触发词边界的不一致性,提出基于CRFs模型的触发词识别方法。在后一阶段,为提高事件类型识别的效果,将跨事件理论应用于中文事件类型识别。实验结果表明,该方法能提高系统性能,F值分别提高到66.3和62.0。

关键词: 事件抽取, 触发词检测, 事件类型识别, 跨事件, CRFs模型, LDA模型

Abstract: Event extraction is an important component of information extraction. Event detection and recognition is the basis of event extraction. It is implemented by two stages which are trigger word detection and event recognition. The two stages are studied. In first stage, words are clustered by LDA model to solve the words overfitting problem and a trigger word detection method is proposed based on character using CRFs model in view of the inconsistency between Chinese word segmentation and trigger word boundary. In the next stage, cross-event inference is applied in Chinese event recognition to enhance the result of event recognition. Experimental results show that the approach can significantly improve system performance, achieving the F-measure of 66.2 and 62.0 on the stage of trigger detection and event recognition respectively.

Key words: event extraction, trigger word detection, event type recognition, cross-event, CRFs model, LDA model

中图分类号: