作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2011, Vol. 37 ›› Issue (2): 157-159. doi: 10.3969/j.issn.1000-3428.2011.02.054

• 人工智能及识别技术 • 上一篇    下一篇

基于文本分析的知识获取系统设计与实现

姚金国,代志龙   

  1. (复旦大学计算机科学与技术学院,上海 200433)
  • 出版日期:2011-01-20 发布日期:2011-01-25
  • 作者简介:姚金国(1986-),男,硕士研究生,主研方向:专家系统;代志龙,博士研究生
  • 基金资助:
    国家自然科学基金资助项目(60303007)

Design and Implementation of Text Analysis Based Knowledge Acquisition System

YAO Jin-guo, DAI Zhi-long   

  1. (School of Computer Science, Fudan University, Shanghai 200433, China)
  • Online:2011-01-20 Published:2011-01-25

摘要: 知识获取一直以来都是构建专家系统的瓶颈问题。针对该问题,利用自然语言处理技术,设计并实现一个针对化学科技文献进行分析的知识获取系统,并对其关键技术进行分析。系统对输入文本进行分词及词性标注,在此基础上,使用Tregex在句法分析树上进行实体识别,同时利用依存关系进行搭配词识别。实验结果表明了该系统的有效性。

关键词: 知识获取, 句法分析, 文本分析

Abstract: Knowledge acquisition is a bottleneck to develop expert system. Aiming at this problem, it designs and implements a knowledge acquisition system to analyze the chemical literatures. Meanwhile, key techniques are expatiated. NLP approaches are used in this system to facilitate the automated extraction of chemical knowledge. The system segments the input text into words and assigns part of speech to each word. Tregex is used to identify the named entities on parsing trees and typed dependencies are utilized to extract qualifier for each entity. Experimental results indicate that the approaches for this task are highly efficient.

Key words: knowledge acquisition, syntax parsing, text analysis

中图分类号: