作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2008, Vol. 34 ›› Issue (3): 207-209. doi: 10.3969/j.issn.1000-3428.2008.03.073

• 人工智能及识别技术 • 上一篇    下一篇

基于IPC知识结构的专利自动分类方法

刘玉琴,桂 婕,朱东华   

  1. (北京理工大学管理与经济学院,北京 100081)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2008-02-05 发布日期:2008-02-05

Automated Categorization of Patent Based on IPC Knowledge Construction

LIU Yu-qin, GUI Jie, ZHU Dong-hua   

  1. (School of Management and Economics, Beijing Institute of Technology, Beijing 100081)
  • Received:1900-01-01 Revised:1900-01-01 Online:2008-02-05 Published:2008-02-05

摘要: 基于国际专利分类号的层次结构,利用自身的类别描述信息,建立了不同层次的类别特征向量,结合现有专利进行修正训练,分别在各层次上采用经典的KNN算法实现专利的自动分类。实验结果表明:该方法的分类效果在部、大类、小类层次上表现较好。经过修正训练后的分类性能有所提高。

关键词: 文本分类, 专利分类, 国际专利分类号

Abstract: Based on hierarchy and information of International Patent Classification(IPC), this paper constructs character vectors on the different levels. Revision training is done combing with present patents based on the different levels. KNN algorithm is used to realize automated categorization of patent. Experimental results show that the method works well on the level of section, class and subclass, and the categorization performance is improved after revision training.

Key words: text categorization, patent categorization, International Patent Classification(IPC)

中图分类号: