作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2011, Vol. 37 ›› Issue (19): 177-178,182. doi: 10.3969/j.issn.1000-3428.2011.19.058

• 人工智能及识别技术 • 上一篇    下一篇

基于极限学习机的XML文档分类

陈盛双   

  1. (武汉理工大学理学院,武汉 430070)
  • 收稿日期:2011-03-23 出版日期:2011-10-05 发布日期:2011-10-05
  • 作者简介:陈盛双(1964-),男,副教授,主研方向:数据挖掘

XML Document Classification Based on Extreme Learning Machine

CHEN Sheng-shuang   

  1. (School of Science, Wuhan University of Technology, Wuhan 430070, China)
  • Received:2011-03-23 Online:2011-10-05 Published:2011-10-05

摘要: 研究基于极限学习机(ELM)的XML文档分类方法。为优化文档的相似性计算,在结构链接向量模型的基础上,提出一种改进的特征向量模型RS-VSM,将有效的结构化信息合并到向量模型中。应用ELM对XML文档进行分类,为提高ELM分类的准确率,提出一种基于投票机制的Voting-ELM算法。实验结果证明,该算法的分类效果较优。

关键词: 可扩展标记语言, 分类, 极限学习机, 结构链接向量模型, 投票机制

Abstract: This paper studies eXtensible Markup Language(XML) document classification method based on Extreme Learning Machine(ELM). On the basis of Structured Link Vector Model(SLVM), an optimized Reduced Structured Vector Space Model(RS-VSM) is proposed to incorporate structural information into feature vectors more efficiently. It applies ELM in the XML document classification to achieve good performance at extremely high speed. A voting-ELM algorithm is proposed to improve accuracy of ELM classifiers. Experimental results demonstrate that the voting-ELM classifiers can achieve better performance.

Key words: eXtensible Markup Language(XML), classification, Extreme Learning Machine(ELM), Structured Link Vector Model(SLVM), voting mechanism

中图分类号: