
Computer Engineering ›› 2006, Vol. 32 ›› Issue (16): 74-76. doi: 10.3969/j.issn.1000-3428.2006.16.028

• Software Technology and Database •

Study on Multiclass Text Categorization Method Based on Improved Support Vector Machine

YING Wei1; WANG Zheng’ou1; AN Jinlong2

  1. Institute of Systems Engineering, Tianjin University, Tianjin 300072; 2. Hebei University of Technology, Tianjin 300130
  • Received: 1900-01-01 Revised: 1900-01-01 Online: 2006-08-20 Published: 2006-08-20

A Multiclass Text Categorization Method Based on an Improved Support Vector Machine

应 伟1;王正欧1;安金龙2   

  1. Institute of Systems Engineering, Tianjin University, Tianjin 300072; 2. Hebei University of Technology, Tianjin 300130

Abstract: This paper proposes a multiclass text categorization method based on an improved support vector machine (SVM) that combines a binary tree structure, pre-extraction of support vectors, and a cyclic iterative algorithm. Compared with existing multiclass SVM methods, the proposed method achieves much higher computational efficiency. The paper gives the concrete procedure of the algorithm and applies it to text classification. Experimental results demonstrate the effectiveness and efficiency of the approach.

Key words: Text categorization, Support vector machines, Iterative algorithm, Binary tree
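The abstract names a binary tree as the structure that turns binary classifiers into a multiclass one. The following minimal sketch illustrates only that decomposition idea and is not the authors' algorithm: the pre-extraction of support vectors and the iterative training step are not described in the abstract and are not reproduced here. As a loudly stated assumption, a plain perceptron stands in for the binary SVM at each tree node, the class split at each node is a simple halving of the sorted class list, and the function names (`train_perceptron`, `build_tree`, `predict`) and the two-dimensional toy data are hypothetical.

```python
def dot(w, x):
    """Inner product of two equal-length vectors."""
    return sum(wj * xj for wj, xj in zip(w, x))


def train_perceptron(X, y, max_epochs=1000):
    """Stand-in binary learner (NOT an SVM): classic perceptron, y in {-1, +1}.

    Stops early once a full pass makes no mistakes, which is guaranteed
    to happen on linearly separable data.
    """
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(max_epochs):
        mistakes = 0
        for xi, yi in zip(X, y):
            if yi * (dot(w, xi) + b) <= 0:
                w = [wj + yi * xj for wj, xj in zip(w, xi)]
                b += yi
                mistakes += 1
        if mistakes == 0:
            break
    return w, b


def build_tree(X, labels, classes):
    """Recursively split the class list in half; one binary classifier per internal node.

    A leaf is a class name (str); an internal node is (w, b, left, right).
    """
    if len(classes) == 1:
        return classes[0]
    mid = len(classes) // 2
    left, right = classes[:mid], classes[mid:]
    left_set = set(left)
    # Left group is the negative side, right group the positive side.
    y = [-1 if c in left_set else 1 for c in labels]
    w, b = train_perceptron(X, y)
    left_pairs = [(x, c) for x, c in zip(X, labels) if c in left_set]
    right_pairs = [(x, c) for x, c in zip(X, labels) if c not in left_set]
    left_node = build_tree([x for x, _ in left_pairs], [c for _, c in left_pairs], left)
    right_node = build_tree([x for x, _ in right_pairs], [c for _, c in right_pairs], right)
    return (w, b, left_node, right_node)


def predict(node, x):
    """Walk from the root to a leaf, following the sign of each node's decision value."""
    while isinstance(node, tuple):
        w, b, left, right = node
        node = left if dot(w, x) + b < 0 else right
    return node


# Hypothetical toy data: four well-separated "categories" in two dimensions.
X = [[0, 0], [1, 0], [0, 1],
     [0, 5], [1, 5], [0, 6],
     [5, 0], [6, 0], [5, 1],
     [5, 5], [6, 5], [5, 6]]
labels = ["a"] * 3 + ["b"] * 3 + ["c"] * 3 + ["d"] * 3

tree = build_tree(X, labels, sorted(set(labels)))
predictions = [predict(tree, x) for x in X]
```

With k classes, a tree of this shape trains k - 1 binary classifiers, and a prediction evaluates only the classifiers along one root-to-leaf path. In a text categorization setting the feature vectors would be document representations such as term weights rather than the toy points used here.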

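Abstract: This paper proposes a multiclass text categorization method based on an improved support vector machine (SVM) that uses a binary tree, pre-extraction of support vectors, and a cyclic iterative algorithm. Compared with existing multiclass SVM algorithms, the method has relatively high computational efficiency. The concrete implementation procedure is given and applied to text classification; experiments show that the algorithm is both effective and highly efficient for text classification.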

Key words: Text categorization, Support vector machine, Iterative algorithm, Binary tree
