作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2009, Vol. 35 ›› Issue (24): 75-77. doi: 10.3969/j.issn.1000-3428.2009.24.025

• 软件技术与数据库 • 上一篇    下一篇

基于不平衡数据集的级联决策树改进算法

郭 鹏,葛 玮   

  1. (西北大学软件工程研究所,西安 710127)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2009-12-20 发布日期:2009-12-20

Revised Cascade Decision Tree Algorithm Based on Unbalanced Data Set

GUO Peng, GE Wei   

  1. (Institute of Software Engineering, Northwest University, Xi’an 710127)
  • Received:1900-01-01 Revised:1900-01-01 Online:2009-12-20 Published:2009-12-20

摘要: 提出一种针对客户离网问题的改进决策树分类算法——M-AdaBoost级联决策树。采用级联式的思想构造多个基于AdaBoost决策树分类器,通过设定子分类器的判决信息,组合成级联式决策树。实验结果表明,该方法相对于单一的C4.5决策树、传统的AdaBoost决策树以及随机森林具有更好的分类效果。

关键词: 级联结构, 决策树, 虚警率

Abstract: This paper presents a revised decision-tree classification algorithm for customers’ churning, M-AdaBoost cascade of decision tree. By constructing a number of AdaBoosted decision tree classifier, the algorithm produces cascade composition of the decision tree. Experimental results prove the improvement achieved by the new algorithm, and it is better than the result of C4.5 decesion tree, AdaBoost decesion tree and random forest.

Key words: cascade structure, decision tree, false alarming rate

中图分类号: