作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2006, Vol. 32 ›› Issue (16): 55-57.

• 软件技术与数据库 • 上一篇    下一篇

SPRINT算法的改进

刘友军; 汪林林   

  1. 重庆邮电学院经济管理学院,重庆 400065
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2006-08-20 发布日期:2006-08-20

Improvement of SPRINT Algorithm

LIU Youjun;WANG Linlin   

  1. School of Economics Management, Chongqing University of Posts and Telecommunications, Chongqing 400065
  • Received:1900-01-01 Revised:1900-01-01 Online:2006-08-20 Published:2006-08-20

摘要: 引出了纯区间的概念后,提出了一种基于纯区间归约的数值型属性处理方法对SPRINT算法进行改进。该方法将属性值域用等宽直方图的方法划分为多个区间,对纯区间进行归约,对非纯区间进行精确计算,保证了分裂精度,减小了计算量。

关键词: 决策树, SPRINT算法, 纯区间归约, Gini指数

Abstract: This paper introduces the concept of pure interval, proposes a new splitting method based on pure intervals reduction to deal with numeric attributes for SPRINT algorithm. The method divides the numeric attributes to many intervals with equal-width histogram, reduces the pure intervals, calculates exactly the minimum gini value in the impure intervals, ensures the accuracy of split result and reduces computation.

Key words: Decision tree, SPRINT algorithm, Pure intervals reduction, Gini index

中图分类号: