计算机工程 ›› 2009, Vol. 35 ›› Issue (6): 82-84.doi: 10.3969/j.issn.1000-3428.2009.06.028

• 软件技术与数据库 • 上一篇    下一篇

基于属性相似度的决策树算法

陆 秋,程小辉   

  1. (桂林工学院电子与计算机系,桂林 541004)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2009-03-20 发布日期:2009-03-20

Decision Tree Algorithm Based on Attribute Similarity

LU Qiu, CHENG Xiao-hui   

  1. (Department of Electronic and Computer, Guilin University of Technology, Guilin 541004)
  • Received:1900-01-01 Revised:1900-01-01 Online:2009-03-20 Published:2009-03-20

摘要: 针对ID3算法的多值偏向问题,提出一种基于属性相似度的、能够避免多值偏向问题的ID3改进算法——NewDtree算法,并应用理论分析方法对NewDtree算法不存在多值偏向问题进行了证明。通过对实验结果的分析,得出NewDtree算法能有效地提高分类的正确率,弥补ID3算法选择测试属性时偏向取值较多的不足的结论。

关键词: ID3算法, 多值偏向, 属性相似度, NewDtree算法

Abstract: According to the multivalue bios of ID3 algorithm, an ID3 improved algorithm, NewDtree, is brought forward in the paper. This algorithm is based on the attribute similarity theory, solving the multivalue bios, and the multivalue bios in NewDtree algorithm is proved not exist with the analytical method of the theory. A conclusion is drawn with the analysis of test result that the NewDtree algorithm can improve the definition of classification effectively, solving the multivalue bios problem of ID3 in selecting test attribute.

Key words: ID3 algorithm, multivalue bios, attribute similarity, NewDtree algorithm

中图分类号: