作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2007, Vol. 33 ›› Issue (13): 87-89. doi: 10.3969/j.issn.1000-3428.2007.13.029

• 软件技术与数据库 • 上一篇    下一篇

一种多值属性和多类标数据的决策树算法

赵 蕊,李 宏   

  1. (中南大学信息科学与工程学院,长沙 410083)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2007-07-05 发布日期:2007-07-05

Algorithm of Multi-valued Attribute and Multi-labeled Data Decision Tree

ZHAO Rui, LI Hong   

  1. (School of Information Science and Engineering, Central South University, Changsha 410083)
  • Received:1900-01-01 Revised:1900-01-01 Online:2007-07-05 Published:2007-07-05

摘要: 提出了一种多值属性和多类标数据的决策树算法(SSC),在MMC算法中,对用孩子结点的类标集相似度来评定结点属性分类效果的计算方法进行了改进,综合考虑集合的同一性和一致性,提出了相似度评定方法,使类标集相似度的计算更加全面和准确。实验证明该算法的分类效果优于MMC算法。

关键词: 分类, 决策树, 多值属性, 多类标数据, 相似度

Abstract: This paper develops a decision tree classifier SSC(similarity of same and consistent) for multi-valued and multi-labeled data, improves on MMC’s formula for measuring the similarity of label-sets to determine the goodness of splitting attributes. It proposes a new measure approach considering both same and consistent features of label-sets. The experiment shows SSC has improved accuracy of MMC.

Key words: classification, decision tree, multi-valued attribute, multi-labeled data, similarity

中图分类号: