作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2012, Vol. 38 ›› Issue (06): 13-15.

• 博士论文 • 上一篇    下一篇

基于Tri-training算法的构造性学习方法

吴 涛 a,b,李 萍 b,王允强 b   

  1. (安徽大学 a. 智能计算与信号处理教育部重点实验室;b. 数学科学学院,合肥 230039)
  • 收稿日期:2011-08-04 出版日期:2012-03-20 发布日期:2012-03-20
  • 作者简介:吴 涛(1970-),男,教授、博士,主研方向:机器学习,智能计算;李 萍、王允强,硕士研究生
  • 基金资助:
    国家“973”计划基金资助项目(2007BC311003);国家自然科学基金资助项目(61073117);安徽大学创新团队基金资助项目(KJTD001B)

Constructive Learning Method Based on Tri-training Algorithm

WU Tao a,b, LI Ping b, WANG Yun-qiang b   

  1. (a. Key Laboratory of Intelligent Computing & Signal Processing of Ministry of Education; b. School of Mathematical Sciences, Anhui University, Hefei 230039, China)
  • Received:2011-08-04 Online:2012-03-20 Published:2012-03-20

摘要: 构造性机器学习(CML)算法在训练分类器时需要大量有标记样本,而获取这些有标记样本十分困难。为此,提出一种基于Tri- training算法的构造性学习方法。根据已标记的样本,采用不同策略构造3个差异较大的初始覆盖分类网络,用于对未标记数据进行标记,再将已标记数据加入到训练样本中,调整各分类网络参数,反复进行上述过程,直至获得稳定的分类器。实验结果证明,与CML算法和基于NB分类器的半监督学习算法相比,该方法的分类准确率更高。

关键词: 半监督学习, 构造性机器学习, Tri-training算法, 覆盖, 分类网络

Abstract: Constructive Machine Learning(CML) algorithm needs larger numbers of labeled examples to train a classification network, but it is difficult to obtain a mass of labeled examples. So this paper proposes a constructive learning method based on Tri-training algorithm. According to the labeled examples, it constructs three initial classification networks by using different strategies with lager differences. Unlabeled examples can be labeled by using the initial classification networks, so that the examples can be joined into the labeled examples and the parameters of the classification network can be rectified. The steps are repeated to increase the labeled samples until a steady classifier is trained. Experimental results show that the algorithm is feasible and effective than CML and semi-supervised learning algorithm based on Na?ve Bayes(NB) classifier.

Key words: semi-supervised learning, Constructive Machine Learning(CML), Tri-training algorithm, covering, classification network

中图分类号: