作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2011, Vol. 37 ›› Issue (4): 214-215. doi: 10.3969/j.issn.1000-3428.2011.04.077

• 人工智能及识别技术 • 上一篇    下一篇

一种具有增量学习能力的PU主动学习算法

陈 文,晏 立,周 亮   

  1. (江苏大学计算机科学与通信工程学院,江苏 镇江212013)
  • 出版日期:2011-02-20 发布日期:2011-02-17
  • 作者简介:陈 文(1983-),男,硕士研究生,主研方向:主动学习算法,Web数据挖掘;晏 立,教授;周 亮,硕士研究生

PU Active Learning Algorithm with Incremental Learning Ability

CHEN Wen, YAN Li, ZHOU Liang   

  1. (School of Computer Science and Telecommunication Engineering, Jiangsu University, Zhenjiang 212013, China)
  • Online:2011-02-20 Published:2011-02-17

摘要:

在正例和无标记样本增量学习中,初始正例样本较少且不同类别正例的反例获取困难,使分类器的分类和泛化能力不强,为解决上述问题,提出一种具有增量学习能力的PU主动学习算法,在使用3个支持向量机进行协同半监督学习的同时,利用基于网格的聚类方法进行无监督学习,当分类与聚类结果不一致时,引入主动学习对无标记样本进行标记。实验结果表明,将该算法应用于Deep Web入口的在线判断和分类能有效提高入口判断的准确性及分类的正确性。

关键词: PU学习, 支持向量机, 基于网格的聚类

Abstract:

In positive and unlabeled samples of incremental learning, the initial positive samples are small and positive cases of different types of cases are difficult to get, making classifier classification ability and generalization ability weak. A new algorithm called PU Active Learning algorithm with Incremental learning ability(I-PUAL) is presented, which is applied to Deep Web sources on-line judgments and classification. Experimental results show that it can take advantage of online unlabeled samples to improve the accuracy of judgments and classification correctness.

Key words: PU learning, SVM, grid-based clustering

中图分类号: