计算机工程

• 人工智能及识别技术 • 上一篇    下一篇

一种半监督重复软最大化模型

邢国正,江雨燕,吴超,李常训   

  1. (安徽工业大学管理科学与工程学院,安徽 马鞍山 243002)
  • 收稿日期:2015-01-15 出版日期:2015-09-15 发布日期:2015-09-15
  • 作者简介:邢国正(1977-),男,讲师,主研方向:机器学习;江雨燕,教授;吴超、李常训,硕士研究生。
  • 基金项目:
    国家自然科学基金资助项目(71172219);国家科技型中小企业创新基金资助项目(11C26213402013)。

A Semi-supervised Replicated Softmax Model

XING Guozheng,JIANG Yuyan,WU Chao,LI Changxun   

  1. (School of Management Science and Engineering,Anhui University of Technology,Maanshan 243002,China)
  • Received:2015-01-15 Online:2015-09-15 Published:2015-09-15

摘要: 概率主题模型由于其高效的数据降维和文档主题特征挖掘能力被广泛应用于各种文档分析任务中,然而概率主题模型主要基于有向图模型构建,使得模型的表示能力受到极大限制。为此,研究分布式主题特征表示和基于无向图模型玻尔兹曼机的重复软最大化模型(RSM),提出一种半监督的RSM(SSRSM)。将SSRSM、RSM模型提取的主题特征应用于多标记判别任务中,实验结果表明,相比LDA和RSM模型,SSRSM模型具有更好的多标记判别能力。

关键词: 主题模型, 无向图模型, 重复软最大化模型, 半监督模型, 特征学习

Abstract: Recently probabilistic topic models are widely used because of high performance of dimension reduction and topic features mining.However,topic models are built based on directed graph model which limits the performance of data representation.This paper based on the studies on distributed feature representation and Replicated Softmax Model(RSM) which is based on the Restricted Bolzmann Machine(RBM) proposes a Semi Supervised Replicated Softmax Model (SSRSM).Experimental results show that the SSRSM outperforms LDA and RSM in task of topics extraction.In addition,by using the features learned by SSRSM and RSM in task of multi-label classification,it is shown that SSRSM has a better performance of multi-label learning than RSM.

Key words: topic model, undirected graph model, Replicated Softmax Model(RSM), semi-supervised model, feature learning

中图分类号: