作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2010, Vol. 36 ›› Issue (24): 186-188. doi: 10.3969/j.issn.1000-3428.2010.24.067

• 人工智能及识别技术 • 上一篇    下一篇

基于典型相关分析的多视图跨领域情感分类

黄贤立   

  1. (淮阴师范学院计算机科学与技术学院,江苏 淮安 223300)
  • 出版日期:2010-12-20 发布日期:2010-12-14
  • 作者简介:黄贤立(1977-),男,讲师、硕士,主研方向:人工智能,机器学习
  • 基金资助:
    淮阴师范学院青年优秀人才支持计划基金资助项目;江苏省淮安市科技支撑(工业)计划基金资助项目(HAG09054-5)

Multi-view Cross Domain Sentiment Classification Based on Canonical Correlation Analysis

HUANG Xian-li   

  1. (School of Computer Science and Technology, Huaiyin Normal University, Huaian 223300, China)
  • Online:2010-12-20 Published:2010-12-14

摘要: 跨领域的文本分类,是指利用有标记领域的知识去帮助另一个概率分布不同的,未标记领域的知识进行分类的问题。从多视图学习的视角提出一个新的跨领域文本分类的方法(MTV算法)。通过在核空间典型相关分析中引入与标记相关的信息,MTV算法可以得到一个判别性能更优的公共子空间。在多个情感类文本数据上的实验表明,MTV算法可以大大提升传统监督式学习算法面对领域迁移时的分类性能,并且在引入判别式的核空间典型相关分析后,进一步优化性能。

关键词: 跨领域学习, 迁移学习, 情感分类, 文本分类, 核空间典型相关分析

Abstract: Cross domain text classification aims at leveraging the knowledge in labeled source domains to predict the unlabeled data in another domain, where the distribution is different from the source domains. This paper proposes a new algorithm for cross domain classification in a multi-view learning perspective, named MulTi-View(MTV) transfer classification. By incorporating discriminative information related to labels, a more discriminative common low dimensional space is extracted. In MTV perspective, MTV algorithm provides a clear and neat way to transfer knowledge between domains. By applying KCCA, MTV also gives a closed form solution. Experiments show that MTV algorithm can significantly improve the performance of traditional supervised learning methods on extensive sentiment text datasets.

Key words: cross domain learning, transfer learning, sentiment classification, text classification, kernel canonical correlation analysis

中图分类号: