
Computer Engineering

• Artificial Intelligence and Recognition Technology •

Multi-objective Projection Pursuit for Text Categorization Based on CPSO

SHI Song, CHEN Yun

  1. (Shanghai Key Laboratory of Financial Information Technology, Shanghai University of Finance and Economics, Shanghai 200433, China)
  • Received: 2012-10-30 Online: 2014-02-15 Published: 2014-02-13
  • About the authors: SHI Song (born 1985), male, Ph.D. candidate; main research interests: data mining, information retrieval, and intelligent algorithms. CHEN Yun, professor and doctoral supervisor.
  • Funding:
    Science and Technology Commission of Shanghai Municipality projects (10dz1123500, 10dz1123200, 11ZR1411800); Natural Science Foundation of Shanghai project (11ZR1411800); Shanghai University of Finance and Economics Graduate Innovation Fund project (CXJJ-2012-322)

Abstract: Projection pursuit can effectively mitigate the curse of dimensionality in text categorization, and optimizing the projection direction is its key problem. Traditional projection pursuit methods treat projection index optimization as a single-objective problem, which degrades solution quality. To address this, this paper proposes a projection pursuit method based on multi-objective optimization. The distance between classes and the clustering compactness of the data within each class serve as the two optimization objectives, the projection is extended to multiple dimensions, and a Chaotic Particle Swarm Optimization (CPSO) algorithm searches for the optimal projection direction. Experiments on commonly used text datasets determine the optimal projection index and dimensionality and compare the classification results of different classification models. The results show that the proposed method effectively improves text categorization performance.

Key words: projection pursuit, text categorization, curse of dimensionality, projection index, multi-objective optimization, Chaotic Particle Swarm Optimization (CPSO) algorithm
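
The two objectives named in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: the abstract does not give the exact formulas, so the between-class measure (mean pairwise distance between projected class centroids), the within-class measure (mean distance of each projected point to its own class centroid), and the logistic map as the chaotic sequence are all assumptions, and the function names `project`, `objectives`, and `logistic_map` are hypothetical. The particle swarm update itself is omitted.

```python
import numpy as np

def project(X, W):
    """Project rows of X (n x d) onto the columns of W (d x k).

    Each column of W is scaled to unit length so that the objectives
    depend only on the projection directions (an assumption).
    """
    W = np.asarray(W, dtype=float)
    W = W / np.linalg.norm(W, axis=0, keepdims=True)
    return np.asarray(X, dtype=float) @ W

def objectives(X, y, W):
    """Return (between, within) for projection W; needs at least two classes.

    between: mean pairwise distance between projected class centroids
             (assumed objective to maximise).
    within:  mean distance of each projected point to its class centroid
             (assumed objective to minimise).
    """
    Z = project(X, W)
    labels, idx = np.unique(np.asarray(y), return_inverse=True)
    m = len(labels)
    centroids = np.stack([Z[idx == k].mean(axis=0) for k in range(m)])
    pairwise = np.linalg.norm(centroids[:, None, :] - centroids[None, :, :], axis=-1)
    between = pairwise.sum() / (m * (m - 1))  # diagonal entries are zero
    within = np.linalg.norm(Z - centroids[idx], axis=1).mean()
    return between, within

def logistic_map(x0, n, mu=4.0):
    """Chaotic sequence x <- mu * x * (1 - x), an assumed choice of chaotic map."""
    xs = np.empty(n)
    x = x0
    for i in range(n):
        x = mu * x * (1.0 - x)
        xs[i] = x
    return xs

# Toy data: two classes separated along the first feature only.
X = np.array([[0, 0], [0, 1], [4, 0], [4, 1]])
y = np.array([0, 0, 1, 1])
good = objectives(X, y, np.array([[1], [0]]))  # projects onto feature 1
bad = objectives(X, y, np.array([[0], [1]]))   # projects onto feature 2
chaos = logistic_map(0.3, 5)
```

On this toy data the first direction separates the classes completely while the second mixes them, so neither objective alone needs to be trusted: a multi-objective search would compare candidate directions on both values at once, for example by keeping the non-dominated ones.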

CLC Number: