作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程

• 人工智能及识别技术 • 上一篇    下一篇

基于SVM的大样本数据回归预测改进算法

顾嘉运,刘晋飞,陈明   

  1. (同济大学 a. 机械工程学院;b. 电子与信息工程学院,上海 200092)
  • 收稿日期:2013-06-06 出版日期:2014-01-15 发布日期:2014-01-13
  • 作者简介:顾嘉运(1989-),男,硕士研究生,主研方向:人工神经网络,支持向量机,信号振动分析;刘晋飞,讲师;陈 明(通讯作者),教授、博士生导师
  • 基金资助:
    国家科技重大专项基金资助项目(2009ZX0414-103);上海市引进技术的吸收与创新计划基金资助项目(11XI-07);上海市科学技术委员会科研计划基金资助项目(11dz1121002)

A Modified Regression Prediction Algorithm of Large Sample Data Based on SVM

GU Jia-yun  a, LIU Jin-fei  b, CHEN Ming  a   

  1. (a. College of Mechanical Engineering; b. College of Electronic and Information Engineering, Tongji University, Shanghai 200092, China)
  • Received:2013-06-06 Online:2014-01-15 Published:2014-01-13

摘要: 针对支持向量机回归预测精度与训练样本尺寸不成正比的问题,结合支持向量机分类与回归算法,提出一种大样本数据分类回归预测改进算法。设计训练样本尺寸寻优算法,根据先验知识对样本数据进行人为分类,训练分类模型,基于支持向量机得到各类别样本的回归预测模型,并对数据进行预测。使用上证指数的数据进行实验,结果表明,支持向量机先分类再回归算法预测得到的均方误差达到12.4,低于人工神经网络预测得到的47.8,更远低于支持向量机直接回归预测得到的436.9,验证了该方法的有效性和可行性。

关键词: 支持向量机, 大样本, 尺寸优化, 分类, 回归, 预测

Abstract: A modified prediction method of large size data based on Support Vector Machine(SVM) classification and regression is proposed aiming at the problem that prediction accuracy of SVM regression is not proportional to the size of training sample. The method combines the SVM classification and regression algorithms. The size of the sample data is optimized, and the sample data is classified based on a priori knowledge. According to the classification, the classification model is trained. Then it trains the regression model for training sample of all classes, and makes the prediction with large size data based on SVM classification and regression. With the case of Shanghai Composite Index, the Mean Squared Error(MSE) of values predicted by the new method based on SVM classification and regression is 12.4, lower than 47.8 predicted by Artificial Neural Network(ANN) and much lower than 436.9 predicted by SVM regression. These results verify the effectiveness and feasibility of the method.

Key words: Support Vector Machine(SVM), large sample, size optimization, classification, regression, prediction

中图分类号: