Analysis of Individual Convergence Bound for Gradient Biased Stochastic DA Optimization Method

Computer Engineering, 2019, Vol. 45, Issue (10): 203-207,214. doi: 10.19678/j.issn.1000-3428.0052466

ZHANG Menghan^a, WANG Hai^a, LIU Xin^b, BAO Lei^a

a. Department of Information Engineering; b. Department of Basic Courses, PLA Army Academy of Artillery and Air Defense, Hefei 230031, China

Received: 2018-08-22 Revised: 2018-09-28 Published: 2019-10-15 Released: 2018-11-09
About the authors: ZHANG Menghan (born 1994), male, master's student, main research interests are machine learning and pattern recognition; WANG Hai, master's student; LIU Xin, professor; BAO Lei, lecturer, Ph.D.
Funding: National Natural Science Foundation of China (61673394).

Abstract

Abstract: When samples are not independent and identically distributed, the gradient estimate is biased throughout the iterations, and the optimal individual convergence bound cannot be determined under noise. To address this, a linear interpolation stochastic Dual Averaging (DA) optimization method is proposed. A proof of convergence for the DA method is given. Then, under biased gradient estimation, an individual convergence bound that does not accumulate bias is derived for the linear interpolation stochastic DA method, which guarantees the individual convergence accuracy of the optimization method for regularized loss functions. Experimental results show that, compared with the stochastic accelerated method, the proposed method achieves a faster individual convergence rate and higher convergence accuracy.

Key words: Dual Averaging (DA) method, stochastic optimization, individual convergence, biased gradient estimation, optimal convergence rate
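The abstract names the method but does not state its update rule, so the following is only a minimal sketch of what a linear interpolation dual averaging step with a biased gradient oracle could look like. It assumes a toy objective 0.5*||x - target||^2 without a regularizer, a Euclidean-ball constraint, a proximal weight of gamma*sqrt(t+1), interpolation weights in the style of quasi-monotone subgradient methods, and a hypothetical bias that decays as 1/(t+1). All function names and parameters are illustrative and are not taken from the paper.

```python
import math
import random


def project_ball(v, radius):
    """Project v onto the Euclidean ball of the given radius."""
    norm = math.sqrt(sum(c * c for c in v))
    if norm <= radius:
        return list(v)
    return [c * radius / norm for c in v]


def biased_gradient(x, target, t, rng, sigma=0.1, bias_scale=0.5):
    """Noisy gradient of 0.5*||x - target||^2 plus a decaying bias.

    The bias bias_scale / (t + 1) is a hypothetical stand-in for the effect
    of non-i.i.d. samples; it is an assumption, not the paper's model.
    """
    bias = bias_scale / (t + 1)
    return [xi - ti + rng.gauss(0.0, sigma) + bias for xi, ti in zip(x, target)]


def interpolated_da(target, steps=5000, radius=2.0, gamma=1.0, seed=0):
    """Dual averaging with a linear interpolation step; returns the last iterate."""
    rng = random.Random(seed)
    dim = len(target)
    x = [0.0] * dim  # individual (last) iterate
    s = [0.0] * dim  # running sum of gradient estimates (dual variable)
    for t in range(steps):
        g = biased_gradient(x, target, t, rng)
        s = [si + gi for si, gi in zip(s, g)]
        beta = gamma * math.sqrt(t + 1)  # growing proximal weight
        # DA point: argmin over the ball of <s, w> + (beta / 2) * ||w||^2,
        # which is the projection of -s / beta onto the ball.
        v = project_ball([-si / beta for si in s], radius)
        # Linear interpolation between the previous iterate and the DA point.
        w_old = (t + 1) / (t + 2)
        x = [w_old * xi + (1.0 - w_old) * vi for xi, vi in zip(x, v)]
    return x


def distance(a, b):
    """Euclidean distance between two vectors of equal length."""
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))


target = [0.6, -0.8]
x_last = interpolated_da(target)
print(distance(x_last, target))
```

The sketch returns the last iterate rather than an average of iterates, which is the quantity an individual convergence bound refers to. Because the assumed bias is summable only up to a logarithmic factor, its accumulated effect on the dual variable stays small relative to the growing proximal weight.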

CLC number: TP391

ZHANG Menghan, WANG Hai, LIU Xin, BAO Lei. Analysis of Individual Convergence Bound for Gradient Biased Stochastic DA Optimization Method[J]. Computer Engineering, 2019, 45(10): 203-207,214.

https://www.ecice06.com/CN/Y2019/V45/I10/203




