
Computer Engineering (计算机工程)

• Artificial Intelligence and Recognition Technology •

Derivatives Feature Analysis of Feedforward Neural Networks

WEI Hai a, YANG Hua-shu b, SU Zhi-min a, GUI Yue c, DONG Meng-si a

  (a. Faculty of Electric Power Engineering; b. Faculty of Land Resources Engineering; c. Faculty of Architecture Engineering, Kunming University of Science and Technology, Kunming 650500, China)
  • Received: 2013-06-20  Online: 2014-07-15  Published: 2014-07-14
  • About the authors: WEI Hai (b. 1975), male, Associate Professor, Ph.D.; his main research interest is neural networks. YANG Hua-shu and SU Zhi-min are Professors; GUI Yue is an Associate Professor, Ph.D.; DONG Meng-si is a Master's degree candidate.
  • Funding: Supported by the National Natural Science Foundation of China (No. 51069003) and the Applied Basic Research Foundation of Yunnan Province (No. 2010ZC048).

Abstract: To analyze the first- and second-order partial derivatives of feedforward neural network outputs with respect to the inputs, the first-order partial derivative is first derived for a single-layer network; the chain rule is then applied to obtain formulas for the first- and second-order partial derivatives of multilayer networks. On this basis, the third-order partial derivative is derived, and for a two-layer network whose output-layer activation function is linear, a formula for the higher-order partial derivatives with respect to the inputs is obtained. Case studies show that the first- and second-order partial derivatives of a feedforward neural network are markedly less accurate than the network output itself, and large deviations sometimes appear near the boundaries of the input interval. The accuracy of the first- and second-order derivatives also drops noticeably as the number of hidden-layer neurons grows: at essentially the same training accuracy, a network with more hidden neurons exhibits worse derivative behavior than one with fewer.
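
For a two-layer network (one hidden layer with activation f and a linear output unit), the chain rule yields the formulas sketched below; the notation, with w_{ji} the input-to-hidden weights and v_j the hidden-to-output weights, is illustrative and not necessarily the paper's:

y = \sum_j v_j f(u_j) + c, \qquad u_j = \sum_i w_{ji} x_i + b_j

\frac{\partial y}{\partial x_i} = \sum_j v_j f'(u_j)\, w_{ji}, \qquad \frac{\partial^2 y}{\partial x_i \partial x_k} = \sum_j v_j f''(u_j)\, w_{ji} w_{jk}

and, by induction on the order (using the linearity of the output layer),

\frac{\partial^n y}{\partial x_{i_1} \cdots \partial x_{i_n}} = \sum_j v_j f^{(n)}(u_j)\, w_{j i_1} \cdots w_{j i_n}.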

Key words: feedforward neural networks, partial derivative, linear activation function, accuracy, network architecture, network output
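
As a concrete check of the formulas above, here is a minimal sketch (not the paper's experimental setup): a 1-input network with tanh hidden units, a linear output unit, and illustrative random weights, whose analytic first and second derivatives are compared against central finite differences.

import numpy as np

rng = np.random.default_rng(0)
n_hidden = 5
W = rng.normal(size=(n_hidden, 1))   # input-to-hidden weights w_j (illustrative values)
b = rng.normal(size=n_hidden)        # hidden biases b_j
v = rng.normal(size=n_hidden)        # hidden-to-output weights v_j
c = 0.0                              # output bias (linear output layer)

def net(x):
    """Hidden pre-activations u_j = w_j * x + b_j for scalar input x."""
    return W[:, 0] * x + b

def y(x):
    """Network output: linear combination of tanh hidden units."""
    return v @ np.tanh(net(x)) + c

def dy_dx(x):
    """First derivative: sum_j v_j * f'(u_j) * w_j, with f'(u) = 1 - tanh(u)^2."""
    t = np.tanh(net(x))
    return v @ ((1 - t**2) * W[:, 0])

def d2y_dx2(x):
    """Second derivative: sum_j v_j * f''(u_j) * w_j^2, with f''(u) = -2 tanh(u)(1 - tanh(u)^2)."""
    t = np.tanh(net(x))
    return v @ (-2 * t * (1 - t**2) * W[:, 0]**2)

# Compare against central finite differences at a sample point.
x0 = 0.3
h1, h2 = 1e-6, 1e-4
fd1 = (y(x0 + h1) - y(x0 - h1)) / (2 * h1)
fd2 = (y(x0 + h2) - 2 * y(x0) + y(x0 - h2)) / h2**2
print(f"analytic dy/dx   = {dy_dx(x0):+.8f}, finite diff = {fd1:+.8f}")
print(f"analytic d2y/dx2 = {d2y_dx2(x0):+.8f}, finite diff = {fd2:+.8f}")

On a freshly initialized network the analytic and finite-difference values agree to several digits, since both measure the same function. The abstract's point is different: after training, the derivatives of the trained network can still deviate substantially from those of the underlying target function, especially near the boundaries of the input interval and as the hidden layer grows.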

