[1] GIRSHICK R.Fast R-CNN[C]//Proceedings of IEEE International Conference on Computer Vision.Washington D.C.,USA:IEEE Press,2015:1440-1448. [2] ZHOU Feiyan,JIN Linpeng,DONG Jun.Review of convolutional neural network[J].Chinese Journal of Computers,2017,40(6):1229-1251.(in Chinese)周飞燕,金林鹏,董军.卷积神经网络研究综述[J].计算机学报,2017,40(6):1229-1251. [3] MNIH V,KAVUKCUOGLU K,SILVER D,et al.Playing Atari with deep reinforcement learning[EB/OL].[2019-01-05].https://arxiv.org/pdf/1312.5602v1.pdf. [4] BOJARSKI M,DEL TESTA D,DWORAKOWSKI D,et al.End to end learning for self-driving cars[EB/OL].[2019-01-05].https://arxiv.org/pdf/1604.07316.pdf. [5] ROSKA T,CHUA L O.The CNN universal machine:an analogic array computer[J].IEEE Transactions on Circuits and Systems II:Analog and Digital Signal Processing,2015,40(3):163-173. [6] CHEN Hongcai,CHENG Yu,ZHANG Changyou.Application of convolutional neural network in vehicle target detection[J].Journal of Software,2017,28(S1):107-114.(in Chinese)陈宏彩,程煜,张常有.卷积神经网络在车辆目标快速检测中的应用[J].软件学报,2017,28(S1):107-114. [7] LIAN Yiya,WU Xiaojun.Research on image super-resolution reconstruction of super deep convolutional neural network[J].Computer Engineering,2019,45(1):217-220.(in Chinese)连逸亚,吴小俊.超深卷积神经网络的图像超分辨率重建研究[J].计算机工程,2019,45(1):217-220. [8] SZEGEDY C,IOFFE S,VANHOUCKE V,et al.Inception-v4,inception-resnet and the impact of residual connections on learning[EB/OL].[2019-01-05].https://arxiv.org/pdf/1602.07261.pdf. [9] GUPTA S,ZHANG Wei,WANG Fei.Model accuracy and runtime tradeoff in distributed deep learning:a systematic study[C]//Proceedings of 2016 IEEE International Conference on Data Mining.Washington D.C.,USA:IEEE Press,2016:171-180. [10] KRIZHEVSKY A.One weird trick for parallelizing convolutional neural networks[EB/OL].[2019-01-05].https://arxiv.org/pdf/1404.5997.pdf. [11] FU Haohuan,LIAO Junfeng,YANG Jinzhe,et al.The Sunway Taihu Light supercomputer:system and applications[J].Science China Information Sciences,2016,59:1-16. [12] FANG Jiarui,FU Haohuan,ZHAO Wenlai,et al.swDNN:a library foraccelerating deep learning applications on Sunway Taihu Light[C]//Proceedings of 2017 IEEE International Parallel and Distributed Processing Symposium.Washington D.C.,USA:IEEE Press,2017:615-624. [13] JIA Yangqing,SHELHAMER E,DONAHUE J,et al.Caffe:convolutional architecture for fast feature embedding[EB/OL].[2019-01-05].https://arxiv.org/pdf/1408.5093.pdf,2014:675-678. [14] YU Yang,AN Hong,CHEN Junshi,et al.Pipelining computation and optimization strategies for scaling GROMACS on the Sunway many-core processor[C]//Proceedings of International Conference on Algorithms and Architectures for Parallel Processing.Berlin,Germany:Springer,2017:18-32. [15] LAVIN A,GRAY S.Fast algorithms for convolutional neural networks[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2016:4013-4021. [16] MATHIEU M,HENAFF M,LECUN Y.Fast training of convolutional networks through FFTs[EB/OL].[2019-01-05].https://arxiv.org/pdf/1312.5851.pdf. [17] KRIZHEVSKY A,SUTSKEVER I,HINTON G E.Imagenet classification with deep convolutional neural networks[C]//Proceedings of the 25th International Conference on Neural Information Processing Systems.New York,USA:ACM Press,2012:1097-1105. [18] SZEGEDY C,LIU Wei,JIA Yangqing,et al.Going deeper with convolutions[C]//Proceedings of 2015 IEEE Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2015:1-9. [19] SIMONYAN K,ZISSERMAN A.Very deep convolutional networks for large-scale image recognition[EB/OL].[2019-01-05].https://arxiv.org/pdf/1409.1556.pdf. [20] LECUN Y,BOTTOU L,BENGIO Y,et al.Gradient-based learning applied to document recognition[J].Proceedings of the IEEE,1998,86(11):2278-2324. [21] RONNEBERGER O,FISCHER P,BROX T.U-net:convolutional networks for biomedical image segmentation[C]//Proceedings of International Conference on Medical Image Computing and Computer-assisted Intervention.Berlin,Germany:Springer,2015:234-241. [22] CHETLUR S,WOOLLEY C,VANDERMERSCH P,et al.cuDNN:efficient primitives for deep learning[EB/OL].[2019-01-05].https://arxiv.org/pdf/1410.0759.pdf. [23] FANG Jiarui,FU Haohuan,JIANG Jinlei,et al.swCaffe:a parallel framework for accelerating deep learning applications on Sunway Taihu Light[C]//Proceedings of 2018 IEEE International Conference on Cluster Computing.Washington D.C.,USA:IEEE Press,2018:413-422. [24] WANG Endong,ZHANG Qing,SHEN Bo,et al.High-performance computing on the Intel® Xeon Phi[M].Berlin,Germany:Springer,2014:167-188. |