[1] DU Guiming,WANG Xia,WANG Guangyan,et al.Speech recognition based on convolutional neural networks[C]//Proceedings of 2016 IEEE International Conference on Signal and Image Processing.Washington D.C.,USA:IEEE Press,2016:708-711. [2] LI Haoxiang,LIN Zhe,SHEN Xiaohui,et al.A convolu-tional neural network cascade for face detection[C]//Proceedings of 2015 IEEE Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2015:5325-5334. [3] REN S Q,HE K M,GIRSHICK R,et al.Faster R-CNN:towards real-time object detection with region proposal networks[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,39(6):1137-1149. [4] SIMONYAN K,ZISSERMAN A.Very deep convolutional networks for large-scale image recognition[EB/OL].[2020-02-05].https://arxiv.org/pdf/1409.1556.pdf. [5] HE Kaiming,ZHANG Xiangyu,REN Shaoqing,et al.Deep residual learning for image recognition[C]//Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition.Washington D.C.,USA:IEEE Press,2016:770-778. [6] ANDREW W,YUNSUP L,DAVID A P,et al.The RISC-V instruction set manual,volume I:user-level ISA,version 2.0:UCB/EECS-2014-54[EB/OL].[2020-02-05].https://www2.eecs.berkeley.edu/Pubs/TechRpts/2014/EECS-2014-54.pdf. [7] LEI Silei.Research on open source processor and SoC based on RISC-V[J].Microcontrollers & Embedded Systems,2017,17(2):56-60.(in Chinese)雷思磊.RISC-V架构的开源处理器及SoC研究综述[J].单片机与嵌入式系统应用,2017,17(2):56-60. [8] ROVINSKI A,CHUN Z,AL-HAWAJ K,et al.Evaluating celerity:a 16-nm 695 Giga-RISC-V instructions/s manycore processor with synthesizable PLL[J].IEEE Solid-State Circuits Letters,2019,2(12):289-292. [9] CAVALCANTE M,SCHUIKI F,ZARUBA F,et al.Ara:a 1-GHz+ scalable and energy-efficient RISC-V vector processor with multiprecision floating-point support in 22-nm FD-SOI[J].IEEE Transactions on Very Large Scale Integration Systems,2020,28(2):530-543. [10] MELONI P,CAPOTONDI A,DERIU G,et al.Neuraghe:exploiting CPU-FPGA synergies for efficient and flexible CNN inference acceleration on Zynq SoCs[J].ACM Transactions on Reconfigurable Technology and Systems,2017,11(3):1-24. [11] XUE Chengbo,CAO Shan,JIANG Rongkun,et al.A reconfigurable pipelined architecture for convolutional neural network acceleration[C]//Proceedings of 2018 IEEE International Symposium on Circuits and Systems.Washington D.C.,USA:IEEE Press,2018:1-5. [12] YAO Yuchen,DUAN Qinghua,ZHANG Zhiqian,et al.A FPGA-based hardware accelerator for multiple convo-lutional neural networks[C]//Proceedings of 2018 IEEE International Conference on Solid-State and Integrated Circuit Technology.Washington D.C.,USA:IEEE Press,2018:1075-1077. [13] CHANG Mengchou,PAN Zegang,CHEN Junliang.Hardware accelerator for boosting convolution computation in image classification applications[C]//Proceedings of 2017 IEEE Global Conference on Consumer Electronics.Washington D.C.,USA:IEEE Press,2017:1-2. [14] FLAMAND E,ROSSI D,CONTI F,et al.GAP-8:a RISC-V SoC for AI at the edge of the IoT[C]//Proceedings of 2018 IEEE International Conference on Application-Specific Systems,Architectures and Processors.Washington D.C.,USA:IEEE Press,2018:1-4. [15] YANG Weike.Research on design method of convolution neural network accelerator based on RISC-V open source processor[D].Shanghai:Shanghai Jiao Tong University,2018.(in Chinese)杨维科.基于RISC-V开源处理器的卷积神经网络加速器设计方法研究[D].上海:上海交通大学,2018. [16] ASANOVIC K,AVIZIENIS R,BACHRACH J,et al.The rocket chip generator:UCB/EECS-2016-17[EB/OL].[2020-02-05].https://www2.eecs.berkeley.edu/Pubs/TechRpts/2016/EECS-2016-17.pdf. [17] YANG Weike,HE Guanghui,JING Naifeng.Design and implementation of CNN acceleration module based on Rocket-Chip open source processor[J].Microelectronics & Computer,2018,35(4):17-21.(in Chinese)杨维科,贺光辉,景乃锋.基于Rocket-Chip开源处理器的CNN加速模块的设计与实现[J].微电子学与计算机,2018,35(4):17-21. [18] LI Dongze,GONG Haoran,CHANG Yuchun.Implementing RISCV system-on-chip for acceleration of convolution operation and activation function based on FPGA[C]//Proceedings of 2018 IEEE International Conference on Solid-State and Integrated Circuit Technology.Washington D.C.,USA:IEEE Press,2018:1-3. [19] ZHANG Chen,LI Peng,SUN Guangyu,et al.Optimizing FPGA-based accelerator design for deep convolutional neural networks[C]//Proceedings of 2015 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays.New York,USA:ACM Press,2015:161-170. [20] CHEN Y,KRISHNA T,EMER J,et al.Eyeriss:an energy-efficient reconfigurable accelerator for deep convolutional neural networks[C]//Proceedings of 2016 IEEE International Solid-State Circuits Conference.Washington D.C.,USA:IEEE Press,2016:262-263. [21] ZHAO R Z,LUK W,NIU X Y,et al.Hardware acceleration for machine learning[C]//Proceedings of 2017 IEEE Computer Society Annual Symposium on VLSI.Washington D.C.,USA:IEEE Press,2017:645-650. [22] MA Y F,CAO Y,VRUDHULA S,et al.Optimizing the convolution operation to accelerate deep neural networks on FPGA[J].IEEE Transactions on Very Large Scale Integration Systems,2018,26(7):1354-1367. [23] CHEN Tianshi,DU Zidong,SUN Ninghui,et al.DianNao:a small-footprint high-throughput accelerator for ubiquitous machine-learning[J].ACM SIGPLAN Notices,2014,49(4):269-283. |