[1] |
PARK Y S,PARK K R,KIM J M,et al.Fast fourier transform benchmark on X86 Xeon system for multimedia data processing[J].Multimedia Tools and Applications,2015,76(4):6015-6030.
|
[2] |
YANG Liangliang,YANG Fuzeng,NOGUCHI N.Apple internal quality classification using X-ray and SVM[J].Control in Transportation Systems,2011,44(1):14145-14150.
|
[3] |
THOMPSON E A,AANDERSON T R.A CUDA implementation of the continuous space language model[J].Journal of Supercomputing,2014,68(1):65-86.
|
[4] |
李成军,周卫峰,朱重光.基于Intel SIMD指令的二维FFT优化算法[J].计算机工程与应用,2007,43(5):41-44.
|
[5] |
TIAN Xinmin,SAITO H,PREIS S V,et al.Effective SIMD vectorization for intel Xeon Phi coprocessors[J].Scientific Programming,2015,15(1):1-14.
|
[6] |
胡向东,杨剑新,朱英.高性能多核处理器申威1600[J].中国科学:信息科学,2015,45(4):513-522.
|
[7] |
彭飞,顾乃杰,高翔,等.龙芯3B的SIMD编译优化及分析[J].小型微型计算机系统,2012,33(12):2733-2737.
|
[8] |
ZHANG Xiang,WANG Qingwen,LIU Xin.Inertias and ranks of some Hermitian matrix functions with applications[J].Open Mathematics,2012,10(1):125-139.
|
[9] |
王昊,王向前.BWDSP SIMD编译的寄存器分配优化技术研究[J].单片机与嵌入式系统应用,2015,15(4):4-7.
|
[10] |
高伟,赵荣彩,韩林,等.SIMD自动向量化编译优化概述[J].软件学报,2015,26(6):1265-1284.
|
[11] |
解庆春,张云泉,王可,等.SIMD技术与向量数学库研究[J].计算机科学,2011,38(7):298-301.
|
[12] |
曹代,郭绍忠,张辛.基于申威26010处理器的扩展函数库实现与优化[J].计算机工程,2017,43(1):61-66.
|
[13] |
解庆春,张云泉,李焱,等.基于神威蓝光处理器的向量数学软件包[J].软件学报,2014,25(增刊):70-79.
|
[14] |
陈世淼,郭绍忠,陈建勋,等.一种基于SIMD功能部件处理器的三角函数性能优化算法[J].信息工程大学学报,2011,12(1):103-106.
|
[15] |
许瑾晨,郭绍忠,黄永忠,等.面向异构众核从核的数学函数库访存优化方法[J].计算机科学,2014,41(6):12-17.
|
[16] |
余成龙,王永文.SIMD非对齐访存结构设计与实现[J].计算机工程,2016,42(9):1-4.
|
[17] |
VAIDYA A S,SHAYESTEH A.SIMD divergence optimization through intra-warp compaction[J].ACM SIGARCH Computer Architecture News,2013,41(3): 368-379.
|
[18] |
凌坤,胡士文,连瑞琦.面向龙芯处理器SIMD扩展的编译器内在函数优化[C]//全国高性能计算学术年会论文集.济南: [出版者不详],2012:1-6.
|