[1]NAUMOV M.Incomplete LU and Cholesky preconditioned iterative methods using CUSPARSE and CUBLAS[EB/OL].[2017-07-01].https://docs.nvidia.com/cuda/incomplete-lu-cholesky/.
[2]Nvidia.CUDA CUBLAS Library[EB/OL].[2017-07-01].http://developer.download.nvidia.com/compute/DevZone/docs/html/CUDALibraries/doc/CUBLAS_Library.pdf.
[3]Nvidia.CUDA CUSPARSE Library[EB/OL].[2017-07-01].http://developer.download.nvidia.com/compute/DevZone/doc/html/CUDALibraries/doc/CUSPARSE_Library.pdf.
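[4]ZHANG Jianfei,SHEN Defei.Preconditioned conjugate gradient method for sparse linear systems based on GPU[J].Journal of Computer Applications,2013,33(3):825-829.(in Chinese)
[5]CHEN Yao,ZHAO Yonghua,ZHAO Wei,et al.GPU-accelerated incomplete Cholesky factorization preconditioned conjugate gradient method[J].Journal of Computer Research and Development,2015,52(4):843-850.(in Chinese)
[6]YIN Jian.Research on GPU-based matrix multiplication optimization[D].Jinan:Shandong University,2015.(in Chinese)
[7]YANG Wangdong,LI Kenli.Implementation and optimization of HYB-format sparse matrix-vector multiplication on CPU+GPU heterogeneous systems[J].Computer Engineering & Science,2016,38(2):202-209.(in Chinese)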
[8]Computing Developer Home Page[EB/OL].[2017-07-01].http://developer.nvidia.com/object/gpucomputing.html.
[9]Nvidia.NVIDIA CUDA C programming guide[EB/OL].[2017-07-01].http://developer.download.Nvidia.com/compute/DevZone/docs/html/C/doc/CUDA_C_Programming_Guide.pdf.
[10]BELL N,GARLAND M.Efficient sparse matrix-vector multiplication on CUDA:NVR-2008-004[R].Santa Clara,USA:Nvidia Corporation,2008.
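[11]QIN Jin,GONG Chunye,HU Qingfeng,et al.Optimization of sparse diagonal matrix-vector multiplication based on the CUDA programming model[J].Computer Engineering & Science,2012,32(7):78-83.(in Chinese)
[12]BAI Hongtao,OUYANG Dantong,LI Ximing.Optimization of sparse matrix-vector multiplication based on GPU[J].Computer Science,2010,37(8):168-172.(in Chinese)
[13]YUAN E,ZHANG Yunquan,LIU Fangfang,et al.Implementation techniques for automatic performance optimization of SpMV and their application[J].Journal of Computer Research and Development,2009,46(7):1117-1126.(in Chinese)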
[14]BELGIN M,BACK G,RIBBENS C J.Pattern based sparse matrix representation for memory-efficient SMVM kernels[C]//Proceedings of the 23rd International Conference on Supercomputing.New York,USA:ACM Press,2009:100-109.
[15]CHOI J W,SINGH A,VUDUC R W.Model-driven autotuning of sparse matrix-vector multiply on GPUs[J].ACM SIGPLAN Notices,2010,45(5):115-125.
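[16]LIANG Tian.Research on GPU-based sparse matrix operation optimization[D].Wuhan:Huazhong University of Science and Technology,2012.(in Chinese)
[17]YANG Wangdong,LI Kenli,SHI Lin.A hybrid compression algorithm for quasi-diagonal matrices and its implementation of multiplication with vectors on GPU[J].Computer Science,2014,41(7):290-296.(in Chinese)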
[18]WILLIAMS S,OLIKER L,VUDUC R W,et al.Optimization of sparse matrix-vector multiplication on emerging multicore platforms:RC24704(W0812-047)[R].IBM Inc.,2008.
[19]XIA Jianming,WEI Demin.GPU implementation of the conjugate gradient method[J].Computer Engineering,2009,35(17):274-276.(in Chinese)
[20]JI Hao,LI Yaohang.Block conjugate gradient algorithms for least squares problems[J].Journal of Computational and Applied Mathematics,2017,317:203-217.
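[21]ZHANG Lan.Research on preconditioned iterative techniques for sparse matrix equation systems[D].Guangzhou:South China University of Technology,2010.(in Chinese)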
[22]University of Florida sparse matrix collection[EB/OL].[2017-07-01].http://www.cise.ufl.edu/research/sparse/matrices/list_by_id.html.