作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2009, Vol. 35 ›› Issue (10): 7-10. doi: 10.3969/j.issn.1000-3428.2009.10.003

• 博士论文 • 上一篇    下一篇

基于CUDA的矩阵乘法和FFT性能测试

肖 江,胡柯良,邓元勇   

  1. (中国科学院国家天文台,北京 100012)
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2009-05-20 发布日期:2009-05-20

Ability Test for Matrix-Multiplication and FFT Based on CUDA

XIAO Jiang, HU Ke-liang, DENG Yuan-yong   

  1. (National Astronomical Observatories, Chinese Academy of Sciences, Beijing 100012)
  • Received:1900-01-01 Revised:1900-01-01 Online:2009-05-20 Published:2009-05-20

摘要: 针对NVIDIA公司的CUDA技术用Geforce8800GT在Visual Studio2008环境下进行测试,从程序运行时间比较判断CUBLAS库、CUDA内核程序、CUDA驱动API、C循环程序与Intel MKL库以及FFTW库与CUFFT库运行响应的差异。测试结果表明,在大规模矩阵乘法和快速傅里叶变换的应用方面,相对于CPU,利用GPU运算性能可提高25倍以上。

关键词: 矩阵乘法, 快速傅里叶变换, 并行计算, GPU通用计算

Abstract: This paper introduces the result of a test that evaluates the effectiveness of Compute Unified Device Architecture(CUDA) using NVDIA GeForce8800GT and the compiler Visual Studio 2008. It tests the speed of NVIDIA CUBLAS, CUDA kernel, common C program, Intel MKL BLAS, CUDA driver API program, FFTW and CUFFT Library in matrix-multiplication and Fast Fourier Transform(FFT). Test result of the large scale data shows that the computing ability of GPU is 25 times better than that of CPU.

Key words: matrix-multiplication, Fast Fourier Transform(FFT), parallel computation, GPGPU

中图分类号: