Abstract:
In terms of parallel decoding H.264 video stream problems, this paper builds CPU/GPU cooperative computing model to accelerate video encoding and decoding computing. This model uses Compute Unified Device Architecture(CUDA) language as GPU programming model, proposes and implements DCT inverse conversation and intra-frame prediction in a GPU accelerated computing. In the premise of maintaining higher calculation accuracy, combined with CUDA mixed programming, improves the computational performance of the system greatly. The algorithm uses CUDA language provided by NVIDIA, and realizes the DCT inverse conversation and intra-frame prediction on GPU. The experiment compares the parallel algorithm and the sole CPU, and verifies the accelerating effect of the parallel decoding algorithm by using different number of video streams. Experimental result shows that this system improves the video streaming codec efficiency, and it can accelerate 10 times faster than the average CPU sole calculation.
Key words:
Compute Unified Device Architecture(CUDA),
parallel computation,
H.264 standard,
DCT inverse transform,
intra-frame prediction
摘要: 针对并行处理H.264标准视频流解码问题,提出基于CPU/GPU的协同运算算法。以统一设备计算架构(CUDA)语言作为GPU编程模型,实现DCT逆变换与帧内预测在GPU中的加速运算。在保持较高计算精度的前提下,结合CUDA混合编程,提高系统的计算性能。利用NIVIDIA提供的CUDA语言,在解码过程中使DCT逆变换和帧内预测在GPU上并行实现,将并行算法与CPU单机实现进行比较,并用不同数量的视频流验证并行解码算法的加速效果。实验结果表明,该算法可大幅提高视频流的编解码效率,比CPU单机的平均计算加速比提高10倍。
关键词:
统一设备计算架构,
并行运算,
H.264标准,
DCT逆变换,
帧内预测
CLC Number:
CHEN Peng, CAO Jian-wei, CHEN Qing-kui. H.264 Parallel Decoding Algorithm Based on GPU[J]. Computer Engineering.
陈鹏,曹剑炜,陈庆奎. 基于GPU的H.264并行解码算法[J]. 计算机工程.