摘要: 针对H.264视频编解码标准复杂度高、运算量大的缺点,基于统一计算设备架构平台设计并实现CPU+GPU异构并行H.264解码器,利用GPU的并行计算能力和CPU的逻辑控制优势加快运行速度,提高解码性能。实验结果表明,与FFmpege中传统的串行解码器相比,利用GPU加速的H.264并行解码器能获得2倍~7倍的性能提升,各并行单独模块也可实现5倍~11倍的加速。
关键词:
图形处理器,
统一计算设备架构平台,
H.264标准,
视频编解码器,
并行化
Abstract: Aiming at the defects of H.264 such as high complexity and large amounts of computation,this paper designs and implements the CPU+GPU heterogeneous parallel H.264 decoder based on Compute Unified Device Architecture(CUDA),which makes full use of GPU’s parallel computing ability and CPU’s logic control advantages,to improve the running speed and decoding performance.Experimental result shows that the performance is improved 2~7 times when using GPU acceleration,and the each parallel module can also accelerate 5 to 11 times,compared with that of the traditional serial decoder in FFmpeg.
Key words:
Graphics Processing Unit(GPU),
Compute Unified Device Architecture(CUDA),
H.264 standard,
video codec,
parallelization
中图分类号:
陈勇,吴晓民,杨坚,奚宏生. 基于CUDA的H.264并行解码器设计与实现[J]. 计算机工程.
CHEN Yong,WU Xiaomin,YANG Jian,XI Hongsheng. Design and Implementation of H.264 Parallel Decoder Based on CUDA[J]. Computer Engineering.