计算机工程

• 图形图像处理 • 上一篇    下一篇

异构计算平台图像边缘检测算法优化研究

魏秋明  a,梁军  a,鲍泓  b,王晶  b,李论  a   

  1. (北京联合大学 a.电子信息技术实验实训基地; b.北京市信息服务工程重点实验室,北京 100101)
  • 收稿日期:2016-10-17 出版日期:2017-05-15 发布日期:2017-05-15
  • 作者简介:魏秋明(1995—),男,硕士,主研方向为图像处理、并行计算;梁军(通信作者),副教授、硕士;鲍泓,教授;王晶,硕士研究生;李论,硕士。
  • 基金项目:
    国家自然科学基金(NSFC61271370);北京市教育委员会科技计划面上项目(SQKM201411417010,KM201311417001)。

Research on Image Edge Detection Algorithm Optimization on Heterogeneous Computing Platform

WEI Qiuming  a,LIANG Jun  a,BAO Hong  b,WANG Jing  b,LI Lun  a      

  1. (a.Training Center of Electronic Information; b.Beijing Key Laboratory of Information Service Engineering, Beijing Union University,Beijing 100101,China)
  • Received:2016-10-17 Online:2017-05-15 Published:2017-05-15

摘要: 随着实际应用中图像数据规模的增大和分辨率的提高,图像边缘检测算法的性能成为制约图像实时处理的关键。从向量化访存、数据本地化以及条件分支优化3个方面出发,结合算法特性和底层硬件架构特征,研究Canny边缘检测算法在NVIDIA Tegra K1异构计算平台上的GPU性能优化。实验结果表明,与基于OpenCV3.0 CPU的Canny边缘检测算法相比,优化后的Canny边缘检测算法在不同图像数据规模下可达13.2倍~17.8倍的性能加速比,具有较好的检测性能。

关键词: 图像边缘检测, 异构计算平台, 向量化访存, 数据本地化, 条件分支优化

Abstract: With the increase of the size of the image data and the improvement of the image resolution,the performance of the image edge detection algorithm becomes the key to the real-time processing of the image.Based on the three aspects of quantitative acess memory,data localization and conditional branch optimization,this paper studies the GPU performance optimization of Canny edge detection algorithm on NVIDIA Tegra K1 heterogeneous computing platform combined with algorithm characteristics and underlying hardware architecture characteristics.The experimental results show that compared with the Canny edge detection algorithm based on OpenCV3.0 CPU,the optimized Canny edge detection algorithm achieves 13.2 times to 17.8 times performance acceleration ratio with different graphic data size,and has better detection performance.

Key words: image edge detection, heterogeneous computing platform, quantitative acess memory, data localization, conditional branch optimization

中图分类号: