作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2026, Vol. 52 ›› Issue (5): 192-202. doi: 10.19678/j.issn.1000-3428.0070478

• 计算机视觉与图形图像处理 • 上一篇    下一篇

基于梯度频率多阶段引导的深度图超分辨率方法

蔡江河1, 陈飞1,*(), 姜凡2, 程航3, 王美清3   

  1. 1. 福州大学计算机与大数据学院, 福建 福州 350108
    2. 北不列颠哥伦比亚大学计算机科学系, 加拿大 乔治王子城 V2N 4Z9
    3. 福州大学数学与统计学院, 福建 福州 350108
  • 收稿日期:2024-10-12 修回日期:2024-11-26 出版日期:2026-05-15 发布日期:2026-05-12
  • 通讯作者: 陈飞
  • 作者简介:

    蔡江河, 男, 硕士研究生, 主研方向为图像处理与重建

    陈飞(通信作者), 教授

    姜凡, 教授

    程航, 教授

    王美清, 教授

  • 基金资助:
    国家自然科学基金(62471141)

Depth Map Super-Resolution Method Based on Gradient—Frequency Multi-Stage Guidance

CAI Jianghe1, CHEN Fei1,*(), JIANG Fan2, CHEN Hang3, WANG Meiqing3   

  1. 1. College of Computer and Data Science, Fuzhou University, Fuzhou 350108, Fujian, China
    2. Department of Computer Science, University of Northern British Columbia, Prince George V2N 4Z9, Canada
    3. School of Mathematics and Statistics, Fuzhou University, Fuzhou 350108, Fujian, China
  • Received:2024-10-12 Revised:2024-11-26 Online:2026-05-15 Published:2026-05-12
  • Contact: CHEN Fei

摘要:

颜色图像引导的深度图超分辨率(GDSR)旨在利用同一场景下高分辨率(HR)彩色图像提供的引导信息, 将低分辨率(LR)深度图重建为HR深度图。尽管基于空间域的学习方法可以有效提高深度图的整体重建质量, 但在从LR深度图进行重建时, 仍然面临边缘结构模糊的问题。为了解决该问题, 提出梯度频率引导的多阶段整合深度图重建网络(GFNet), 利用颜色图像的梯度先验和频率信息来增强深度边缘结构细节的重建。首先, 设计一个梯度特征提取(GFE)模块, 结合RGB图像的梯度先验知识来优化LR深度图的梯度结构。其次, 设计一个空间频率双路引导融合(SFI)模块, 将RGB图像中精确的高频成分传递到LR深度图中, 从而引导深度图丢失的高频信息重建。最后, 采用一种新颖的隐式神经函数来提高深度图的分辨率。实验结果表明, 在8倍缩放因子的情况下, GFNet在数据集NYUv2、Middlebury以及真实场景数据集RGB-D-D上均方根误差(RMSE)指标分别达到了2.48、1.62、2.57 cm, 相比于结构更复杂的模型GeoDSR, 分别降低了0.14、0.06、0.12 cm, 并在边缘结构细节方面优于对比方法, 证明了所提方法的有效性。

关键词: 深度图超分辨率, 边缘结构, 梯度特征, 频率引导, 隐式神经函数

Abstract:

Color image Guided Depth Super-Resolution (GDSR) aims to reconstruct a High-Resolution (HR) depth map from its Low-Resolution (LR) version with guidance from HR color images of the same scene. Although learning-based methods used in the spatial domain can effectively enhance the overall reconstruction quality of depth maps, they exhibit edge structure blurring during reconstruction from LR depth maps. To address this issue, this paper proposes a Gradient—Frequency guided multi-stage integration Network (GFNet). This network utilizes gradient priors and frequency information from color images to enhance the reconstruction of edge structure details in depth maps. First, a gradient feature extraction module is designed to incorporate the gradient prior knowledge of RGB images and thus optimize the gradient structures of LR depth maps. Subsequently, a spatial—frequency dual-path guidance module is developed to map precise high-frequency components from RGB images onto LR depth maps, thereby guiding the reconstruction of high-frequency information lost in the depth maps. Finally, a novel implicit neural function is employed to improve the resolution of depth maps. Experimental results show that, under an 8× scaling factor, GFNet achieves RMSE values of 2.48, 1.62, and 2.57 cm on the NYUv2, Middlebury, and real-world RGB-D-D datasets, respectively, outperforming the more complex GeoDSR model by 0.14, 0.06, and 0.12 cm, respectively. Additionally, it surpasses comparison methods in terms of edge structure detail reconstruction, demonstrating its effectiveness.

Key words: depth map Super-Resolution (SR), edge structure, gradient feature, frequency guidance, implicit neural function