Author Login Chief Editor Login Reviewer Login Editor Login Remote Office

Computer Engineering ›› 2026, Vol. 52 ›› Issue (4): 239-251. doi: 10.19678/j.issn.1000-3428.0252071

• Computer Vision and Image Processing • Previous Articles     Next Articles

Neural Implicit Surface Reconstruction Method Based on Multi-View Mixed Consistency Constraints

ZHU Wenqian1, SONG Lijuan1,2,3,*(), GUO Xinru1, MA Zirui1,2,3   

  1. 1. School of Information Engineering, Ningxia University, Yinchuan 750021, Ningxia, China
    2. Ningxia Key Laboratory of Artificial Intelligence and Information Security for Channeling Computing Resources from the East to the West, Yinchuan 750021, Ningxia, China
    3. Collaborative Innovation Center for Ningxia Big Data and Artificial Intelligence Co-founded by Ningxia Municipality and Ministry of Education, Yinchuan 750021, Ningxia, China
  • Received:2025-01-20 Revised:2025-04-18 Online:2026-04-15 Published:2026-04-08
  • Contact: SONG Lijuan

基于多视图混合一致性约束的神经隐式表面重建方法

朱文倩1, 宋丽娟1,2,3,*(), 郭新茹1, 马子睿1,2,3   

  1. 1. 宁夏大学信息工程学院, 宁夏 银川 750021
    2. 宁夏"东数西算"人工智能与信息安全重点实验室, 宁夏 银川 750021
    3. 宁夏大数据与人工智能省部共建协同创新中心, 宁夏 银川 750021
  • 通讯作者: 宋丽娟
  • 作者简介:

    朱文倩(CCF学生会员), 女, 硕士研究生, 主研方向为计算机视觉

    宋丽娟(通信作者), 副教授、博士

    郭新茹, 硕士研究生

    马子睿, 副教授

  • 基金资助:
    宁夏回族自治区重点研发计划(2023BEG02009); 宁夏回族自治区自然科学基金(2024AAC03062)

Abstract:

Multi-view 3D reconstruction based on neural implicit surface learning includes inherent ambiguities in representing the geometric shape and appearance of complex objects. Consequently, the fine geometric details of an object are prone to being lost in sparse texture areas, boundaries, and large smooth surfaces, making accurate recovery difficult. To address this issue, this study proposes a novel neural implicit surface reconstruction method based on multi-view mixed consistency constraints. This method uses Multi-View Stereo (MVS), multi-view photometric consistency, feature consistency, and volume rendering techniques to optimize the implicit surface representation, enabling the reconstruction of object models with fine geometric details. First, a dense point generation module based on MVS is proposed to supplement detail information in the sparse texture areas and boundaries of the object surface, achieving multi-view geometric optimization of the object surface. Second, a multi-view mixed consistency constraints module is introduced, which uses the Signed Distance Function (SDF) to locate the zero-level set. It applies multi-view photometric consistency constraints to impose geometric constraints on the smooth regions of the object, supervising the extracted implicit surface. Additionally, multi-view feature consistency constraints are applied to surface points at the zero-crossing of the linearly interpolated SDF, compensating for pixel matching errors in texture-sparse or structurally complex regions, thereby optimizing the object reconstruction model. Finally, volume rendering technology is applied to produce high-quality image renderings from the implicit SDF, enabling precise surface reconstruction of objects. Experimental results show that, compared to methods such as Colmap, the proposed method increases the Peak Signal-to-Noise Ratio (PSNR) by over 40.3% on the DTU dataset and successfully enables accurate surface reconstruction of the objects.

Key words: neural implicit surface reconstruction, Multi-View Stereo (MVS), Signed Distance Function (SDF), multi-view mixed consistency, volume rendering

摘要:

在基于神经隐式表面学习的多视图三维重建过程中, 复杂物体的几何形状和外观表示存在潜在的模糊性。因此, 物体的几何细节信息在纹理稀疏区域、边界区域与较大光滑区域中容易丢失, 难以精确恢复。为解决这个问题, 提出一种基于多视图混合一致性约束的神经隐式表面重建方法。该方法采用多视图立体匹配(MVS)、多视图光度一致性与特征一致性、体渲染技术来优化隐式表面表示, 从而重建具有精细几何细节的复杂物体模型。首先, 提出一个基于MVS的稠密点生成模块, 通过MVS生成稠密点, 来补充物体表面纹理稀疏区域与边界区域的细节信息, 从而实现物体表面的多视图几何优化。其次, 提出多视图混合一致性约束模块, 通过符号距离函数(SDF)定位零水平集, 利用多视图光度一致性约束来对物体光滑区域进行几何约束, 监督所提取的隐式表面, 并对经过线性插值的SDF过零处的表面点应用多视图特征一致性约束, 弥补纹理稀疏区域或结构复杂区域像素匹配的误差, 从而优化物体重建模型。最后, 通过应用体渲染技术, 利用隐式的SDF得出高质量的图像渲染, 以实现复杂物体的精确表面重建。实验结果表明, 在DTU数据集中, 相比于Colmap等方法, 所提方法峰值信噪比(PSNR)提升了40.3%以上, 实现了物体表面的精确重建。

关键词: 神经隐式表面重建, 多视图立体匹配, 符号距离函数, 多视图混合一致性, 体渲染