VRP-UDF: Towards Unbiased Learning of Unsigned Distance Functions from Multi-view Images with Volume Rendering Priors

TPAMI 2026 / ECCV 2024

1School of Software, Tsinghua University, 2China Telecom Wanwei Information Technology Co., Ltd., 3Kuaishou Technology, 4Wayne State University

Abstract

Unsigned distance functions (UDFs) have been a vital representation for open surfaces. With different differentiable renderers, current methods are able to train neural networks to infer a UDF by minimizing the rendering errors with the UDF to the multi-view ground truth. However, these differentiable renderers are mainly handcrafted, which makes them either biased on ray-surface intersections, or sensitive to unsigned distance outliers, or not scalable to large scenes. To resolve these issues, we present a novel differentiable renderer to infer UDFs more accurately. Instead of using handcrafted equations, our differentiable renderer is a neural network which is pre-trained in a data-driven manner. It learns how to render unsigned distances into depth images, leading to a prior knowledge, dubbed volume rendering priors. To infer a UDF for an unseen scene from multiple RGB images, we generalize the learned volume rendering priors to map inferred unsigned distances in alpha blending for RGB image rendering. To reduce the bias of sampling in UDF inference, we utilize an auxiliary point sampling prior as an indicator of ray-surface intersection, and propose novel schemes towards more accurate and uniform sampling near the zero-level sets. We also propose a new strategy that leverages our pretrained volume rendering prior to serve as a general surface refiner, which can be integrated with various Gaussian reconstruction methods to optimize the Gaussian distributions and refine geometric details. Our results show that the learned volume rendering prior is unbiased, robust, scalable, 3D aware, and more importantly, easy to learn. Further experiments show that the volume rendering prior is also a general strategy to enhance other neural implicit representations such as signed distance function and occupancy. We evaluate our method on both widely used benchmarks and real scenes, and report superior performance over the state-of-the-art methods.

Method

In this paper, we (1) introduce volume rendering priors to infer UDFs from multi-view images. Our prior can be learned in a data-driven manner, which provides a novel perspective to recover geometry with prior knowledge through volume rendering. (2) propose a novel deep neural network and learning scheme, and report extensive analysis to learn an unbiased differentiable renderer for UDFs with robustness, scalability, and 3D awareness. (3) extend the learned VRP to various neural representations and Gaussian Splatting-based surface reconstruction to validate its generalization ability for downstream applications.

Here is an overview of our method. In the training phase, our volume rendering prior takes sliding windows of GT UDFs from training meshes as input, and outputs opaque densities for alpha blending. The parameters are optimized by the error between rendered depth and ground truth depth maps. During the testing phase, we freeze the volume rendering prior and use ground truth multi-view RGB images to optimize a randomly initialized UDF field.

Visualization Results

Comparison on Deepfashion3D Dataset

Visual comparisons on open surface reconstructions with error maps on DeepFashion3D dataset. The transition from blue to yellow indicates small to large reconstruction errors.

Comparison on DTU Dataset

Comparison on Replica Dataset

Comparison on Insects Dataset

Comparison on Real-Captured Datasets

Visualization Video

BibTeX

@article{zhang2026vrpudf,
      title={{VRP-UDF}: Towards Unbiased Learning of Unsigned Distance Functions from Multi-view Images with Volume Rendering Priors},
      author={Zhang, Wenyuan and Wang, Chunsheng and Shi, Kanle and Liu, Yu-Shen and Han, Zhizhong},
      journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
      year={2026},
      publisher={IEEE}
    }
@inproceedings{zhang2024learning,
      title={Learning unsigned distance functions from multi-view images with volume rendering priors},
      author={Zhang, Wenyuan and Shi, Kanle and Liu, Yu-Shen and Han, Zhizhong},
      booktitle={European Conference on Computer Vision},
      pages={397--415},
      year={2024},
      organization={Springer}
    }