Neural Multi-View Self-Calibrated Photometric Stereo without Photometric Stereo Cues

30 July 2025

Xu Cao

Takafumi Taketomi

3DV

ArXiv (abs)PDF HTML Github

Main:8 Pages

17 Figures

Bibliography:3 Pages

3 Tables

Appendix:6 Pages

Abstract

We propose a neural inverse rendering approach that jointly reconstructs geometry, spatially varying reflectance, and lighting conditions from multi-view images captured under varying directional lighting. Unlike prior multi-view photometric stereo methods that require light calibration or intermediate cues such as per-view normal maps, our method jointly optimizes all scene parameters from raw images in a single stage. We represent both geometry and reflectance as neural implicit fields and apply shadow-aware volume rendering. A spatial network first predicts the signed distance and a reflectance latent code for each scene point. A reflectance network then estimates reflectance values conditioned on the latent code and angularly encoded surface normal, view, and light directions. The proposed method outperforms state-of-the-art normal-guided approaches in shape and lighting estimation accuracy, generalizes to view-unaligned multi-light images, and handles objects with challenging geometry and reflectance.

View on arXiv

Comments on this paper