IA-MVS: Instance-Focused Adaptive Depth Sampling for Multi-View Stereo

Multi-view stereo (MVS) models based on progressive depth hypothesis narrowing have made remarkable advancements. However, existing methods haven't fully utilized the potential that the depth coverage of individual instances is smaller than that of the entire scene, which restricts further improvements in depth estimation precision. Moreover, inevitable deviations in the initial stage accumulate as the process advances. In this paper, we propose Instance-Adaptive MVS (IA-MVS). It enhances the precision of depth estimation by narrowing the depth hypothesis range and conducting refinement on each instance. Additionally, a filtering mechanism based on intra-instance depth continuity priors is incorporated to boost robustness. Furthermore, recognizing that existing confidence estimation can degrade IA-MVS performance on point clouds. We have developed a detailed mathematical model for confidence estimation based on conditional probability. The proposed method can be widely applied in models based on MVSNet without imposing extra training burdens. Our method achieves state-of-the-art performance on the DTU benchmark. The source code is available atthis https URL.
View on arXiv@article{wang2025_2505.12714, title={ IA-MVS: Instance-Focused Adaptive Depth Sampling for Multi-View Stereo }, author={ Yinzhe Wang and Yiwen Xiao and Hu Wang and Yiping Xu and Yan Tian }, journal={arXiv preprint arXiv:2505.12714}, year={ 2025 } }