UniDepth: Universal Monocular Metric Depth Estimation

27 March 2024

Luigi Piccinelli

Yung-Hsu Yang

Daniel Gehrig

Mattia Segu

Siyuan Li

Luc Van Gool

Fisher Yu

VLM

MDE

Papers citing "UniDepth: Universal Monocular Metric Depth Estimation"

50 / 129 papers shown

Easy3D-Labels: Supervising Semantic Occupancy Estimation with 3D Pseudo-Labels for Automotive Perception

Ciaran Eising

323

27 Mar 2026

C3G: Learning Compact 3D Representations with 2K Gaussians

...

300

03 Dec 2025

DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling

...

492

02 Dec 2025

KM-ViPE: Online Tightly Coupled Vision-Language-Geometry Fusion for Open-Vocabulary Semantic SLAM

170

01 Dec 2025

EAG3R: Event-Augmented 3D Geometry Estimation for Dynamic and Extreme-Lighting Scenes

160

30 Nov 2025

Seeing the Wind from a Falling Leaf

322

30 Nov 2025

Fin3R: Fine-tuning Feed-forward 3D Reconstruction Models via Monocular Knowledge Distillation

178

27 Nov 2025

Depth Anything 3: Recovering the Visual Space from Any Views

1.0K

162

13 Nov 2025

Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos

...

184

24 Oct 2025

GeoDiff: Geometry-Guided Diffusion for Metric Depth Estimation

306

21 Oct 2025

PAGE-4D: Disentangled Pose and Geometry Estimation for VGGT-4D Perception

392

20 Oct 2025

Leveraging 2D Priors and SDF Guidance for Dynamic Urban Scene Rendering

169

15 Oct 2025

XD-RCDepth: Lightweight Radar-Camera Depth Estimation with Explainability-Aligned and Distribution-Aware Distillation

160

15 Oct 2025

Prompt-Guided Spatial Understanding with RGB-D Transformers for Fine-Grained Object Relation Reasoning

180

13 Oct 2025

WorldMirror: Universal 3D World Reconstruction with Any-Prior Prompting

223

12 Oct 2025

Mono4DEditor: Text-Driven 4D Scene Editing from Monocular Video via Point-Level Localization of Language-Embedded Gaussians

201

10 Oct 2025

Hybrid-grained Feature Aggregation with Coarse-to-fine Language Guidance for Self-supervised Monocular Depth Estimation

266

10 Oct 2025

MoRe: Monocular Geometry Refinement via Graph Optimization for Cross-View Consistency

193

08 Oct 2025

MorphoSim: An Interactive, Controllable, and Editable Language-guided 4D World Simulator

Xuehai He

Shijie Zhou

Thivyanth Venkateswaran

194

05 Oct 2025

From Tokens to Nodes: Semantic-Guided Motion Control for Dynamic 3D Gaussian Splatting

139

03 Oct 2025

Instant4D: 4D Gaussian Splatting in Minutes

197

01 Oct 2025

DA$^{2}$: Depth Anything in Any Direction

654

30 Sep 2025

BRIDGE -- Building Reinforcement-Learning Depth-to-Image Data Generation Engine for Monocular Depth Estimation

374

29 Sep 2025

DepthLM: Metric Depth From Vision Language Models

352

29 Sep 2025

Orientation-anchored Hyper-Gaussian for 4D Reconstruction from Casual Videos

186

27 Sep 2025

SingRef6D: Monocular Novel Object Pose Estimation with a Single RGB Reference

162

26 Sep 2025

EmbodiedSplat: Personalized Real-to-Sim-to-Real Navigation with Gaussian Splats from a Mobile Device

Gunjan Chhablani

Xiaomeng Ye

Muhammad Zubair Irshad

Z. Kira

3DGS

216

22 Sep 2025

Taming Video Models for 3D and 4D Generation via Zero-Shot Camera Control

320

18 Sep 2025

MapAnything: Mapping Urban Assets using Single Street-View Images

131

18 Sep 2025

ROOM: A Physics-Based Continuum Robot Simulator for Photorealistic Medical Datasets Generation

Salvatore Esposito

Matías Mattamala

Daniel Rebain

Francis Xiatian Zhang

Kevin Dhaliwal

Mohsen Khadem

Subramanian Ramamoorthy

178

16 Sep 2025

Exploring Spectral Characteristics for Single Image Reflection Removal

142

16 Sep 2025

Loc$^2$: Interpretable Cross-View Localization via Depth-Lifted Local Feature Matching

311

11 Sep 2025

DGFusion: Depth-Guided Sensor Fusion for Robust Semantic Perception

375

11 Sep 2025

SpatialVID: A Large-Scale Video Dataset with Spatial Annotations

...

446

11 Sep 2025

Zero-Shot Metric Depth Estimation via Monocular Visual-Inertial Rescaling for Autonomous Aerial Navigation

233

09 Sep 2025

S-LAM3D: Segmentation-Guided Monocular 3D Object Detection via Feature Space Fusion

Diana-Alexandra Sas

F. Oniga

3DPC

123

07 Sep 2025

MonoRelief V2: Leveraging Real Data for High-Fidelity Monocular Relief Recovery

224

27 Aug 2025

CoVeRaP: Cooperative Vehicular Perception through mmWave FMCW Radars

International Conference on Computer Communications and Networks (ICCCN), 2025

150

22 Aug 2025

Self-Supervised Sparse Sensor Fusion for Long Range Perception

181

19 Aug 2025

TRIDE: A Text-assisted Radar-Image weather-aware fusion network for Depth Estimation

192

11 Aug 2025

Extending Foundational Monocular Depth Estimators to Fisheye Cameras with Calibration Tokens

Suchisrit Gangopadhyay

402

06 Aug 2025

Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images

195

04 Aug 2025

IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation

235

01 Aug 2025

3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection

320

31 Jul 2025

iLRM: An Iterative Large 3D Reconstruction Model

415

31 Jul 2025

LONG3R: Long Sequence Streaming 3D Reconstruction

292

24 Jul 2025

Towards Scalable Spatial Intelligence via 2D-to-3D Data Lifting

312

24 Jul 2025

SpatialTrackerV2: 3D Point Tracking Made Easy

286

16 Jul 2025

Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation

...

260

15 Jul 2025

DepthSync: Diffusion Guidance-Based Depth Synchronization for Scale- and Geometry-Consistent Video Depth Estimation

384

02 Jul 2025