ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2403.18913
  4. Cited By
UniDepth: Universal Monocular Metric Depth Estimation

UniDepth: Universal Monocular Metric Depth Estimation

27 March 2024
Luigi Piccinelli
Yung-Hsu Yang
Daniel Gehrig
Mattia Segu
Siyuan Li
Luc Van Gool
Fisher Yu
    VLMMDE
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)Github (897★)

Papers citing "UniDepth: Universal Monocular Metric Depth Estimation"

50 / 129 papers shown
Easy3D-Labels: Supervising Semantic Occupancy Estimation with 3D Pseudo-Labels for Automotive Perception
Easy3D-Labels: Supervising Semantic Occupancy Estimation with 3D Pseudo-Labels for Automotive Perception
Seamie Hayes
Ganesh Sistu
Ciarán Eising
Ciaran Eising
3DPC
323
3
0
27 Mar 2026
C3G: Learning Compact 3D Representations with 2K Gaussians
C3G: Learning Compact 3D Representations with 2K Gaussians
Honggyu An
Jaewoo Jung
Mungyeom Kim
Sunghwan Hong
Chaehyun Kim
...
Takuya Narihira
Hyuna Ko
J. Kim
Yuki Mitsufuji
Seungryong Kim
3DGS3DV
300
3
0
03 Dec 2025
DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling
DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling
Kairun Wen
Yuzhi Huang
Runyu Chen
Hui Zheng
Yunlong Lin
...
Justin Theiss
Yue Huang
Xinghao Ding
Rakesh Ranjan
Zhiwen Fan
VGen
492
7
0
02 Dec 2025
KM-ViPE: Online Tightly Coupled Vision-Language-Geometry Fusion for Open-Vocabulary Semantic SLAM
KM-ViPE: Online Tightly Coupled Vision-Language-Geometry Fusion for Open-Vocabulary Semantic SLAM
Zaid Nasser
Mikhail Iumanov
Tianhao Li
Maxim Popov
Jaafar Mahmoud
Malik Mohrat
Ilya Obrubov
Ekaterina Derevyanka
Ivan Sosin
Sergey Kolyubin
170
0
0
01 Dec 2025
EAG3R: Event-Augmented 3D Geometry Estimation for Dynamic and Extreme-Lighting Scenes
EAG3R: Event-Augmented 3D Geometry Estimation for Dynamic and Extreme-Lighting Scenes
Xiaoshan Wu
Yifei Yu
Xiaoyang Lyu
Yihua Huang
Bo Wang
Baoheng Zhang
Zhongrui Wang
Xiaojuan Qi
3DGS
160
1
0
30 Nov 2025
Seeing the Wind from a Falling Leaf
Seeing the Wind from a Falling Leaf
Zhiyuan Gao
Jiageng Mao
Hong-Xing Yu
Haozhe Lou
Emily Yue-Ting Jia
J. Barbič
Jiajun Wu
Yue Wang
VGenPINN
322
3
0
30 Nov 2025
Fin3R: Fine-tuning Feed-forward 3D Reconstruction Models via Monocular Knowledge Distillation
Fin3R: Fine-tuning Feed-forward 3D Reconstruction Models via Monocular Knowledge Distillation
Weining Ren
Hongjun Wang
Xiao Tan
Kai Han
178
1
0
27 Nov 2025
Depth Anything 3: Recovering the Visual Space from Any Views
Depth Anything 3: Recovering the Visual Space from Any Views
Haotong Lin
Sili Chen
Junhao Liew
Donny Y. Chen
Z. Li
Guang Shi
Jiashi Feng
Bingyi Kang
3DVVLMMDE
1.0K
162
0
13 Nov 2025
Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos
Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos
Qixiu Li
Yu Deng
Yaobo Liang
L. Luo
Lei Zhou
...
Hao Chen
Lily Sun
Dong Chen
J. Yang
B. Guo
184
18
0
24 Oct 2025
GeoDiff: Geometry-Guided Diffusion for Metric Depth Estimation
GeoDiff: Geometry-Guided Diffusion for Metric Depth Estimation
Tuan Pham
Thanh-Tung Le
Xiaohui Xie
Stephan Mandt
DiffMMDE
306
0
0
21 Oct 2025
PAGE-4D: Disentangled Pose and Geometry Estimation for VGGT-4D Perception
PAGE-4D: Disentangled Pose and Geometry Estimation for VGGT-4D Perception
Kaichen Zhou
Y. Wang
Grace Chen
Xinhai Chang
Gaspard Beaudouin
Fangneng Zhan
Paul Liang
Mengyu Wang
ViT
392
1
0
20 Oct 2025
Leveraging 2D Priors and SDF Guidance for Dynamic Urban Scene Rendering
Leveraging 2D Priors and SDF Guidance for Dynamic Urban Scene Rendering
Siddharth Tourani
Jayaram Reddy
Akash Kumbar
Satyajit Tourani
Nishant Goyal
Madhava Krishna
N. Dinesh Reddy
M. H. Khan
3DGS
169
0
0
15 Oct 2025
XD-RCDepth: Lightweight Radar-Camera Depth Estimation with Explainability-Aligned and Distribution-Aware Distillation
XD-RCDepth: Lightweight Radar-Camera Depth Estimation with Explainability-Aligned and Distribution-Aware Distillation
Huawei Sun
Zixu Wang
Xiangyuan Peng
Julius Ott
Georg Stettinger
Lorenzo Servadei
Robert Wille
160
0
0
15 Oct 2025
Prompt-Guided Spatial Understanding with RGB-D Transformers for Fine-Grained Object Relation Reasoning
Prompt-Guided Spatial Understanding with RGB-D Transformers for Fine-Grained Object Relation Reasoning
Tanner Muturi
Blessing Agyei Kyem
Joshua Kofi Asamoah
Neema Jakisa Owor
Richard Dyzinela
Andrews Danyo
Y. Adu-Gyamfi
Armstrong Aboah
LRM
180
3
0
13 Oct 2025
WorldMirror: Universal 3D World Reconstruction with Any-Prior Prompting
WorldMirror: Universal 3D World Reconstruction with Any-Prior Prompting
Yifan Liu
Zhiyuan Min
Zhenwei Wang
Junta Wu
Tengfei Wang
Yixuan Yuan
Yawei Luo
Chunchao Guo
3DGS
223
28
0
12 Oct 2025
Mono4DEditor: Text-Driven 4D Scene Editing from Monocular Video via Point-Level Localization of Language-Embedded Gaussians
Mono4DEditor: Text-Driven 4D Scene Editing from Monocular Video via Point-Level Localization of Language-Embedded Gaussians
Jin-Chuan Shi
Chengye Su
Jiajun Wang
Ariel Shamir
Miao Wang
DiffM3DGSVGen
201
1
0
10 Oct 2025
Hybrid-grained Feature Aggregation with Coarse-to-fine Language Guidance for Self-supervised Monocular Depth Estimation
Hybrid-grained Feature Aggregation with Coarse-to-fine Language Guidance for Self-supervised Monocular Depth Estimation
Wenyao Zhang
Hongsi Liu
Bohan Li
Jiawei He
Zekun Qi
Yunnan Wang
Shengyang Zhao
Xinqiang Yu
Wenjun Zeng
Jianfeng Dong
VLMMDE
266
4
0
10 Oct 2025
MoRe: Monocular Geometry Refinement via Graph Optimization for Cross-View Consistency
MoRe: Monocular Geometry Refinement via Graph Optimization for Cross-View Consistency
Dongki Jung
Jaehoon Choi
Yonghan Lee
Sungmin Eum
Heesung Kwon
Dinesh Manocha
193
1
0
08 Oct 2025
MorphoSim: An Interactive, Controllable, and Editable Language-guided 4D World Simulator
MorphoSim: An Interactive, Controllable, and Editable Language-guided 4D World Simulator
Xuehai He
Shijie Zhou
Thivyanth Venkateswaran
Kaizhi Zheng
Ziyu Wan
A. Kadambi
Xin Eric Wang
VGenSyDaAI4CE
194
1
0
05 Oct 2025
From Tokens to Nodes: Semantic-Guided Motion Control for Dynamic 3D Gaussian Splatting
From Tokens to Nodes: Semantic-Guided Motion Control for Dynamic 3D Gaussian Splatting
Jianing Chen
Zehao Li
Yujun Cai
Hao Jiang
Shuqin Gao
Honglong Zhao
Tianlu Mao
Y. Zhang
3DGS
139
1
0
03 Oct 2025
Instant4D: 4D Gaussian Splatting in Minutes
Instant4D: 4D Gaussian Splatting in Minutes
Zhanpeng Luo
Haoxi Ran
Li Lu
3DGSVGen
197
4
0
01 Oct 2025
DA$^{2}$: Depth Anything in Any Direction
DA2^{2}2: Depth Anything in Any Direction
Haodong Li
Wangguangdong Zheng
Jing He
Yuhao Liu
Xin Lin
Xin Yang
Ying-Cong Chen
Chunchao Guo
MDE
654
10
0
30 Sep 2025
BRIDGE -- Building Reinforcement-Learning Depth-to-Image Data Generation Engine for Monocular Depth Estimation
BRIDGE -- Building Reinforcement-Learning Depth-to-Image Data Generation Engine for Monocular Depth Estimation
Dingning Liu
Haoyu Guo
Jingyi Zhou
Tong He
OffRLMDE
374
0
0
29 Sep 2025
DepthLM: Metric Depth From Vision Language Models
DepthLM: Metric Depth From Vision Language Models
Zhipeng Cai
Ching-Feng Yeh
Hu Xu
Zhuang Liu
Gregory Meyer
X. Lei
Changsheng Zhao
Shang-Wen Li
Vikas Chandra
Yangyang Shi
VLM3DV
352
12
0
29 Sep 2025
Orientation-anchored Hyper-Gaussian for 4D Reconstruction from Casual Videos
Orientation-anchored Hyper-Gaussian for 4D Reconstruction from Casual Videos
Junyi Wu
Jiachen Tao
Haoxuan Wang
Gaowen Liu
Ramana Rao Kompella
Yan Yan
3DGS
186
5
0
27 Sep 2025
SingRef6D: Monocular Novel Object Pose Estimation with a Single RGB Reference
SingRef6D: Monocular Novel Object Pose Estimation with a Single RGB Reference
Jiahui Wang
H. Zhu
Haoren Guo
Abdullah Al Mamun
Cheng Xiang
T. Lee
162
1
0
26 Sep 2025
EmbodiedSplat: Personalized Real-to-Sim-to-Real Navigation with Gaussian Splats from a Mobile Device
EmbodiedSplat: Personalized Real-to-Sim-to-Real Navigation with Gaussian Splats from a Mobile Device
Gunjan Chhablani
Xiaomeng Ye
Muhammad Zubair Irshad
Z. Kira
3DGS
216
4
0
22 Sep 2025
Taming Video Models for 3D and 4D Generation via Zero-Shot Camera Control
Taming Video Models for 3D and 4D Generation via Zero-Shot Camera Control
Chenxi Song
Yanming Yang
Tong Zhao
Ruibo Li
Chi Zhang
VGen
320
9
0
18 Sep 2025
MapAnything: Mapping Urban Assets using Single Street-View Images
MapAnything: Mapping Urban Assets using Single Street-View Images
Miriam Louise Carnot
Jonas Kunze
Erik Fastermann
Eric Peukert
André Ludwig
Bogdan Franczyk
131
0
0
18 Sep 2025
ROOM: A Physics-Based Continuum Robot Simulator for Photorealistic Medical Datasets Generation
ROOM: A Physics-Based Continuum Robot Simulator for Photorealistic Medical Datasets Generation
Salvatore Esposito
Matías Mattamala
Daniel Rebain
Francis Xiatian Zhang
Kevin Dhaliwal
Mohsen Khadem
Subramanian Ramamoorthy
178
0
0
16 Sep 2025
Exploring Spectral Characteristics for Single Image Reflection Removal
Exploring Spectral Characteristics for Single Image Reflection Removal
Pengbo Guo
Chengxu Liu
Guoshuai Zhao
Xingsong Hou
Jialie Shen
Xueming Qian
142
0
0
16 Sep 2025
Loc$^2$: Interpretable Cross-View Localization via Depth-Lifted Local Feature Matching
Loc2^22: Interpretable Cross-View Localization via Depth-Lifted Local Feature Matching
Zimin Xia
Chenghao Xu
Alexandre Alahi
MDE
311
0
0
11 Sep 2025
DGFusion: Depth-Guided Sensor Fusion for Robust Semantic Perception
DGFusion: Depth-Guided Sensor Fusion for Robust Semantic Perception
Tim Broedermannn
Christos Sakaridis
Luigi Piccinelli
Wim Abbeloos
Luc Van Gool
MDE
375
3
0
11 Sep 2025
SpatialVID: A Large-Scale Video Dataset with Spatial Annotations
SpatialVID: A Large-Scale Video Dataset with Spatial Annotations
Jiahao Wang
Yufeng Yuan
Rujie Zheng
Youtian Lin
Jian Gao
...
Xiaoxiao Long
Hao Zhu
Z. Zhang
X. Cao
Yao Yao
VGen
446
26
0
11 Sep 2025
Zero-Shot Metric Depth Estimation via Monocular Visual-Inertial Rescaling for Autonomous Aerial Navigation
Zero-Shot Metric Depth Estimation via Monocular Visual-Inertial Rescaling for Autonomous Aerial Navigation
Steven Yang
Xiaoyu Tian
K. Goel
Wennie Tabib
MDE
233
2
0
09 Sep 2025
S-LAM3D: Segmentation-Guided Monocular 3D Object Detection via Feature Space Fusion
S-LAM3D: Segmentation-Guided Monocular 3D Object Detection via Feature Space Fusion
Diana-Alexandra Sas
F. Oniga
3DPC
123
0
0
07 Sep 2025
MonoRelief V2: Leveraging Real Data for High-Fidelity Monocular Relief Recovery
MonoRelief V2: Leveraging Real Data for High-Fidelity Monocular Relief Recovery
Y. Zhang
Tongju Han
Lipeng Gao
Mingqiang Wei
Hui Liu
Changbao Li
Caiming Zhang
3DHMDE
224
0
0
27 Aug 2025
CoVeRaP: Cooperative Vehicular Perception through mmWave FMCW Radars
CoVeRaP: Cooperative Vehicular Perception through mmWave FMCW RadarsInternational Conference on Computer Communications and Networks (ICCCN), 2025
Jinyue Song
Hansol Ku
Jayneel Vora
Nelson Lee
Ahmad Kamari
P. Mohapatra
Parth H. Pathak
150
0
0
22 Aug 2025
Self-Supervised Sparse Sensor Fusion for Long Range Perception
Self-Supervised Sparse Sensor Fusion for Long Range Perception
Edoardo Palladin
Samuel Brucker
Filippo Ghilotti
Praveen Narayanan
Mario Bijelic
Felix Heide
SSL
181
3
0
19 Aug 2025
TRIDE: A Text-assisted Radar-Image weather-aware fusion network for Depth Estimation
TRIDE: A Text-assisted Radar-Image weather-aware fusion network for Depth Estimation
Huawei Sun
Zixu Wang
Hao Feng
Julius Ott
Lorenzo Servadei
Robert Wille
192
1
0
11 Aug 2025
Extending Foundational Monocular Depth Estimators to Fisheye Cameras with Calibration Tokens
Extending Foundational Monocular Depth Estimators to Fisheye Cameras with Calibration Tokens
Suchisrit Gangopadhyay
Jung-Hee Kim
Xien Chen
Patrick Rim
Hyoungseob Park
Alex Wong
MDEFedML
402
6
0
06 Aug 2025
Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images
Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images
Philipp Wulff
Felix Wimbauer
Dominik Muhle
Daniel Cremers
MDE
195
2
0
04 Aug 2025
IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation
IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation
Wenxuan Guo
Xiuwei Xu
Hang Yin
Ziwei Wang
Jianjiang Feng
Jie Zhou
Jiwen Lu
3DGS
235
10
0
01 Aug 2025
3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection
3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection
Yung-Hsu Yang
Luigi Piccinelli
Mattia Segu
Siyuan Li
Rui Huang
Yuqian Fu
Marc Pollefeys
Hermann Blum
Z. Bauer
3DPC
320
10
0
31 Jul 2025
iLRM: An Iterative Large 3D Reconstruction Model
iLRM: An Iterative Large 3D Reconstruction Model
Gyeongjin Kang
Seungtae Nam
Xiangyu Sun
Sameh Khamis
Abdelrahman Mohamed
Eunbyung Park
Eunbyung Park
3DV3DGS
415
12
0
31 Jul 2025
LONG3R: Long Sequence Streaming 3D Reconstruction
LONG3R: Long Sequence Streaming 3D Reconstruction
Zhuoguang Chen
Minghui Qin
Tianyuan Yuan
Zhe Liu
Hang Zhao
292
27
0
24 Jul 2025
Towards Scalable Spatial Intelligence via 2D-to-3D Data Lifting
Towards Scalable Spatial Intelligence via 2D-to-3D Data Lifting
Xingyu Miao
Haoran Duan
Quanhao Qian
Jiuniu Wang
Yang Long
Ling Shao
Deli Zhao
Ran Xu
Gongjie Zhang
312
6
0
24 Jul 2025
SpatialTrackerV2: 3D Point Tracking Made Easy
SpatialTrackerV2: 3D Point Tracking Made Easy
Yuxi Xiao
Jianyuan Wang
Nan Xue
Nikita Karaev
Yuri Makarov
Bingyi Kang
Xing Zhu
Hujun Bao
Yujun Shen
Xiaowei Zhou
3DPCMDE
286
58
0
16 Jul 2025
Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation
Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation
Zhen Xu
Hongyu Zhou
Sida Peng
Haotong Lin
Haoyu Guo
...
Yue Wang
Ruizhen Hu
Yiyi Liao
Xiaowei Zhou
Hujun Bao
VLM
260
3
0
15 Jul 2025
DepthSync: Diffusion Guidance-Based Depth Synchronization for Scale- and Geometry-Consistent Video Depth Estimation
DepthSync: Diffusion Guidance-Based Depth Synchronization for Scale- and Geometry-Consistent Video Depth Estimation
Yue-Jiang Dong
Wang Zhao
Jiale Xu
Ying Shan
Song-Hai Zhang
DiffMMDE
384
3
0
02 Jul 2025
123
Next
Page 1 of 3