Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.01341
Cited By
Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer
2 July 2019
René Ranftl
Katrin Lasinger
David Hafner
Konrad Schindler
V. Koltun
MDE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer"
50 / 1,053 papers shown
Title
Advances in Radiance Field for Dynamic Scene: From Neural Field to Gaussian Field
Jinlong Fan
Xuepu Zeng
J. Zhang
M. Gong
Yuxiang Yang
Dacheng Tao
3DGS
AI4CE
36
0
0
15 May 2025
Depth Anything with Any Prior
Zehan Wang
Siyu Chen
Lihe Yang
Jialei Wang
Ziang Zhang
Hengshuang Zhao
Zhou Zhao
3DGS
VLM
MDE
37
0
0
15 May 2025
Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis
B. Ke
Kevin Qu
T. Wang
Nando Metzger
Shengyu Huang
Bo Li
Anton Obukhov
Konrad Schindler
DiffM
VLM
22
0
0
14 May 2025
Visually Interpretable Subtask Reasoning for Visual Question Answering
Yu Cheng
A. Goel
Hakan Bilen
LRM
29
0
0
12 May 2025
VGLD: Visually-Guided Linguistic Disambiguation for Monocular Depth Scale Recovery
Bojin Wu
Jing Chen
MDE
46
0
0
05 May 2025
Unaligned RGB Guided Hyperspectral Image Super-Resolution with Spatial-Spectral Concordance
Y. Zhang
Zeqiang Lai
T. Zhang
Ying Fu
Chenghu Zhou
42
1
0
04 May 2025
Learning Multi-frame and Monocular Prior for Estimating Geometry in Dynamic Scenes
S. Park
Jinwoo Shin
34
0
0
03 May 2025
LMDepth: Lightweight Mamba-based Monocular Depth Estimation for Real-World Deployment
Jiahuan Long
Xin Zhou
MDE
51
0
0
02 May 2025
JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers
Kwon Byung-Ki
Qi Dai
Lee Hyoseok
Chong Luo
Tae-Hyun Oh
71
0
0
01 May 2025
V3LMA: Visual 3D-enhanced Language Model for Autonomous Driving
Jannik Lübberstedt
Esteban Rivera
Nico Uhlemann
Markus Lienkamp
MLLM
63
0
0
30 Apr 2025
Leveraging Multi-Modal Saliency and Fusion for Gaze Target Detection
Athul M. Mathew
Arshad Ali Khan
Thariq Khalid
Faroq AL-Tam
R. Souissi
89
1
0
27 Apr 2025
LiDAR-Guided Monocular 3D Object Detection for Long-Range Railway Monitoring
Raul David Dominguez Sanchez
Xavier Jair Diaz Ortiz
Xingcheng Zhou
M. Ronecker
Michael Karner
Daniel Watzenig
Alois C. Knoll
103
0
0
25 Apr 2025
The Fourth Monocular Depth Estimation Challenge
Anton Obukhov
Matteo Poggi
Fabio Tosi
Ripudaman Singh Arora
Jaime Spencer
...
Tuan-Anh Yang
Minh-Quang Nguyen
T. Tran
Albert Luginov
Muhammad Shahzad
MDE
120
0
0
24 Apr 2025
Dual-Camera All-in-Focus Neural Radiance Fields
Xianrui Luo
Zijin Wu
Juewen Peng
Huiqiang Sun
Zhiguo Cao
Guosheng Lin
VGen
27
0
0
23 Apr 2025
MonoTher-Depth: Enhancing Thermal Depth Estimation via Confidence-Aware Distillation
Xingxing Zuo
Nikhil Ranganathan
Connor T. Lee
Georgia Gkioxari
Soon-Jo Chung
VLM
58
1
0
21 Apr 2025
VistaDepth: Frequency Modulation With Bias Reweighting For Enhanced Long-Range Depth Estimation
Mingxia Zhan
Li Zhang
Xiaomeng Chu
Beibei Wang
MDE
57
0
0
21 Apr 2025
Cyc3D: Fine-grained Controllable 3D Generation via Cycle Consistency Regularization
Hongbin Xu
C. Yu
Feng Xiao
Jiazheng Xing
Hai Ci
Weitao Chen
Ming Li
DiffM
26
0
0
21 Apr 2025
Landmark-Free Preoperative-to-Intraoperative Registration in Laparoscopic Liver Resection
Jun Zhou
Bingchen Gao
Kai Wang
Jialun Pei
Pheng-Ann Heng
Jing Qin
MedIm
32
0
0
21 Apr 2025
Seurat: From Moving Points to Depth
Seokju Cho
Jiahui Huang
S. Kim
Joon-Young Lee
3DPC
MDE
34
0
0
20 Apr 2025
PRISM: A Unified Framework for Photorealistic Reconstruction and Intrinsic Scene Modeling
Alara Dirik
Tuanfeng Y. Wang
Duygu Ceylan
Stefanos Zafeiriou
Anna Frühstück
DiffM
47
0
0
19 Apr 2025
Occlusion-Ordered Semantic Instance Segmentation
Soroosh Baselizadeh
Cheuk-To Yu
O. Veksler
Yuri Boykov
ISeg
3DV
58
0
0
18 Apr 2025
Image Editing with Diffusion Models: A Survey
Jia Wang
Jie Hu
Xiaoqi Ma
Hanghang Ma
Xiaoming Wei
Enhua Wu
66
0
0
17 Apr 2025
Boosting Multi-View Stereo with Depth Foundation Model in the Absence of Real-World Labels
Jie Zhu
Bo Peng
Zhe Zhang
Bingzheng Liu
Jianjun Lei
33
0
0
16 Apr 2025
Metric-Solver: Sliding Anchored Metric Depth Estimation from a Single Image
Tao Wen
J. Wang
Y. Chen
Shugong Xu
Chi Zhang
Xuelong Li
MDE
31
0
0
16 Apr 2025
TacoDepth: Towards Efficient Radar-Camera Depth Estimation with One-stage Fusion
Y. Wang
J. Li
Chaoyi Hong
Ruibo Li
Liusheng Sun
Xiao-yang Song
Zhe Wang
Zhiguo Cao
Guosheng Lin
MDE
29
0
0
16 Apr 2025
Real-World Depth Recovery via Structure Uncertainty Modeling and Inaccurate GT Depth Fitting
Delong Suzhang
Meng Yang
26
0
0
16 Apr 2025
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
Ziqi Pang
Xin Xu
Yu-Xiong Wang
DiffM
65
0
0
15 Apr 2025
DeepWheel: Generating a 3D Synthetic Wheel Dataset for Design and Performance Evaluation
Soyoung Yoo
Namwoo Kang
32
0
0
15 Apr 2025
GATE3D: Generalized Attention-based Task-synergized Estimation in 3D*
Eunsoo Im
Jung Kwon Lee
Changhyun Jee
36
0
0
15 Apr 2025
Art3D: Training-Free 3D Generation from Flat-Colored Illustration
Xiaoyan Cong
Jiayi Shen
Zekun Li
Rao Fu
Tao Lu
Srinath Sridhar
3DH
35
0
0
14 Apr 2025
Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction
Zeren Jiang
Chuanxia Zheng
Iro Laina
Diane Larlus
Andrea Vedaldi
VGen
43
0
0
10 Apr 2025
FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution
Gene Chou
Wenqi Xian
Guandao Yang
Mohamed Abdelfattah
Bharath Hariharan
Noah Snavely
Ning Yu
P. Debevec
MDE
29
0
0
09 Apr 2025
PromptHMR: Promptable Human Mesh Recovery
Yufu Wang
Yu Sun
Priyanka Patel
Kostas Daniilidis
Michael J. Black
Muhammed Kocabas
3DH
52
0
0
08 Apr 2025
A High-Force Gripper with Embedded Multimodal Sensing for Powerful and Perception Driven Grasping
Edoardo Del Bianco
Davide Torielli
Federico Rollo
Damiano Gasperini
Arturo Laurenzi
Lorenzo Baccelliere
L. Muratore
Marco Roveri
Nikos Tsagarakis
28
1
0
07 Apr 2025
Lumina-OmniLV: A Unified Multimodal Framework for General Low-Level Vision
Yuandong Pu
Le Zhuo
Kaiwen Zhu
Liangbin Xie
Wenlong Zhang
Xiangyu Chen
Peng Gao
Yu Qiao
Chao Dong
Yihao Liu
MLLM
63
1
0
07 Apr 2025
Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation
Xin Zhang
Robby T. Tan
Mamba
48
0
0
04 Apr 2025
All-day Depth Completion via Thermal-LiDAR Fusion
Janghyun Kim
Minseong Kweon
Jinsun Park
Ukcheol Shin
VLM
46
0
0
03 Apr 2025
Comprehensive Relighting: Generalizable and Consistent Monocular Human Relighting and Harmonization
J. Wang
Jingyuan Liu
Xin Sun
Krishna Kumar Singh
Zhixin Shu
...
Nanxuan Zhao
Tuanfeng Y. Wang
Simon Chen
Ulrich Neumann
Jae Shin Yoon
29
0
0
03 Apr 2025
DEPTHOR: Depth Enhancement from a Practical Light-Weight dToF Sensor and RGB Image
Jijun Xiang
Xuan Zhu
Xianqi Wang
Y. Wang
H. Zhang
Fei Guo
Xin-She Yang
36
0
0
02 Apr 2025
WorldScore: A Unified Evaluation Benchmark for World Generation
Haoyi Duan
Hong-Xing Yu
Sirui Chen
L. Fei-Fei
Jiajun Wu
VGen
65
1
0
01 Apr 2025
GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
Tian-Xing Xu
Xiangjun Gao
Wenbo Hu
Xiaoyu Li
Song-Hai Zhang
Ying Shan
VGen
MDE
60
1
0
01 Apr 2025
Beyond Static Scenes: Camera-controllable Background Generation for Human Motion
Mingshuai Yao
Mengting Chen
Qinye Zhou
Y. Zhang
Ming-Yu Liu
...
Chen Ju
Shuai Xiao
Qingwen Liu
Jinsong Lan
Wangmeng Zuo
DiffM
VGen
36
1
0
01 Apr 2025
Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views
Chong Bao
Xiyu Zhang
Zehao Yu
Jiale Shi
Guofeng Zhang
Songyou Peng
Zhaopeng Cui
3DGS
3DV
36
0
0
31 Mar 2025
Distance Estimation to Support Assistive Drones for the Visually Impaired using Robust Calibration
Suman Raj
Bhavani A Madhabhavi
Madhav Kumar
Prabhav Gupta
Yogesh Simmhan
43
1
0
31 Mar 2025
Easi3R: Estimating Disentangled Motion from DUSt3R Without Training
Xingyu Chen
Yue Chen
Yuliang Xiu
Andreas Geiger
Anpei Chen
3DPC
VGen
40
1
0
31 Mar 2025
Boosting Omnidirectional Stereo Matching with a Pre-trained Depth Foundation Model
Jannik Endres
Oliver Hahn
Charles Corbière
Simone Schaub-Meyer
Stefan Roth
Alexandre Alahi
MDE
37
0
0
30 Mar 2025
Mono2Stereo: A Benchmark and Empirical Study for Stereo Conversion
S. Yu
Yuxin Chen
Zhongang Qi
Zeke Xie
Yifan Wang
Lijun Wang
Ying Shan
Huchuan Lu
41
0
0
28 Mar 2025
MVSAnywhere: Zero-Shot Multi-View Stereo
Sergio Izquierdo
Mohamed Sayed
Michael Firman
Guillermo Garcia-Hernando
Daniyar Turmukhambetov
Javier Civera
Oisin Mac Aodha
Gabriel J. Brostow
Jamie Watson
3DV
39
3
0
28 Mar 2025
One Look is Enough: A Novel Seamless Patchwise Refinement for Zero-Shot Monocular Depth Estimation Models on High-Resolution Images
Byeongjun Kwon
Munchurl Kim
VLM
MDE
57
0
0
28 Mar 2025
Deep Depth Estimation from Thermal Image: Dataset, Benchmark, and Challenges
Ukcheol Shin
Jinsun Park
3DV
MDE
39
0
0
28 Mar 2025
1
2
3
4
...
20
21
22
Next