ZoeDepth: Zero-shot Transfer by Combining Relative and Metric Depth


23 February 2023
S. Bhat
R. Birkl
Diana Wofk
Peter Wonka
Matthias Müller
    VLM
    MDE

Papers citing "ZoeDepth: Zero-shot Transfer by Combining Relative and Metric Depth"

50 / 321 papers shown
Language-Depth Navigated Thermal and Visible Image Fusion
Jinchang Zhang
Zijun Li
Guoyu Lu
MDE
61
1
0
11 Mar 2025
VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation
Hanzhi Chen
Boyang Sun
Anran Zhang
Marc Pollefeys
Stefan Leutenegger
LM&Ro
65
0
0
10 Mar 2025
LightMotion: A Light and Tuning-free Method for Simulating Camera Motion in Video Generation
Quanjian Song
Zhihang Lin
Zhanpeng Zeng
Ziyue Zhang
Liujuan Cao
Rongrong Ji
VGen
61
0
0
09 Mar 2025
Towards Ambiguity-Free Spatial Foundation Model: Rethinking and Decoupling Depth Ambiguity
Xiaohao Xu
Feng Xue
X. Li
Haowei Li
S. M. I. Simon X. Yang
T. Zhang
Matthew Johnson-Roberson
Xiaonan Huang
3DV
41
0
0
08 Mar 2025
NTR-Gaussian: Nighttime Dynamic Thermal Reconstruction with 4D Gaussian Splatting Based on Thermodynamics
Kun Yang
Yuxiang Liu
Zeyu Cui
Yu Liu
Maojun Zhang
Shen Yan
Qing Wang
3DGS
67
0
0
05 Mar 2025
Back to the Future Cyclopean Stereo: a human perception approach combining deep and geometric constraints
Sherlon Almeida da Silva
Davi Geiger
Luiz Velho
Moacir Antonelli Ponti
38
0
0
28 Feb 2025
BEV-DWPVO: BEV-based Differentiable Weighted Procrustes for Low Scale-drift Monocular Visual Odometry on Ground
Yufei Wei
Sha Lu
Wangtao Lu
R. Xiong
Y. Wang
40
0
0
27 Feb 2025
UniDepthV2: Universal Monocular Metric Depth Estimation Made Simpler
Luigi Piccinelli
Christos Sakaridis
Y. Yang
Mattia Segu
Siyuan Li
Wim Abbeloos
Luc Van Gool
MDE
41
6
0
27 Feb 2025
View-Invariant Policy Learning via Zero-Shot Novel View Synthesis
Stephen Tian
Blake Wulfe
Kyle Sargent
Katherine Liu
Sergey Zakharov
Vitor Campagnolo Guizilini
Jiajun Wu
73
10
0
21 Feb 2025
CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image
Kaixin Yao
Longwen Zhang
Xinhao Yan
Yan Zeng
Qixuan Zhang
Wei Yang
Lan Xu
Jiayuan Gu
Jingyi Yu
24
2
0
18 Feb 2025
L4P: Low-Level 4D Vision Perception Unified
Abhishek Badki
Hang Su
Bowen Wen
Orazio Gallo
VLM
78
1
0
18 Feb 2025
CoL3D: Collaborative Learning of Single-view Depth and Camera Intrinsics for Metric 3D Shape Recovery
Chenghao Zhang
Lubin Fan
Shen Cao
Bojian Wu
Jieping Ye
76
0
0
13 Feb 2025
Towards Physical Understanding in Video Generation: A 3D Point Regularization Approach
Yunuo Chen
Junli Cao
Anil Kag
Vidit Goel
Sergei Korolev
Chenfanfu Jiang
Sergey Tulyakov
Jian Ren
DiffM
VGen
86
1
0
05 Feb 2025
Leveraging Stable Diffusion for Monocular Depth Estimation via Image Semantic Encoding
Jingming Xia
Guanqun Cao
Guang Ma
Yiben Luo
Qinzhao Li
John Oyekan
MDE
54
0
0
01 Feb 2025
Rethinking Encoder-Decoder Flow Through Shared Structures
Frederik Laboyrie
M. K. Yucel
Albert Saà-Garriga
AI4CE
40
0
0
24 Jan 2025
Enhancing Monocular Depth Estimation with Multi-Source Auxiliary Tasks
Alessio Quercia
Erenus Yildiz
Zhuo Cao
Kai Krajsek
Abigail Morrison
Ira Assent
Hanno Scharr
51
0
0
22 Jan 2025
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
Sili Chen
Hengkai Guo
Shengnan Zhu
Feihu Zhang
Zilong Huang
Jiashi Feng
Bingyi Kang
VLM
AuLLM
MDE
61
11
0
21 Jan 2025
Survey on Monocular Metric Depth Estimation
Jiuling Zhang
VLM
69
0
0
21 Jan 2025
Multi-modal Fusion and Query Refinement Network for Video Moment Retrieval and Highlight Detection
Yifang Xu
Yunzhuo Sun
Benxiang Zhai
Zien Xie
Youyao Jia
S. Du
37
2
0
18 Jan 2025
Joint Learning of Depth and Appearance for Portrait Image Animation
Xinya Ji
Gaspard Zoss
Prashanth Chandran
Lingchen Yang
Xun Cao
B. Solenthaler
D. Bradley
3DH
MDE
42
0
0
15 Jan 2025
HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos
Jinglei Zhang
Jiankang Deng
Chao Ma
Rolandos Alexandros Potamias
35
3
0
06 Jan 2025
Pointmap-Conditioned Diffusion for Consistent Novel View Synthesis
Thang-Anh-Quan Nguyen
Nathan Piasco
Luis Roldão
Moussâb Bennehar
D. Tsishkou
Laurent Caraffa
J. Tarel
R. Brémond
DiffM
47
1
0
06 Jan 2025
GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking
Weikang Bian
Zhaoyang Huang
Xiaoyu Shi
Yijin Li
Fu-Yun Wang
Hongsheng Li
3DGS
VGen
DiffM
34
3
0
05 Jan 2025
GeoDiffuser: Geometry-Based Image Editing with Diffusion Models
Rahul Sajnani
Jeroen Vanbaar
Jie Min
Kapil D. Katyal
Srinath Sridhar
DiffM
49
11
0
03 Jan 2025
PatchRefiner V2: Fast and Lightweight Real-Domain High-Resolution Metric Depth Estimation
Zhenyu Li
Wenqing Cui
S. Bhat
Peter Wonka
MDE
36
0
0
03 Jan 2025
TexAVi: Generating Stereoscopic VR Video Clips from Text Descriptions
Vriksha Srihari
R. Bhavya
Shruti Jayaraman
V. Mary Anita Rajam
DiffM
VGen
28
0
0
02 Jan 2025
Multi-Modality Driven LoRA for Adverse Condition Depth Estimation
Guanglei Yang
Rui Tian
Yongqiang Zhang
Zhun Zhong
Yongqiang Li
Wangmeng Zuo
31
0
0
31 Dec 2024
Scaling 4D Representations
João Carreira
Dilara Gokay
Michael King
Chuhan Zhang
Ignacio Rocco
...
Viorica Patraucean
Dima Damen
Pauline Luc
Mehdi S. M. Sajjadi
Andrew Zisserman
77
3
0
19 Dec 2024
V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations
Jin-Cheng Jhang
Tao Tu
Fu-En Wang
Ke Zhang
Min Sun
Cheng-Hao Kuo
71
2
0
16 Dec 2024
RoMeO: Robust Metric Visual Odometry
JunDa Cheng
Z. Cai
Zhaoxing Zhang
Wei Yin
Matthias Müller
Michael Paulitsch
Xin Yang
91
0
0
16 Dec 2024
You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale
Baorui Ma
Huachen Gao
Haoge Deng
Zhengxiong Luo
Tiejun Huang
Lulu Tang
Xinlong Wang
DiffM
VGen
114
14
0
09 Dec 2024
Pinco: Position-induced Consistent Adapter for Diffusion Transformer in Foreground-conditioned Inpainting
Guangben Lu
Yuzhen Du
Zhimin Sun
Ran Yi
Yifan Qi
Yizhe Tang
Tianyi Wang
Lizhuang Ma
Fangyuan Zou
DiffM
75
1
0
05 Dec 2024
Align3R: Aligned Monocular Depth Estimation for Dynamic Videos
Jiahao Lu
Tianyu Huang
Peng Li
Zhiyang Dou
Cheng Lin
Zhiming Cui
Z. Dong
Sai-Kit Yeung
Wenping Wang
Yuan-Bin Liu
VGen
MDE
98
7
0
04 Dec 2024
AVS-Net: Audio-Visual Scale Net for Self-supervised Monocular Metric Depth Estimation
Xiaohu Liu
Sascha Hornauer
Fabien Moutarde
Jialiang Lu
SSL
MDE
56
0
0
02 Dec 2024
SfM-Free 3D Gaussian Splatting via Hierarchical Training
Bo Ji
Angela Yao
3DGS
73
1
0
02 Dec 2024
OMNI-DC: Highly Robust Depth Completion with Multiresolution Depth Integration
Yiming Zuo
Willow Yang
Zeyu Ma
Jia Deng
MDE
85
2
0
28 Nov 2024
SharpDepth: Sharpening Metric Depth Predictions Using Diffusion Distillation
Duc-Hai Pham
Tung Do
P. Nguyen
Binh-Son Hua
K. Nguyen
Rang Nguyen
MDE
78
1
0
27 Nov 2024
Monocular Obstacle Avoidance Based on Inverse PPO for Fixed-wing UAVs
Haochen Chai
Meimei Su
Yang Lyu
Zhunga Liu
Chunhui Zhao
Quan Pan
71
0
0
27 Nov 2024
Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration
Junyuan Deng
Wei Yin
Xiaoyang Guo
Qian Zhang
Xiaotao Hu
Weiqiang Ren
Xiaoxiao Long
P. Tan
DiffM
MDE
87
1
0
26 Nov 2024
OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection
Zhongyu Xia
Jishuo Li
Zhiwei Lin
Xinhao Wang
Y. Wang
Ming-Hsuan Yang
VLM
66
2
0
26 Nov 2024
MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model
Chenjie Cao
Chaohui Yu
Shang Liu
Fan Wang
Xiangyang Xue
Yanwei Fu
87
1
0
25 Nov 2024
Generalizable Single-view Object Pose Estimation by Two-side Generating and Matching
Yujing Sun
Caiyi Sun
Yuan-Bin Liu
Yuexin Ma
S. Yiu
75
1
0
24 Nov 2024
Novel View Extrapolation with Video Diffusion Priors
Kunhao Liu
Ling Shao
Shijian Lu
VGen
75
3
0
21 Nov 2024
GalaxyEdit: Large-Scale Image Editing Dataset with Enhanced Diffusion Adapter
Aniruddha Bala
Rohan Jaiswal
Loay Rashid
Siddharth Roheda
72
0
0
21 Nov 2024
SpatialDreamer: Self-supervised Stereo Video Synthesis from Monocular Input
Zhen Lv
Yangqi Long
Congzhentao Huang
Cao Li
Chengfei Lv
Hao Ren
Dian Zheng
DiffM
VGen
MDE
112
5
0
18 Nov 2024
BEV-ODOM: Reducing Scale Drift in Monocular Visual Odometry with BEV Representation
Yufei Wei
Sha Lu
Fuzhang Han
R. Xiong
Yue Wang
26
1
0
15 Nov 2024
MFP3D: Monocular Food Portion Estimation Leveraging 3D Point Clouds
Jinge Ma
Xiaoyan Zhang
Gautham Vinod
S. Raghavan
Jiangpeng He
F. Zhu
37
1
0
14 Nov 2024
Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D Inpainting
Yian Wang
Xiaowen Qiu
Jiageng Liu
Zhehuan Chen
Jiting Cai
Yufei Wang
Tsun-Hsuan Wang
Zhou Xian
Chuang Gan
VGen
AI4CE
43
6
0
14 Nov 2024
ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning
David Junhao Zhang
Roni Paiss
Shiran Zada
Nikhil Karnad
David E. Jacobs
Yael Pritch
Inbar Mosseri
Mike Zheng Shou
Neal Wadhwa
Nataniel Ruiz
DiffM
VGen
69
15
0
07 Nov 2024
Enhancing Bronchoscopy Depth Estimation through Synthetic-to-Real Domain Adaptation
Qingyao Tian
Huai Liao
Xinyan Huang
Lujie Li
Hongbin Liu
MDE
29
0
0
07 Nov 2024