ZoeDepth: Zero-shot Transfer by Combining Relative and Metric Depth

23 February 2023
S. Bhat
R. Birkl
Diana Wofk
Peter Wonka
Matthias Müller
    VLM
    MDE

Papers citing "ZoeDepth: Zero-shot Transfer by Combining Relative and Metric Depth"

50 / 321 papers shown
Invisible Stitch: Generating Smooth 3D Scenes with Depth Inpainting
Paul Engstler
Andrea Vedaldi
Iro Laina
Christian Rupprecht
MDE
32
9
0
30 Apr 2024
HELPER-X: A Unified Instructable Embodied Agent to Tackle Four Interactive Vision-Language Domains with Memory-Augmented Language Models
Gabriel H. Sarch
Sahil Somani
Raghav Kapoor
Michael J. Tarr
Katerina Fragkiadaki
LM&Ro
LLMAG
29
3
0
29 Apr 2024
The Third Monocular Depth Estimation Challenge
Jaime Spencer
Fabio Tosi
Matteo Poggi
Ripudaman Singh Arora
Chris Russell
...
Albert Luginov
Muhammad Shahzad
Seyed Hosseini
Aleksander Trajcevski
James H. Elder
MDE
33
7
0
25 Apr 2024
G3R: Generating Rich and Fine-grained mmWave Radar Data from 2D Videos for Generalized Gesture Recognition
Kaikai Deng
Dong Zhao
Wenxin Zheng
Yue Ling
Kangwen Yin
Huadong Ma
28
1
0
23 Apr 2024
LTOS: Layout-controllable Text-Object Synthesis via Adaptive Cross-attention Fusions
Xiaoran Zhao
Tianhao Wu
Yu Lai
Zhiliang Tian
Zhen Huang
Yahui Liu
Zejiang He
Dongsheng Li
DiffM
31
1
0
21 Apr 2024
SPIdepth: Strengthened Pose Information for Self-supervised Monocular Depth Estimation
M. Lavrenyuk
MDE
24
2
0
18 Apr 2024
Food Portion Estimation via 3D Object Scaling
Gautham Vinod
Jiangpeng He
Zeman Shao
F. Zhu
22
5
0
18 Apr 2024
InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior
Zhiheng Liu
Ouyang Hao
Qiuyu Wang
Ka Leong Cheng
Jie Xiao
Kai Zhu
Nan Xue
Yu Liu
Yujun Shen
Yang Cao
DiffM
3DGS
41
20
0
17 Apr 2024
Predicting Long-horizon Futures by Conditioning on Geometry and Time
Tarasha Khurana
Deva Ramanan
AI4TS
41
0
0
17 Apr 2024
Taming Latent Diffusion Model for Neural Radiance Field Inpainting
C. Lin
Changil Kim
Jia-Bin Huang
Qinbo Li
Chih-Yao Ma
Johannes Kopf
Ming-Hsuan Yang
Hung-Yu Tseng
AI4CE
DiffM
21
10
0
15 Apr 2024
In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation
Han Xue
Qianru Sun
Li-Na Song
Wenjun Zhang
Zhiwu Huang
MLLM
36
0
0
15 Apr 2024
Probing the 3D Awareness of Visual Foundation Models
Mohamed El Banani
Amit Raj
Kevis-Kokitsi Maninis
Abhishek Kar
Yuanzhen Li
Michael Rubinstein
Deqing Sun
Leonidas J. Guibas
Justin Johnson
Varun Jampani
35
79
0
12 Apr 2024
Implicit and Explicit Language Guidance for Diffusion-based Visual Perception
Hefeng Wang
Jiale Cao
Jin Xie
Aiping Yang
Yanwei Pang
VLM
DiffM
35
2
0
11 Apr 2024
RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion
Jaidev Shriram
Alex Trevithick
Lingjie Liu
Ravi Ramamoorthi
DiffM
3DGS
73
55
0
10 Apr 2024
Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences
Axel Barroso-Laguna
Sowmya P. Munukutla
V. Prisacariu
Eric Brachmann
3DV
37
12
0
09 Apr 2024
SpatialTracker: Tracking Any 2D Pixels in 3D Space
Yuxi Xiao
Qianqian Wang
Shangzhan Zhang
Nan Xue
Sida Peng
Yujun Shen
Xiaowei Zhou
19
53
0
05 Apr 2024
Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning
Rui Li
Tobias Fischer
Mattia Segu
Marc Pollefeys
Luc Van Gool
Federico Tombari
21
8
0
04 Apr 2024
Gen3DSR: Generalizable 3D Scene Reconstruction via Divide and Conquer from a Single View
Andreea Dogaru
M. Ozer
Bernhard Egger
3DGS
59
4
0
04 Apr 2024
SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation
Junyan Ye
Qiyan Luo
Jinhua Yu
Huaping Zhong
Zhimeng Zheng
Conghui He
Weijia Li
32
12
0
03 Apr 2024
TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Autonomous Driving
Cheng Zhao
Su Sun
Ruoyu Wang
Yuliang Guo
Jun-Jun Wan
Zhou Huang
Xinyu Huang
Yingjie Victor Chen
Liu Ren
3DGS
45
4
0
03 Apr 2024
SAID-NeRF: Segmentation-AIDed NeRF for Depth Completion of Transparent Objects
Avinash Ummadisingu
Jongkeum Choi
Koki Yamane
Shimpei Masuda
Naoki Fukaya
Kuniyuki Takahashi
55
2
0
28 Mar 2024
UniDepth: Universal Monocular Metric Depth Estimation
Luigi Piccinelli
Yung-Hsu Yang
Christos Sakaridis
Mattia Segu
Siyuan Li
Luc Van Gool
Fisher Yu
VLM
MDE
73
127
0
27 Mar 2024
ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation
Suraj Patni
Aradhye Agarwal
Chetan Arora
VLM
DiffM
MDE
27
26
0
27 Mar 2024
Track Everything Everywhere Fast and Robustly
Yunzhou Song
Jiahui Lei
ZiYun Wang
Lingjie Liu
Kostas Daniilidis
27
5
0
26 Mar 2024
DN-Splatter: Depth and Normal Priors for Gaussian Splatting and Meshing
Matias Turkulainen
Xuqian Ren
Iaroslav Melekhov
Otto Seiskari
Esa Rahtu
Juho Kannala
3DGS
43
56
0
26 Mar 2024
MMVP: A Multimodal MoCap Dataset with Vision and Pressure Sensors
He Zhang
Shenghao Ren
Haolei Yuan
Jianhui Zhao
Fan Li
Shuangpeng Sun
Zhenghao Liang
Tao Yu
Qiu Shen
Xun Cao
35
4
0
26 Mar 2024
TRAM: Global Trajectory and Motion of 3D Humans from in-the-wild Videos
Yufu Wang
ZiYun Wang
Lingjie Liu
Kostas Daniilidis
37
25
0
26 Mar 2024
Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation
Mu Hu
Wei Yin
C. Zhang
Zhipeng Cai
Xiaoxiao Long
Kaixuan Wang
Gang Yu
Chunhua Shen
Shaojie Shen
3DGS
52
115
0
22 Mar 2024
DepthFM: Fast Monocular Depth Estimation with Flow Matching
Ming Gui
Johannes S. Fischer
Ulrich Prestel
Pingchuan Ma
Dmytro Kotovenko
Olga Grebenkova
S. A. Baumann
Vincent Tao Hu
Bjorn Ommer
MDE
34
52
0
20 Mar 2024
SpatialPIN: Enhancing Spatial Reasoning Capabilities of Vision-Language Models through Prompting and Interacting 3D Priors
Chenyang Ma
Kai Lu
Ta-Ying Cheng
Niki Trigoni
Andrew Markham
LRM
30
7
0
18 Mar 2024
Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors
Ruicheng Wang
Jianfeng Xiang
Jiaolong Yang
Xin Tong
DiffM
32
4
0
18 Mar 2024
Touch-GS: Visual-Tactile Supervised 3D Gaussian Splatting
Aiden Swann
Matthew Strong
Won Kyung Do
Gadiel Sznaier Camps
Mac Schwager
Monroe Kennedy
3DGS
36
9
0
14 Mar 2024
3D-VLA: A 3D Vision-Language-Action Generative World Model
Haoyu Zhen
Xiaowen Qiu
Peihao Chen
Jincheng Yang
Xin Yan
Yilun Du
Yining Hong
Chuang Gan
LM&Ro
VGen
PINN
34
89
0
14 Mar 2024
3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation
Frank Zhang
Yibo Zhang
Quan Zheng
R. Ma
W. Hua
Hujun Bao
Weiwei Xu
Changqing Zou
49
9
0
14 Mar 2024
LVIC: Multi-modality segmentation by Lifting Visual Info as Cue
Zichao Dong
Bowen Pang
Xufeng Huang
Hang Ji
Xin Zhan
Junbo Chen
3DPC
35
0
0
08 Mar 2024
Scene Depth Estimation from Traditional Oriental Landscape Paintings
Sungho Kang
Yeonghyeon Park
H. Park
Juneho Yi
30
0
0
06 Mar 2024
OPEx: A Component-Wise Analysis of LLM-Centric Agents in Embodied Instruction Following
Haochen Shi
Zhiyuan Sun
Xingdi Yuan
Marc-Alexandre Côté
Bang Liu
LLMAG
27
10
0
05 Mar 2024
Splat-Nav: Safe Real-Time Robot Navigation in Gaussian Splatting Maps
Timothy Chen
O. Shorinwa
Joseph Bruno
Javier Yu
Weijia Zeng
Keiko Nagami
Mac Schwager
3DGS
35
31
0
05 Mar 2024
How NeRFs and 3D Gaussian Splatting are Reshaping SLAM: a Survey
Fabio Tosi
Youming Zhang
Ziren Gong
Erik Sandström
S. Mattoccia
Martin R. Oswald
Matteo Poggi
3DGS
56
53
0
20 Feb 2024
DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation
Chong Zeng
Yue Dong
Pieter Peers
Youkang Kong
Hongzhi Wu
Xin Tong
27
27
0
19 Feb 2024
CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion
Shoubin Yu
Jaehong Yoon
Mohit Bansal
77
4
0
08 Feb 2024
SPAD : Spatially Aware Multiview Diffusers
Yash Kant
Ziyi Wu
Michael Vasilkovsky
Guocheng Qian
Jian Ren
R. A. Guler
Bernard Ghanem
Sergey Tulyakov
Igor Gilitschenski
Aliaksandr Siarohin
DiffM
22
34
0
07 Feb 2024
MoD-SLAM: Monocular Dense Mapping for Unbounded 3D Scene Reconstruction
Heng Zhou
Zhetao Guo
Shuhong Liu
Lechen Zhang
Qihao Wang
Yuxiang Ren
Mingrui Li
MDE
23
13
0
06 Feb 2024
Extreme Two-View Geometry From Object Poses with Diffusion Models
Yujing Sun
Caiyi Sun
Yuan-Bin Liu
Yuexin Ma
S. Yiu
25
2
0
05 Feb 2024
RIDERS: Radar-Infrared Depth Estimation for Robust Sensing
Han Li
Yukai Ma
Yuehao Huang
Yaqing Gu
Weihua Xu
Yong-Jin Liu
Xingxing Zuo
19
4
0
03 Feb 2024
Geometry Transfer for Stylizing Radiance Fields
Hyunyoung Jung
Seonghyeon Nam
N. Sarafianos
Sungjoo Yoo
Alexander Sorkine-Hornung
Rakesh Ranjan
37
10
0
01 Feb 2024
Template-Free Single-View 3D Human Digitalization with Diffusion-Guided LRM
Zhenzhen Weng
Jingyuan Liu
Hao Tan
Zhan Xu
Yang Zhou
Serena Yeung-Levy
Jimei Yang
3DH
30
8
0
22 Jan 2024
SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities
Boyuan Chen
Zhuo Xu
Sean Kirmani
Brian Ichter
Danny Driess
Pete Florence
Dorsa Sadigh
Leonidas J. Guibas
Fei Xia
LRM
ReLM
39
205
0
22 Jan 2024
General Flow as Foundation Affordance for Scalable Robot Learning
Chengbo Yuan
Chuan Wen
Tong Zhang
Yang Gao
AI4CE
21
31
0
21 Jan 2024
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Lihe Yang
Bingyi Kang
Zilong Huang
Xiaogang Xu
Jiashi Feng
Hengshuang Zhao
VLM
139
706
0
19 Jan 2024