ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.02145
  4. Cited By
Repurposing Diffusion-Based Image Generators for Monocular Depth
  Estimation

Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

4 December 2023
B. Ke
Anton Obukhov
Shengyu Huang
Nando Metzger
Rodrigo Caye Daudt
Konrad Schindler
    VLM
    MDE
ArXivPDFHTML

Papers citing "Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation"

41 / 41 papers shown
Title
VIN-NBV: A View Introspection Network for Next-Best-View Selection for Resource-Efficient 3D Reconstruction
VIN-NBV: A View Introspection Network for Next-Best-View Selection for Resource-Efficient 3D Reconstruction
Noah Frahm
Dongxu Zhao
Andrea Dunn Beltran
Ron Alterovitz
Jan-Michael Frahm
Junier Oliva
Roni Sengupta
63
0
0
09 May 2025
VGLD: Visually-Guided Linguistic Disambiguation for Monocular Depth Scale Recovery
VGLD: Visually-Guided Linguistic Disambiguation for Monocular Depth Scale Recovery
Bojin Wu
Jing Chen
MDE
42
0
0
05 May 2025
Fast Flow-based Visuomotor Policies via Conditional Optimal Transport Couplings
Fast Flow-based Visuomotor Policies via Conditional Optimal Transport Couplings
Andreas Sochopoulos
Nikolay Malkin
Nikolaos Tsagkas
João Moura
Michael Gienger
S. Vijayakumar
37
1
0
02 May 2025
LiDAR-Guided Monocular 3D Object Detection for Long-Range Railway Monitoring
LiDAR-Guided Monocular 3D Object Detection for Long-Range Railway Monitoring
Raul David Dominguez Sanchez
Xavier Jair Diaz Ortiz
Xingcheng Zhou
M. Ronecker
Michael Karner
Daniel Watzenig
Alois C. Knoll
73
0
0
25 Apr 2025
MonoTher-Depth: Enhancing Thermal Depth Estimation via Confidence-Aware Distillation
MonoTher-Depth: Enhancing Thermal Depth Estimation via Confidence-Aware Distillation
Xingxing Zuo
Nikhil Ranganathan
Connor T. Lee
Georgia Gkioxari
Soon-Jo Chung
VLM
51
1
0
21 Apr 2025
VistaDepth: Frequency Modulation With Bias Reweighting For Enhanced Long-Range Depth Estimation
VistaDepth: Frequency Modulation With Bias Reweighting For Enhanced Long-Range Depth Estimation
Mingxia Zhan
Li Zhang
Xiaomeng Chu
Beibei Wang
MDE
57
0
0
21 Apr 2025
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
Ziqi Pang
Xin Xu
Yu-Xiong Wang
DiffM
60
0
0
15 Apr 2025
Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation
Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation
Jiantao Lin
Xin Yang
Meixi Chen
Yingjie Xu
D. Yan
Leyi Wu
Xinli Xu
Lie Xu
Shunsi Zhang
Ying-Cong Chen
55
1
0
03 Mar 2025
Revisiting Gradient-based Uncertainty for Monocular Depth Estimation
Julia Hornauer
Amir El-Ghoussani
Vasileios Belagiannis
UQCV
50
0
0
09 Feb 2025
DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation
DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation
Chenguo Lin
Panwang Pan
Bangbang Yang
Zeming Li
Yadong Mu
3DGS
71
7
0
28 Jan 2025
Rethinking Encoder-Decoder Flow Through Shared Structures
Rethinking Encoder-Decoder Flow Through Shared Structures
Frederik Laboyrie
M. K. Yucel
Albert Saà-Garriga
AI4CE
40
0
0
24 Jan 2025
CheapNVS: Real-Time On-Device Narrow-Baseline Novel View Synthesis
CheapNVS: Real-Time On-Device Narrow-Baseline Novel View Synthesis
K. Georgiadis
M. K. Yucel
Albert Saà-Garriga
ViT
50
1
0
24 Jan 2025
Survey on Monocular Metric Depth Estimation
Survey on Monocular Metric Depth Estimation
Jiuling Zhang
VLM
69
0
0
21 Jan 2025
DPBridge: Latent Diffusion Bridge for Dense Prediction
DPBridge: Latent Diffusion Bridge for Dense Prediction
Haorui Ji
Taojun Lin
Hongdong Li
DiffM
46
1
0
29 Dec 2024
IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations
IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations
Zhibing Li
Tong Wu
Jing Tan
Mengchen Zhang
Jiaqi Wang
D. Lin
97
1
0
16 Dec 2024
DepthSplat: Connecting Gaussian Splatting and Depth
DepthSplat: Connecting Gaussian Splatting and Depth
Haofei Xu
Songyou Peng
Fangjinhua Wang
Hermann Blum
Dániel Baráth
Andreas Geiger
Marc Pollefeys
3DGS
MDE
50
29
0
17 Oct 2024
A Simple Approach to Unifying Diffusion-based Conditional Generation
A Simple Approach to Unifying Diffusion-based Conditional Generation
Xirui Li
Charles Herrmann
Kelvin C.K. Chan
Yinxiao Li
Deqing Sun
Chao Ma
Ming Yang
DiffM
VLM
38
1
0
15 Oct 2024
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free
Ziyue Li
Tianyi Zhou
MoE
66
16
0
14 Oct 2024
High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity
High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity
Qian Yu
Peng-Tao Jiang
Hao Zhang
Jinwei Chen
Bo Li
Lihe Zhang
Huchuan Lu
44
2
0
14 Oct 2024
IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera
IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera
Jian Huang
Chengrui Dong
Peidong Liu
Peidong Liu
3DGS
29
2
0
10 Oct 2024
Diffusion Models in 3D Vision: A Survey
Diffusion Models in 3D Vision: A Survey
Zhen Wang
Dongyuan Li
Renhe Jiang
Tianyu He
Jiang Bian
Renhe Jiang
MedIm
58
4
0
07 Oct 2024
MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion
MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion
Junyi Zhang
Charles Herrmann
Junhwa Hur
Varun Jampani
Trevor Darrell
Forrester Cole
Deqing Sun
Ming Yang
VGen
81
69
0
04 Oct 2024
RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through
  Language Descriptions
RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions
Ziyao Zeng
Yangchao Wu
Hyoungseob Park
Daniel Wang
Fengyu Yang
Stefano Soatto
Dong Lao
Byung-Woo Hong
Alex Wong
MDE
16
7
0
03 Oct 2024
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Jing He
Haodong Li
Wei Yin
Yixun Liang
Leheng Li
Kaiqiang Zhou
Hongbo Zhang
Bingbing Liu
Ying-Cong Chen
DiffM
VLM
44
38
0
26 Sep 2024
FisheyeDepth: A Real Scale Self-Supervised Depth Estimation Model for Fisheye Camera
FisheyeDepth: A Real Scale Self-Supervised Depth Estimation Model for Fisheye Camera
Guoyang Zhao
Yuxuan Liu
Weiqing Qi
Fulong Ma
Ming Liu
Jun Ma
MDE
29
0
0
23 Sep 2024
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
Weifeng Lin
Xinyu Wei
Renrui Zhang
Le Zhuo
Shitian Zhao
...
Junlin Xie
Junlin Xie
Yu Qiao
Peng Gao
Hongsheng Li
MLLM
DiffM
50
10
0
23 Sep 2024
SteeredMarigold: Steering Diffusion Towards Depth Completion of Largely Incomplete Depth Maps
SteeredMarigold: Steering Diffusion Towards Depth Completion of Largely Incomplete Depth Maps
Jakub Gregorek
Lazaros Nalpantidis
3DGS
35
2
0
16 Sep 2024
EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation
EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation
Nischal Khanal
Shivanand Venkanna Sheshappanavar
MDE
29
0
0
10 Sep 2024
LM-Gaussian: Boost Sparse-view 3D Gaussian Splatting with Large Model
  Priors
LM-Gaussian: Boost Sparse-view 3D Gaussian Splatting with Large Model Priors
Hanyang Yu
Xiaoxiao Long
Ping Tan
3DGS
31
4
0
05 Sep 2024
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining
Dongyang Liu
Shitian Zhao
Le Zhuo
Weifeng Lin
Yu Qiao
Xinyue Li
Qi Qin
Yu Qiao
Hongsheng Li
Peng Gao
MLLM
62
48
0
05 Aug 2024
DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models
DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models
Chang-Han Yeh
Chin-Yang Lin
Zhixiang Wang
Chi-Wei Hsiao
Ting-Hsuan Chen
Hau-Shiang Shiu
Yu-Lun Liu
VGen
DiffM
54
5
0
01 Jul 2024
DICE: End-to-end Deformation Capture of Hand-Face Interactions from a Single Image
DICE: End-to-end Deformation Capture of Hand-Face Interactions from a Single Image
Qingxuan Wu
Zhiyang Dou
Sirui Xu
Soshi Shimada
Chen Wang
...
Taku Komura
Vladislav Golyanik
Christian Theobalt
Wenping Wang
Lingjie Liu
CVBM
3DH
66
4
0
26 Jun 2024
Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation
Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation
Mu Hu
Wei Yin
C. Zhang
Zhipeng Cai
Xiaoxiao Long
Kaixuan Wang
Kaixuan Wang
Gang Yu
Chunhua Shen
Shaojie Shen
3DGS
52
114
0
22 Mar 2024
Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image
Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image
Wei Yin
Chi Zhang
Hao Chen
Zhipeng Cai
Gang Yu
Kaixuan Wang
Xiaozhi Chen
Chunhua Shen
MDE
129
169
0
20 Jul 2023
Neural LiDAR Fields for Novel View Synthesis
Neural LiDAR Fields for Novel View Synthesis
S. Huang
Zan Gojcic
Zian Wang
Francis Williams
Yoni Kasten
Sanja Fidler
Konrad Schindler
Or Litany
70
45
0
02 May 2023
Visual-Language Prompt Tuning with Knowledge-guided Context Optimization
Visual-Language Prompt Tuning with Knowledge-guided Context Optimization
Hantao Yao
Rui Zhang
Changsheng Xu
VLM
VPVLM
122
193
0
23 Mar 2023
DiffusionDepth: Diffusion Denoising Approach for Monocular Depth
  Estimation
DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation
Yiqun Duan
Xianda Guo
Zhengbiao Zhu
DiffM
MDE
44
66
0
09 Mar 2023
Unleashing Text-to-Image Diffusion Models for Visual Perception
Unleashing Text-to-Image Diffusion Models for Visual Perception
Wenliang Zhao
Yongming Rao
Zuyan Liu
Benlin Liu
Jie Zhou
Jiwen Lu
ObjD
VLM
MDE
158
213
0
03 Mar 2023
Palette: Image-to-Image Diffusion Models
Palette: Image-to-Image Diffusion Models
Chitwan Saharia
William Chan
Huiwen Chang
Chris A. Lee
Jonathan Ho
Tim Salimans
David J. Fleet
Mohammad Norouzi
DiffM
VLM
325
1,570
0
10 Nov 2021
Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language
  Modeling
Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language Modeling
Renrui Zhang
Rongyao Fang
Wei Zhang
Peng Gao
Kunchang Li
Jifeng Dai
Yu Qiao
Hongsheng Li
VLM
184
384
0
06 Nov 2021
Deep Ordinal Regression Network for Monocular Depth Estimation
Deep Ordinal Regression Network for Monocular Depth Estimation
Huan Fu
Mingming Gong
Chaohui Wang
Kayhan Batmanghelich
Dacheng Tao
MDE
180
1,687
0
06 Jun 2018
1