arXiv:2409.18124
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
26 September 2024
Jing He
Haodong Li
Wei Yin
Yixun Liang
Leheng Li
Kaiqiang Zhou
Hongbo Zhang
Bingbing Liu
Ying-Cong Chen
DiffM
VLM
Papers citing
"Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction"
26 / 26 papers shown
Title
The Fourth Monocular Depth Estimation Challenge
Anton Obukhov
Matteo Poggi
Fabio Tosi
Ripudaman Singh Arora
Jaime Spencer
...
Tuan-Anh Yang
Minh-Quang Nguyen
T. Tran
Albert Luginov
Muhammad Shahzad
MDE
40
0
0
24 Apr 2025
DiMeR: Disentangled Mesh Reconstruction Model
Lutao Jiang
Jiantao Lin
Kanghao Chen
Wenhang Ge
Xin Yang
Yifan Jiang
Y. Lyu
Xu Zheng
Yingcong Chen
3DV
64
0
0
24 Apr 2025
ePBR: Extended PBR Materials in Image Synthesis
Yu Guo
Zhiqiang Lao
Xiyun Song
Yubin Zhou
Zongfang Lin
Heather Yu
24
0
0
23 Apr 2025
NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors
Yanrui Bin
Wenbo Hu
Haoyuan Wang
Xinya Chen
Bing Wang
DiffM
45
0
0
15 Apr 2025
FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution
Gene Chou
Wenqi Xian
Guandao Yang
Mohamed Abdelfattah
Bharath Hariharan
Noah Snavely
Ning Yu
P. Debevec
MDE
27
0
0
09 Apr 2025
GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
Tian-Xing Xu
Xiangjun Gao
Wenbo Hu
Xiaoyu Li
Song-Hai Zhang
Ying Shan
VGen
MDE
56
1
0
01 Apr 2025
MVSAnywhere: Zero-Shot Multi-View Stereo
Sergio Izquierdo
Mohamed Sayed
Michael Firman
Guillermo Garcia-Hernando
Daniyar Turmukhambetov
Javier Civera
Oisin Mac Aodha
Gabriel J. Brostow
Jamie Watson
3DV
39
3
0
28 Mar 2025
MMGen: Unified Multi-modal Image Generation and Understanding in One Go
Jiepeng Wang
Zhaoqing Wang
H. Pan
Yuan Liu
Dongdong Yu
Changhu Wang
Wenping Wang
DiffM
76
0
0
26 Mar 2025
Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation
Jiyuan Wang
Chunyu Lin
Cheng Guan
Lang Nie
Jing He
Haodong Li
K. Liao
Yao Zhao
DiffM
MDE
61
0
0
20 Mar 2025
Seeing the Future, Perceiving the Future: A Unified Driving World Model for Future Generation and Perception
Dingkang Liang
Dingyuan Zhang
Xin Zhou
Sifan Tu
Tianrui Feng
Xiaofan Li
Yumeng Zhang
Mingyang Du
Xiao Tan
Xiang Bai
65
2
0
17 Mar 2025
ConsisLoRA: Enhancing Content and Style Consistency for LoRA-based Style Transfer
Bolin Chen
Baoquan Zhao
H. Xie
Yi Cai
Qing Li
Xudong Mao
DiffM
51
0
0
13 Mar 2025
VRMDiff: Text-Guided Video Referring Matting Generation of Diffusion
Lehan Yang
Jincen Song
Tianlong Wang
Daiqing Qi
Weili Shi
Yuheng Liu
Sheng Li
DiffM
VOS
VGen
69
0
0
11 Mar 2025
LBM: Latent Bridge Matching for Fast Image-to-Image Translation
Clement Chadebec
O. Tasar
Sanjeev Sreetharan
Benjamin Aubin
37
0
0
10 Mar 2025
Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation
Jiantao Lin
Xin Yang
Meixi Chen
Yingjie Xu
D. Yan
Leyi Wu
Xinli Xu
Lie Xu
Shunsi Zhang
Ying-Cong Chen
55
1
0
03 Mar 2025
MatSwap: Light-aware material transfers in images
Ivan Lopes
Valentin Deschaintre
Yannick Hold-Geoffroy
Raoul de Charette
DiffM
84
0
0
11 Feb 2025
Shape from Semantics: 3D Shape Generation from Multi-View Semantics
Liangchen Li
Caoliwen Wang
Yuqi Zhou
Bailin Deng
Juyong Zhang
3DV
37
0
0
01 Feb 2025
DINO-Foresight: Looking into the Future with DINO
Efstathios Karypidis
Ioannis Kakogeorgiou
Spyros Gidaris
N. Komodakis
AI4CE
79
1
0
16 Dec 2024
Prism: Semi-Supervised Multi-View Stereo with Monocular Structure Priors
Alex Rich
Noah Stier
P. Sen
Tobias Höllerer
MDE
72
0
0
08 Dec 2024
Align3R: Aligned Monocular Depth Estimation for Dynamic Videos
Jiahao Lu
Tianyu Huang
Peng Li
Zhiyang Dou
Cheng Lin
Zhiming Cui
Z. Dong
Sai-Kit Yeung
Wenping Wang
Yuan-Bin Liu
VGen
MDE
95
7
0
04 Dec 2024
FiffDepth: Feed-forward Transformation of Diffusion-Based Generators for Detailed Depth Estimation
Yunpeng Bai
Qixing Huang
DiffM
88
0
0
01 Dec 2024
SharpDepth: Sharpening Metric Depth Predictions Using Diffusion Distillation
Duc-Hai Pham
Tung Do
P. Nguyen
Binh-Son Hua
K. Nguyen
Rang Nguyen
MDE
74
1
0
27 Nov 2024
PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation
Ziyao Zeng
Jingcheng Ni
Daniel Wang
Patrick Rim
Younjoon Chung
Fengyu Yang
Byung-Woo Hong
A. Wong
DiffM
MDE
91
2
0
24 Nov 2024
C-DiffSET: Leveraging Latent Diffusion for SAR-to-EO Image Translation with Confidence-Guided Reliable Object Generation
Jeonghyeok Do
Jaehyup Lee
Munchurl Kim
DiffM
35
1
0
16 Nov 2024
TDSM: Triplet Diffusion for Skeleton-Text Matching in Zero-Shot Action Recognition
Jeonghyeok Do
Munchurl Kim
42
1
0
16 Nov 2024
LucidFusion: Reconstructing 3D Gaussians with Arbitrary Unposed Images
Hao He
Yixun Liang
Luozhou Wang
Yuanhao Cai
Xinli Xu
Hao-Xiang Guo
Xiang Wen
Yingcong Chen
3DGS
21
0
0
21 Oct 2024
DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized Image Generation
Jing He
Haodong Li
Yongzhe Hu
Guibao Shen
Yingjie Cai
Weichao Qiu
Ying-Cong Chen
DiffM
19
2
0
02 Oct 2024