2505.13061
3D Visual Illusion Depth Estimation
19 May 2025
Chengtang Yao
Zhidan Liu
Jiaxi Zeng
Lidong Yu
Yuwei Wu
Yunde Jia
MDE
Papers citing "3D Visual Illusion Depth Estimation"
36 / 36 papers shown
Title
VGGT: Visual Geometry Grounded Transformer
Jianyuan Wang
Minghao Chen
Nikita Karaev
Andrea Vedaldi
Christian Rupprecht
David Novotny
ViT
127
38
0
14 Mar 2025
SAM 2: Segment Anything in Images and Videos
Nikhila Ravi
Valentin Gabeur
Yuan-Ting Hu
Ronghang Hu
Chaitanya K. Ryali
...
Nicolas Carion
Chao-Yuan Wu
Ross B. Girshick
Piotr Dollár
Christoph Feichtenhofer
VLM
MLLM
170
943
0
01 Aug 2024
Grounding Image Matching in 3D with MASt3R
Vincent Leroy
Yohann Cabon
Jérôme Revaud
3DGS
3DV
128
164
0
14 Jun 2024
Depth Anything V2
Lihe Yang
Bingyi Kang
Zilong Huang
Zhen Zhao
Xiaogang Xu
Jiashi Feng
Hengshuang Zhao
DiffM
VLM
MDE
125
434
0
13 Jun 2024
MoCha-Stereo: Motif Channel Attention Network for Stereo Matching
Ziyang Chen
Wei Long
He Yao
Yongjun Zhang
Bingshu Wang
Yongbin Qin
Jia Wu
3DV
104
31
0
10 Apr 2024
InstantSplat: Sparse-view Gaussian Splatting in Seconds
Zhiwen Fan
Wenyan Cong
Kairun Wen
Kevin Wang
Jian Zhang
...
Boris Ivanovic
Marco Pavone
Georgios Pavlakos
Zhangyang Wang
Yue Wang
3DGS
131
1
0
29 Mar 2024
GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image
Xiao Fu
Wei Yin
Mu Hu
Kaixuan Wang
Yuexin Ma
Ping Tan
Shaojie Shen
Dahua Lin
Xiaoxiao Long
DiffM
128
124
0
18 Mar 2024
Neural Markov Random Field for Stereo Matching
Tongfan Guan
Chen Wang
Yunchun Liu
3DV
70
26
0
17 Mar 2024
Selective-Stereo: Adaptive Frequency Information Selection for Stereo Matching
Xianqi Wang
Gangwei Xu
Hao Jia
Xin Yang
89
50
0
01 Mar 2024
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Lihe Yang
Bingyi Kang
Zilong Huang
Xiaogang Xu
Jiashi Feng
Hengshuang Zhao
VLM
268
824
0
19 Jan 2024
DUSt3R: Geometric 3D Vision Made Easy
Shuzhe Wang
Vincent Leroy
Yohann Cabon
Boris Chidlovskii
Jérôme Revaud
3DGS
116
404
0
21 Dec 2023
Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
Bingxin Ke
Anton Obukhov
Shengyu Huang
Nando Metzger
Rodrigo Caye Daudt
Konrad Schindler
VLM
MDE
138
173
0
04 Dec 2023
Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond
Jinze Bai
Shuai Bai
Shusheng Yang
Shijie Wang
Sinan Tan
Peng Wang
Junyang Lin
Chang Zhou
Jingren Zhou
MLLM
VLM
ObjD
189
945
0
24 Aug 2023
3D Gaussian Splatting for Real-Time Radiance Field Rendering
Bernhard Kerbl
Georgios Kopanas
Thomas Leimkuehler
G. Drettakis
3DGS
288
3,840
0
08 Aug 2023
Uncertainty Guided Adaptive Warping for Robust and Efficient Stereo Matching
Junpeng Jing
Jiankun Li
Pengfei Xiong
Jiangyu Liu
Shuaicheng Liu
Yichen Guo
Xin Deng
Mai Xu
Lai Jiang
Leonid Sigal
3DV
94
24
0
26 Jul 2023
Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image
Wei Yin
Chi Zhang
Hao Chen
Zhipeng Cai
Gang Yu
Kaixuan Wang
Xiaozhi Chen
Chunhua Shen
MDE
212
197
0
20 Jul 2023
Vision-Language Models for Vision Tasks: A Survey
Jingyi Zhang
Jiaxing Huang
Sheng Jin
Shijian Lu
VLM
165
551
0
03 Apr 2023
Unleashing Text-to-Image Diffusion Models for Visual Perception
Wenliang Zhao
Yongming Rao
Zuyan Liu
Benlin Liu
Jie Zhou
Jiwen Lu
ObjD
VLM
MDE
249
233
0
03 Mar 2023
Scalable Diffusion Models with Transformers
William S. Peebles
Saining Xie
GNN
150
2,439
0
19 Dec 2022
CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical Flow
Philippe Weinzaepfel
Thomas Lucas
Vincent Leroy
Yohann Cabon
Vaibhav Arora
Romain Brégier
G. Csurka
L. Antsfeld
Boris Chidlovskii
Jérôme Revaud
ViT
127
97
0
18 Nov 2022
Context-Enhanced Stereo Transformer
Weiyu Guo
Zhaoshuo Li
Yongkui Yang
Ziyi Wang
Russell H. Taylor
Mathias Unberath
Alan Yuille
Yingwei Li
70
41
0
21 Oct 2022
CroCo: Self-Supervised Pre-training for 3D Vision Tasks by Cross-View Completion
Philippe Weinzaepfel
Vincent Leroy
Thomas Lucas
Romain Brégier
Yohann Cabon
Vaibhav Arora
L. Antsfeld
Boris Chidlovskii
G. Csurka
Jérôme Revaud
SSL
138
73
0
19 Oct 2022
Flow Matching for Generative Modeling
Y. Lipman
Ricky T. Q. Chen
Heli Ben-Hamu
Maximilian Nickel
Matt Le
OOD
225
1,391
0
06 Oct 2022
LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning
Yi-Lin Sung
Jaemin Cho
Joey Tianyi Zhou
VLM
99
246
0
13 Jun 2022
Open Challenges in Deep Stereo: the Booster Dataset
Pierluigi Zama Ramirez
Fabio Tosi
Matteo Poggi
Samuele Salti
S. Mattoccia
Luigi Di Stefano
3DV
MDE
80
34
0
09 Jun 2022
Practical Stereo Matching via Cascaded Recurrent Network with Adaptive Correlation
Jiankun Li
Peisen Wang
Pengfei Xiong
Tao Cai
Zi-Ping Yan
Lei Yang
Jiangyu Liu
Haoqiang Fan
Shuaicheng Liu
3DV
144
258
0
22 Mar 2022
RAFT-Stereo: Multilevel Recurrent Field Transforms for Stereo Matching
Lahav Lipson
Zachary Teed
Jia Deng
MDE
105
412
0
15 Sep 2021
Resolution-robust Large Mask Inpainting with Fourier Convolutions
Roman Suvorov
Elizaveta Logacheva
Anton Mashikhin
Anastasia Remizova
Arsenii Ashukha
Aleksei Silvestrov
Naejin Kong
Harshith Goka
Kiwoong Park
Victor Lempitsky
113
872
0
15 Sep 2021
Vision Transformers for Dense Prediction
René Ranftl
Alexey Bochkovskiy
V. Koltun
ViT
MDE
143
1,751
0
24 Mar 2021
AdaBins: Depth Estimation using Adaptive Bins
S. Bhat
Ibraheem Alhashim
Peter Wonka
3DV
MDE
ViT
173
862
0
28 Nov 2020
Revisiting Stereo Depth Estimation From a Sequence-to-Sequence Perspective with Transformers
Zhaoshuo Li
Xingtong Liu
Nathan G. Drenkow
Andy S Ding
Francis X. Creighton
Russell H. Taylor
Mathias Unberath
MDE
ViT
157
291
0
05 Nov 2020
Self-Supervised Monocular Depth Estimation: Solving the Dynamic Object Problem by Semantic Guidance
Marvin Klingner
Jan-Aike Termöhlen
Jonas Mikolajczyk
Tim Fingscheidt
MDE
148
325
0
14 Jul 2020
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
1.0K
42,651
0
28 May 2020
3D Packing for Self-Supervised Monocular Depth Estimation
Vitor Campagnolo Guizilini
Rares Andrei Ambrus
Sudeep Pillai
Allan Raventos
Adrien Gaidon
SSL
3DPC
MDE
126
650
0
06 May 2019
Parameter-Efficient Transfer Learning for NLP
N. Houlsby
A. Giurgiu
Stanislaw Jastrzebski
Bruna Morrone
Quentin de Laroussilhe
Andrea Gesmundo
Mona Attariyan
Sylvain Gelly
230
4,547
0
02 Feb 2019
A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation
N. Mayer
Eddy Ilg
Philip Häusser
Philipp Fischer
Daniel Cremers
Alexey Dosovitskiy
Thomas Brox
3DPC
100
2,652
0
07 Dec 2015