Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2103.13413
Cited By
Vision Transformers for Dense Prediction
IEEE International Conference on Computer Vision (ICCV), 2021
24 March 2021
René Ranftl
Alexey Bochkovskiy
V. Koltun
ViT
MDE
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Github (2138★)
Papers citing
"Vision Transformers for Dense Prediction"
50 / 1,224 papers shown
Monocular Visual-Inertial Depth Estimation
IEEE International Conference on Robotics and Automation (ICRA), 2023
Diana Wofk
René Ranftl
Matthias Muller
V. Koltun
MDE
151
23
0
21 Mar 2023
Zero-1-to-3: Zero-shot One Image to 3D Object
IEEE International Conference on Computer Vision (ICCV), 2023
Ruoshi Liu
Rundi Wu
Basile Van Hoorick
P. Tokmakov
Sergey Zakharov
Carl Vondrick
DiffM
401
1,512
0
20 Mar 2023
Versatile Depth Estimator Based on Common Relative Depth Estimation and Camera-Specific Relative-to-Metric Depth Conversion
Journal of Visual Communication and Image Representation (JVCIR), 2023
Jinyoung Jun
Jae-Han Lee
Chang-Su Kim
MDE
190
1
0
20 Mar 2023
Boosting Weakly Supervised Object Detection using Fusion and Priors from Hallucinated Depth
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Cagri Gungor
Adriana Kovashka
MDE
281
4
0
20 Mar 2023
MECPformer: Multi-estimations Complementary Patch with CNN-Transformers for Weakly Supervised Semantic Segmentation
Chunmeng Liu
Guang-pu Li
Yao Shen
Ruiqi Wang
ViT
363
7
0
19 Mar 2023
Just Noticeable Visual Redundancy Forecasting: A Deep Multimodal-driven Approach
AAAI Conference on Artificial Intelligence (AAAI), 2023
Wuyuan Xie
Shukang Wang
Sukun Tian
Lirong Huang
Ye Liu
Miaohui Wang
89
4
0
18 Mar 2023
Local-to-Global Panorama Inpainting for Locale-Aware Indoor Lighting Prediction
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2023
Jia-Xuan Bai
Zhen He
Shangxue Yang
Jie Guo
Zhenyu Chen
Yan Zhang
Yanwen Guo
178
11
0
18 Mar 2023
Single-view Neural Radiance Fields with Depth Teacher
Yurui Chen
Chun Gu
Feihu Zhang
Li Zhang
168
2
0
17 Mar 2023
Exploring Sparse Visual Prompt for Domain Adaptive Dense Prediction
AAAI Conference on Artificial Intelligence (AAAI), 2023
Senqiao Yang
Jiarui Wu
Jiaming Liu
Xiaoqi Li
Qizhe Zhang
Mingjie Pan
Yulu Gan
Zehui Chen
Shanghang Zhang
MDE
VLM
302
33
0
17 Mar 2023
Efficient Computation Sharing for Multi-Task Visual Scene Understanding
IEEE International Conference on Computer Vision (ICCV), 2023
Sara Shoouri
Mingyu Yang
Zichen Fan
Hun-Seok Kim
MoE
269
8
0
16 Mar 2023
Unifying Top-down and Bottom-up Scanpath Prediction Using Transformers
Computer Vision and Pattern Recognition (CVPR), 2023
Zhibo Yang
Sounak Mondal
Seoyoung Ahn
Ruoyu Xue
G. Zelinsky
Minh Hoai
Dimitris Samaras
270
24
0
16 Mar 2023
Large Selective Kernel Network for Remote Sensing Object Detection
IEEE International Conference on Computer Vision (ICCV), 2023
Yuxuan Li
Qibin Hou
Zhaohui Zheng
Mingmei Cheng
Jian Yang
Xiang Li
ObjD
328
465
0
16 Mar 2023
High-level Feature Guided Decoding for Semantic Segmentation
Ye Huang
Di Kang
Shenghua Gao
Wen Li
Lixin Duan
394
6
0
15 Mar 2023
Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation
International Conference on Learning Representations (ICLR), 2023
Junyoung Seo
Wooseok Jang
Minseop Kwak
Ines Hyeonsu Kim
Jaehoon Ko
J. Kim
Jin-Hwa Kim
Jiyoung Lee
Seung Wook Kim
DiffM
397
155
0
14 Mar 2023
Adjacent-view Transformers for Supervised Surround-view Depth Estimation
Xianda Guo
Wenjie Yuan
Yunpeng Zhang
Tian Yang
Chenming Zhang
Zhengbiao Zhu
Long Chen
Long Chen
MDE
343
6
0
14 Mar 2023
Pretrained ViTs Yield Versatile Representations For Medical Images
Christos Matsoukas
Johan Fredin Haslum
Magnus P Soderberg
Kevin Smith
MedIm
ViT
288
16
0
13 Mar 2023
Token Sparsification for Faster Medical Image Segmentation
Information Processing in Medical Imaging (IPMI), 2023
Lei Zhou
Huidong Liu
Joseph Bae
Junjun He
Dimitris Samaras
Prateek Prasanna
MedIm
178
5
0
11 Mar 2023
3D Cinemagraphy from a Single Image
Computer Vision and Pattern Recognition (CVPR), 2023
Xingyi Li
Z. Cao
Huiqiang Sun
Jianming Zhang
Ke Xian
Guo-Shing Lin
3DV
VGen
178
29
0
10 Mar 2023
Lifelong-MonoDepth: Lifelong Learning for Multi-Domain Monocular Metric Depth Estimation
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Junjie Hu
Chenyou Fan
Liguang Zhou
Qing Gao
Honghai Liu
Tin Lun Lam
297
7
0
09 Mar 2023
DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation
European Conference on Computer Vision (ECCV), 2023
Yiqun Duan
Xianda Guo
Zhengbiao Zhu
DiffM
MDE
379
99
0
09 Mar 2023
Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models
Chenfei Wu
Sheng-Kai Yin
Weizhen Qi
Xiaodong Wang
Zecheng Tang
Nan Duan
MLLM
LRM
359
771
0
08 Mar 2023
Weakly Supervised Caveline Detection For AUV Navigation Inside Underwater Caves
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Boxiao Yu
R. Tibbetts
T. Barua
Ailani Morales
Ioannis M. Rekleitis
M. Islam
212
10
0
07 Mar 2023
DwinFormer: Dual Window Transformers for End-to-End Monocular Depth Estimation
IEEE Sensors Journal (IEEE Sens. J.), 2023
Md Awsafur Rahman
S. Fattah
ViT
MDE
217
7
0
06 Mar 2023
Prismer: A Vision-Language Model with Multi-Task Experts
Shikun Liu
Linxi Fan
Edward Johns
Zhiding Yu
Chaowei Xiao
Anima Anandkumar
VLM
MLLM
325
33
0
04 Mar 2023
Unleashing Text-to-Image Diffusion Models for Visual Perception
IEEE International Conference on Computer Vision (ICCV), 2023
Wenliang Zhao
Yongming Rao
Zuyan Liu
Benlin Liu
Jie Zhou
Jiwen Lu
ObjD
VLM
MDE
1.0K
301
0
03 Mar 2023
Monocular Depth Estimation using Diffusion Models
Saurabh Saxena
Abhishek Kar
Mohammad Norouzi
David J. Fleet
DiffM
VLM
MDE
254
105
0
28 Feb 2023
Autonomous Intelligent Navigation for Flexible Endoscopy Using Monocular Depth Guidance and 3-D Shape Planning
IEEE International Conference on Robotics and Automation (ICRA), 2023
Yiang Lu
Ruofeng Wei
Bin Li
Wei Chen
Jianshu Zhou
Qingxu Dou
Dong Sun
Yunhui Liu
124
15
0
26 Feb 2023
Point Cloud Forecasting as a Proxy for 4D Occupancy Forecasting
Computer Vision and Pattern Recognition (CVPR), 2023
Tarasha Khurana
Peiyun Hu
David Held
Deva Ramanan
3DPC
418
78
0
25 Feb 2023
ZoeDepth: Zero-shot Transfer by Combining Relative and Metric Depth
S. Bhat
R. Birkl
Diana Wofk
Peter Wonka
Matthias Müller
VLM
MDE
472
774
0
23 Feb 2023
Bokeh Rendering Based on Adaptive Depth Calibration Network
International Conference on Communication, Computing & Security (ICCCS), 2023
Lu Liu
Lei Zhou
Yuhan Dong
MDE
110
4
0
21 Feb 2023
Self-Supervised Monocular Depth Estimation with Self-Reference Distillation and Disparity Offset Refinement
Zhong Liu
Ran Li
Shuwei Shao
Xingming Wu
Weihai Chen
MDE
248
36
0
20 Feb 2023
Deep Learning for Event-based Vision: A Comprehensive Survey and Benchmarks
Xueye Zheng
Yexin Liu
Yunfan Lu
Tongyan Hua
Tianbo Pan
Weiming Zhang
Dacheng Tao
Lin Wang
AI4TS
BDL
3DV
327
119
0
17 Feb 2023
URCDC-Depth: Uncertainty Rectified Cross-Distillation with CutFlip for Monocular Depth Estimation
IEEE transactions on multimedia (IEEE TMM), 2023
Shuwei Shao
Z. Pei
Weihai Chen
Ran Li
Zhong Liu
Zhengguo Li
ViT
UQCV
299
45
0
16 Feb 2023
VQ3D: Learning a 3D-Aware Generative Model on ImageNet
IEEE International Conference on Computer Vision (ICCV), 2023
Kyle Sargent
Jing Yu Koh
Han Zhang
Huiwen Chang
Charles Herrmann
Pratul P. Srinivasan
Jiajun Wu
Deqing Sun
223
32
0
14 Feb 2023
VA-DepthNet: A Variational Approach to Single Image Depth Prediction
International Conference on Learning Representations (ICLR), 2023
Ce Liu
Suryansh Kumar
Shuhang Gu
Radu Timofte
Luc Van Gool
MDE
303
73
0
13 Feb 2023
Semantic Image Segmentation: Two Decades of Research
Foundations and Trends in Computer Graphics and Vision (FTCGV), 2023
G. Csurka
Riccardo Volpi
Boris Chidlovskii
3DV
273
77
0
13 Feb 2023
Scaling Vision Transformers to 22 Billion Parameters
International Conference on Machine Learning (ICML), 2023
Mostafa Dehghani
Josip Djolonga
Basil Mustafa
Piotr Padlewski
Jonathan Heek
...
Mario Luvcić
Xiaohua Zhai
Daniel Keysers
Jeremiah Harmsen
N. Houlsby
MLLM
409
779
0
10 Feb 2023
Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames
International Conference on Machine Learning (ICML), 2023
Ondrej Biza
Sjoerd van Steenkiste
Mehdi S. M. Sajjadi
Gamaleldin F. Elsayed
Aravindh Mahendran
Thomas Kipf
OCL
353
50
0
09 Feb 2023
Semantic Diffusion Network for Semantic Segmentation
Neural Information Processing Systems (NeurIPS), 2023
Hao Hao Tan
Sitong Wu
Jimin Pi
DiffM
205
51
0
04 Feb 2023
TEXTure: Text-Guided Texturing of 3D Shapes
International Conference on Computer Graphics and Interactive Techniques (SIGGRAPH), 2023
Elad Richardson
G. Metzer
Yuval Alaluf
Raja Giryes
Daniel Cohen-Or
DiffM
306
334
0
03 Feb 2023
SceneScape: Text-Driven Consistent Scene Generation
Neural Information Processing Systems (NeurIPS), 2023
Rafail Fridman
Amit Abecasis
Yoni Kasten
Tali Dekel
VGen
430
160
0
02 Feb 2023
Multi-modal Large Language Model Enhanced Pseudo 3D Perception Framework for Visual Commonsense Reasoning
Jian Zhu
Hanli Wang
Miaojing Shi
LRM
255
4
0
30 Jan 2023
HDPV-SLAM: Hybrid Depth-augmented Panoramic Visual SLAM for Mobile Mapping System with Tilted LiDAR and Panoramic Visual Camera
Mostafa Ahmadi
A. A. Naeini
M. Sheikholeslami
Z. Arjmandi
Yujia Zhang
Gunho Sohn
MDE
315
5
0
27 Jan 2023
Leveraging the Third Dimension in Contrastive Learning
Sumukh K Aithal
Anirudh Goyal
Alex Lamb
Yoshua Bengio
Michael C. Mozer
MDE
207
0
0
27 Jan 2023
AI-Based Framework for Understanding Car Following Behaviors of Drivers in A Naturalistic Driving Environment
Armstrong Aboah
Abdul Rashid Mussah
Y. Adu-Gyamfi
203
5
0
23 Jan 2023
FG-Depth: Flow-Guided Unsupervised Monocular Depth Estimation
IEEE International Conference on Robotics and Automation (ICRA), 2023
Junyu Zhu
Lina Liu
Yong Liu
Wanlong Li
Feng Wen
Hongbo Zhang
MDE
144
5
0
20 Jan 2023
Multiview Compressive Coding for 3D Reconstruction
Computer Vision and Pattern Recognition (CVPR), 2023
Chaozheng Wu
Justin Johnson
Jitendra Malik
Christoph Feichtenhofer
Georgia Gkioxari
286
91
0
19 Jan 2023
Booster: a Benchmark for Depth from Images of Specular and Transparent Surfaces
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Pierluigi Zama Ramirez
Alex Costanzino
Fabio Tosi
Matteo Poggi
Samuele Salti
S. Mattoccia
Luigi Di Stefano
MDE
223
35
0
19 Jan 2023
SwinDepth: Unsupervised Depth Estimation using Monocular Sequences via Swin Transformer and Densely Cascaded Network
IEEE International Conference on Robotics and Automation (ICRA), 2023
D. Shim
H. J. Kim
ViT
MDE
271
35
0
17 Jan 2023
Scene-Aware 3D Multi-Human Motion Capture from a Single Camera
D. Luvizon
Marc Habermann
Vladislav Golyanik
Adam Kortylewski
Christian Theobalt
3DH
HAI
347
24
0
12 Jan 2023
Previous
1
2
3
...
18
19
20
...
23
24
25
Next
Page 19 of 25
Page
of 25
Go