Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2103.13413
Cited By
Vision Transformers for Dense Prediction
24 March 2021
René Ranftl
Alexey Bochkovskiy
V. Koltun
ViT
MDE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Vision Transformers for Dense Prediction"
50 / 982 papers shown
Title
Automated Distance Estimation for Wildlife Camera Trapping
Peter Johanns
T. Haucke
Volker Steinhage
15
17
0
09 Feb 2022
Transformers in Self-Supervised Monocular Depth Estimation with Unknown Camera Intrinsics
Arnav Varma
Hemang Chawla
Bahram Zonooz
Elahe Arani
ViT
MDE
31
49
0
07 Feb 2022
GLPanoDepth: Global-to-Local Panoramic Depth Estimation
Jia-Chi Bai
Shuichang Lai
Haoyu Qin
Jie Guo
Yanwen Guo
ViT
MDE
61
21
0
06 Feb 2022
StandardSim: A Synthetic Dataset For Retail Environments
Cristina Mata
Nick Locascio
Mohammed Azeem Sheikh
Kenny Kihara
Daniel L. Fischetti
16
9
0
04 Feb 2022
Towards 3D Scene Reconstruction from Locally Scale-Aligned Monocular Video Depth
Guangkai Xu
Wei Yin
Hao Chen
Chunhua Shen
Kai-Sheng Cheng
Fengyu Wu
Fengshang Zhao
MDE
19
9
0
03 Feb 2022
AtmoDist: Self-supervised Representation Learning for Atmospheric Dynamics
Sebastian Hoffmann
C. Lessig
AI4Cl
24
8
0
02 Feb 2022
A Comprehensive Study of Vision Transformers on Dense Prediction Tasks
Kishaan Jeeveswaran
Senthilkumar S. Kathiresan
Arnav Varma
Omar Magdy
Bahram Zonooz
Elahe Arani
ViT
17
10
0
21 Jan 2022
Multi-view Monocular Depth and Uncertainty Prediction with Deep SfM in Dynamic Environments
Christian Homeyer
Oliver Lange
Christoph Schnörr
MDE
13
3
0
21 Jan 2022
GeoFill: Reference-Based Image Inpainting with Better Geometric Understanding
Yunhan Zhao
Connelly Barnes
Yuqian Zhou
Eli Shechtman
Sohrab Amirghodsi
Charless C. Fowlkes
17
8
0
20 Jan 2022
Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth
Doyeon Kim
Woonghyun Ka
Pyunghwan Ahn
Donggyu Joo
S. Chun
Junmo Kim
MDE
11
124
0
19 Jan 2022
SwinUNet3D -- A Hierarchical Architecture for Deep Traffic Prediction using Shifted Window Transformers
Alabi Bojesomo
Hasan Al Marzouqi
P. Liatsis
ViT
26
6
0
17 Jan 2022
Domain Adaptation via Bidirectional Cross-Attention Transformer
Xiyu Wang
Pengxin Guo
Yu Zhang
ViT
14
19
0
15 Jan 2022
A Survey on RGB-D Datasets
Alexandre Lopes
Roberto Souza
Hélio Pedrini
3DV
MDE
24
33
0
15 Jan 2022
Language-driven Semantic Segmentation
Boyi Li
Kilian Q. Weinberger
Serge J. Belongie
V. Koltun
René Ranftl
VLM
43
600
0
10 Jan 2022
QuadTree Attention for Vision Transformers
Shitao Tang
Jiahui Zhang
Siyu Zhu
Ping Tan
ViT
157
156
0
08 Jan 2022
THE Benchmark: Transferable Representation Learning for Monocular Height Estimation
Zhitong Xiong
Wei Huang
Jingtao Hu
Xiao Xiang Zhu
11
19
0
30 Dec 2021
Learning Generative Vision Transformer with Energy-Based Latent Space for Saliency Prediction
Jing Zhang
Jianwen Xie
Nick Barnes
Ping Li
ViT
35
90
0
27 Dec 2021
Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry
Gwangbin Bae
Ignas Budvytis
R. Cipolla
3DV
4
59
0
15 Dec 2021
E-CRF: Embedded Conditional Random Field for Boundary-caused Class Weights Confusion in Semantic Segmentation
Jie Zhu
Huabin Huang
Banghuai Li
Leye Wang
25
14
0
14 Dec 2021
Stereoscopic Universal Perturbations across Different Architectures and Datasets
Z. Berger
Parth T. Agrawal
Tianlin Liu
Stefano Soatto
A. Wong
AAML
19
19
0
12 Dec 2021
DualFormer: Local-Global Stratified Transformer for Efficient Video Recognition
Yuxuan Liang
Pan Zhou
Roger Zimmermann
Shuicheng Yan
ViT
21
21
0
09 Dec 2021
Unsupervised Domain Adaptation for Semantic Image Segmentation: a Comprehensive Survey
G. Csurka
Riccardo Volpi
Boris Chidlovskii
OOD
VLM
3DV
63
40
0
06 Dec 2021
GETAM: Gradient-weighted Element-wise Transformer Attention Map for Weakly-supervised Semantic segmentation
Weixuan Sun
Jing Zhang
Zheyuan Liu
Yiran Zhong
Nick Barnes
ViT
58
14
0
06 Dec 2021
PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic Segmentation
Haobo Yuan
Xiangtai Li
Yibo Yang
Guangliang Cheng
Jing Zhang
Yunhai Tong
Lefei Zhang
Dacheng Tao
MDE
33
42
0
05 Dec 2021
Toward Practical Monocular Indoor Depth Estimation
Cho-Ying Wu
Jialiang Wang
Michael Hall
Ulrich Neumann
Shuochen Su
3DV
MDE
43
62
0
04 Dec 2021
Machine Learning Subsystem for Autonomous Collision Avoidance on a small UAS with Embedded GPU
Nicholas Polosky
Tyler Gwin
Sean Furman
Parth Barhanpurkar
Jithin Jagannath
17
6
0
03 Dec 2021
Object-aware Monocular Depth Prediction with Instance Convolutions
Enis Simsar
Evin Pınar Örnek
Fabian Manhardt
Helisa Dhamo
Nassir Navab
F. Tombari
3DH
MDE
13
1
0
02 Dec 2021
3D Photo Stylization: Learning to Generate Stylized Novel Views from a Single Image
Fangzhou Mu
Jian Wang
Yichen Wu
Yin Li
DiffM
3DH
17
46
0
30 Nov 2021
360MonoDepth: High-Resolution 360° Monocular Depth Estimation
M. Rey-Area
Mingze Yuan
Christian Richardt
MDE
13
71
0
30 Nov 2021
AdaViT: Adaptive Vision Transformers for Efficient Image Recognition
Lingchen Meng
Hengduo Li
Bor-Chun Chen
Shiyi Lan
Zuxuan Wu
Yu-Gang Jiang
Ser-Nam Lim
ViT
23
219
0
30 Nov 2021
PlantStereo: A Stereo Matching Benchmark for Plant Surface Dense Reconstruction
Qingyun Wang
Baojian Ma
Wei Liu
Ming Lou
Mingchuan Zhou
Huanyu Jiang
Y. Ying
3DV
9
0
0
30 Nov 2021
Pyramid Adversarial Training Improves ViT Performance
Charles Herrmann
Kyle Sargent
Lu Jiang
Ramin Zabih
Huiwen Chang
Ce Liu
Dilip Krishnan
Deqing Sun
ViT
18
56
0
30 Nov 2021
TransMVSNet: Global Context-aware Multi-view Stereo Network with Transformers
Yikang Ding
Wentao Yuan
Qingtian Zhu
Haotian Zhang
Xiangyue Liu
Yuanjiang Wang
Xiao Liu
ViT
21
178
0
29 Nov 2021
The Implicit Values of A Good Hand Shake: Handheld Multi-Frame Neural Depth Refinement
Ilya Chugunov
Yuxuan Zhang
Zhihao Xia
Xuaner
Cecilia Zhang
Jiawen Chen
Felix Heide
3DH
MDE
22
14
0
26 Nov 2021
SWAT: Spatial Structure Within and Among Tokens
Kumara Kahatapitiya
Michael S. Ryoo
23
6
0
26 Nov 2021
Scene Representation Transformer: Geometry-Free Novel View Synthesis Through Set-Latent Scene Representations
Mehdi S. M. Sajjadi
H. Meyer
Etienne Pot
Urs M. Bergmann
Klaus Greff
...
Daniel Duckworth
Alexey Dosovitskiy
Jakob Uszkoreit
Thomas Funkhouser
Andrea Tagliasacchi
ViT
35
184
0
25 Nov 2021
Cerberus Transformer: Joint Semantic, Affordance and Attribute Parsing
Xiaoxue Chen
Tianyu Liu
Hao Zhao
Guyue Zhou
Ya-Qin Zhang
21
22
0
24 Nov 2021
Distortion Reduction for Off-Center Perspective Projection of Panoramas
Chi-Han Peng
Jiayao Zhang
MDE
11
0
0
23 Nov 2021
Monocular Road Planar Parallax Estimation
Haobo Yuan
Teng Chen
Wei Sui
Jiafeng Xie
Lefei Zhang
Yuan Li
Qian Zhang
10
4
0
22 Nov 2021
Topological Regularization for Dense Prediction
Deqing Fu
Bradley J. Nelson
MDE
14
0
0
22 Nov 2021
Towards Comprehensive Monocular Depth Estimation: Multiple Heads Are Better Than One
Shuwei Shao
Ran Li
Z. Pei
Zhong Liu
Weihai Chen
Wentao Zhu
Xingming Wu
Baochang Zhang
ViT
MDE
23
11
0
16 Nov 2021
Beyond Mono to Binaural: Generating Binaural Audio from Mono Audio with Depth and Cross Modal Attention
Kranti K. Parida
Siddharth Srivastava
Gaurav Sharma
MDE
31
20
0
15 Nov 2021
Online Mutual Adaptation of Deep Depth Prediction and Visual SLAM
S. Loo
M. Shakeri
S. Tang
S. Mashohor
Hong Zhang
MDE
14
6
0
07 Nov 2021
Body Size and Depth Disambiguation in Multi-Person Reconstruction from Single Images
Nicolas Ugrinovic
Adria Ruiz
Antonio Agudo
Alberto Sanfeliu
Francesc Moreno-Noguer
3DH
13
12
0
02 Nov 2021
Transformers for prompt-level EMA non-response prediction
Supriya Nagesh
Alexander Moreno
Stephanie M Carpenter
Jamie Yap
Soujanya Chatterjee
...
Santosh Kumar
Cho Lam
D. Wetter
Inbal Nahum-Shani
James M. Rehg
12
0
0
01 Nov 2021
HRFormer: High-Resolution Transformer for Dense Prediction
Yuhui Yuan
Rao Fu
Lang Huang
Weihong Lin
Chao Zhang
Xilin Chen
Jingdong Wang
ViT
24
226
0
18 Oct 2021
Learning multiplane images from single views with self-supervision
Gustavo Sutter P. Carvalho
D. Luvizon
Antonio Joia Neto
André G. C. Pacheco
O. A. B. Penatti
SSL
30
1
0
18 Oct 2021
Ego4D: Around the World in 3,000 Hours of Egocentric Video
Kristen Grauman
Andrew Westbury
Eugene Byrne
Zachary Chavis
Antonino Furnari
...
Mike Zheng Shou
Antonio Torralba
Lorenzo Torresani
Mingfei Yan
Jitendra Malik
EgoV
224
1,018
0
13 Oct 2021
Dense Uncertainty Estimation
Jing Zhang
Yuchao Dai
Mochu Xiang
Deng-Ping Fan
Peyman Moghadam
Mingyi He
Christian J. Walder
Kaihao Zhang
Mehrtash Harandi
Nick Barnes
UQCV
BDL
52
10
0
13 Oct 2021
Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets from 3D Scans
Ainaz Eftekhar
Alexander Sax
Roman Bachmann
Jitendra Malik
Amir Zamir
MedIm
33
288
0
11 Oct 2021
Previous
1
2
3
...
18
19
20
Next