Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2103.13413
Cited By
Vision Transformers for Dense Prediction
24 March 2021
René Ranftl
Alexey Bochkovskiy
V. Koltun
ViT
MDE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Vision Transformers for Dense Prediction"
50 / 982 papers shown
Title
DaViT: Dual Attention Vision Transformers
Mingyu Ding
Bin Xiao
Noel Codella
Ping Luo
Jingdong Wang
Lu Yuan
ViT
30
240
0
07 Apr 2022
Simple and Effective Synthesis of Indoor 3D Scenes
Jing Yu Koh
Harsh Agrawal
Dhruv Batra
Richard Tucker
Austin Waters
Honglak Lee
Yinfei Yang
Jason Baldridge
Peter Anderson
VGen
3DV
13
29
0
06 Apr 2022
Depth-Guided Sparse Structure-from-Motion for Movies and TV Shows
Shengqing Liu
Xiaohan Nie
Raffay Hamid
12
16
0
05 Apr 2022
P3Depth: Monocular Depth Estimation with a Piecewise Planarity Prior
Vaishakh Patil
Christos Sakaridis
Alexander Liniger
Luc Van Gool
MDE
19
128
0
05 Apr 2022
Monitoring social distancing with single image depth estimation
Alessio Mingozzi
Andrea Conti
Filippo Aleotti
Matteo Poggi
S. Mattoccia
14
2
0
04 Apr 2022
MultiMAE: Multi-modal Multi-task Masked Autoencoders
Roman Bachmann
David Mizrahi
Andrei Atanov
Amir Zamir
32
265
0
04 Apr 2022
BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation
Zhenyu Li
Xuyang Wang
Xianming Liu
Junjun Jiang
MDE
24
191
0
03 Apr 2022
SIMBAR: Single Image-Based Scene Relighting For Effective Data Augmentation For Automated Driving Vision Tasks
Xianling Zhang
Nathan Tseng
Ameerah Syed
Rohan Bhasin
Nikita Jaipuria
ViT
6
16
0
01 Apr 2022
Monitored Distillation for Positive Congruent Depth Completion
Tianlin Liu
Parth T. Agrawal
Allison Chen
Byung-Woo Hong
A. Wong
6
37
0
30 Mar 2022
LocalBins: Improving Depth Estimation by Learning Local Distributions
S. Bhat
Ibraheem Alhashim
Peter Wonka
MDE
20
99
0
28 Mar 2022
Learning Graph Regularisation for Guided Super-Resolution
Riccardo de Lutio
Alexander Becker
Stefano Dáronco
S. Russo
Jan Dirk Wegner
Konrad Schindler
SupR
13
36
0
27 Mar 2022
DepthFormer: Exploiting Long-Range Correlation and Local Information for Accurate Monocular Depth Estimation
Zhenyu Li
Zehui Chen
Xianming Liu
Junjun Jiang
ViT
MDE
36
183
1
27 Mar 2022
RSTT: Real-time Spatial Temporal Transformer for Space-Time Video Super-Resolution
Z. Geng
Luming Liang
Tianyu Ding
Ilya Zharkov
21
68
0
27 Mar 2022
Feature Selective Transformer for Semantic Image Segmentation
Fangjian Lin
Tianyi Wu
Sitong Wu
Sheng Tian
Guodong Guo
ViT
20
5
0
26 Mar 2022
Semantic Segmentation by Early Region Proxy
Yifan Zhang
Bo Pang
Cewu Lu
ViT
42
29
0
26 Mar 2022
On the Viability of Monocular Depth Pre-training for Semantic Segmentation
Dong Lao
Fengyu Yang
Daniel Wang
Hyoungseob Park
Samuel Lu
Alex Wong
Stefano Soatto
MDE
20
0
0
26 Mar 2022
Multi-scale and Cross-scale Contrastive Learning for Semantic Segmentation
Theodoros Pissas
Claudio S. Ravasio
L. Cruz
Christos Bergeles
15
15
0
25 Mar 2022
Transformers Meet Visual Learning Understanding: A Comprehensive Review
Yuting Yang
Licheng Jiao
Xuantong Liu
F. Liu
Shuyuan Yang
Zhixi Feng
Xu Tang
ViT
MedIm
24
28
0
24 Mar 2022
StructToken : Rethinking Semantic Segmentation with Structural Prior
Fangjian Lin
Zhanhao Liang
Miao Zheng
Junjun He
Kaibing Chen
Sheng Tian
15
48
0
23 Mar 2022
Voxel Set Transformer: A Set-to-Set Approach to 3D Object Detection from Point Clouds
Chenhang He
Ruihuang Li
Shuai Li
Lei Zhang
ViT
3DPC
22
163
0
19 Mar 2022
PanoFormer: Panorama Transformer for Indoor 360 Depth Estimation
Zhijie Shen
Chunyu Lin
K. Liao
Lang Nie
Zishuo Zheng
Yao Zhao
ViT
MDE
27
85
0
17 Mar 2022
Depth-aware Neural Style Transfer using Instance Normalization
E. Ioannou
S. Maddock
11
14
0
17 Mar 2022
Data Efficient 3D Learner via Knowledge Transferred from 2D Model
Ping Yu
Cheng Sun
Min Sun
3DPC
26
11
0
16 Mar 2022
Unsupervised Semantic Segmentation by Distilling Feature Correspondences
Mark Hamilton
Zhoutong Zhang
Bharath Hariharan
Noah Snavely
William T. Freeman
9
234
0
16 Mar 2022
From 2D to 3D: Re-thinking Benchmarking of Monocular Depth Prediction
Evin Pınar Örnek
Shristi Mudgal
Johanna Wald
Yida Wang
Nassir Navab
F. Tombari
3DV
20
20
0
15 Mar 2022
InvPT: Inverted Pyramid Multi-task Transformer for Dense Scene Understanding
Hanrong Ye
Dan Xu
ViT
19
84
0
15 Mar 2022
Enriched CNN-Transformer Feature Aggregation Networks for Super-Resolution
Jinsu Yoo
Taehoon Kim
Sihaeng Lee
Seunghyeon Kim
H. Lee
Tae Hyun Kim
SupR
ViT
31
51
0
15 Mar 2022
CAR: Class-aware Regularizations for Semantic Segmentation
Ye Huang
Di Kang
Liang Chen
Xuefei Zhe
W. Jia
Xiangjian He
Linchao Bao
25
16
0
14 Mar 2022
Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs
Xiaohan Ding
X. Zhang
Yi Zhou
Jungong Han
Guiguang Ding
Jian-jun Sun
VLM
47
528
0
13 Mar 2022
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers
Jiaming Zhang
Huayao Liu
Kailun Yang
Xinxin Hu
Ruiping Liu
Rainer Stiefelhagen
ViT
21
296
0
09 Mar 2022
ChiTransformer:Towards Reliable Stereo from Cues
Qing Su
Shihao Ji
MDE
ViT
16
12
0
09 Mar 2022
Lightweight Monocular Depth Estimation through Guided Decoding
M. Rudolph
Youssef Dawoud
Ronja Güldenring
Lazaros Nalpantidis
Vasileios Belagiannis
MDE
23
22
0
08 Mar 2022
RankSeg: Adaptive Pixel Classification with Image Category Ranking for Segmentation
Hao He
Yuhui Yuan
Xiangyu Yue
Han Hu
VOS
VLM
19
13
0
08 Mar 2022
Monocular Robot Navigation with Self-Supervised Pretrained Vision Transformers
Miguel A. Saavedra-Ruiz
Sacha Morin
Liam Paull
MDE
ViT
32
3
0
07 Mar 2022
ZippyPoint: Fast Interest Point Detection, Description, and Matching through Mixed Precision Discretization
Menelaos Kanakis
S. Maurer
Matteo Spallanzani
Ajad Chhatkuli
Luc Van Gool
3DPC
19
13
0
07 Mar 2022
Multi-class Token Transformer for Weakly Supervised Semantic Segmentation
Lian Xu
Wanli Ouyang
Bennamoun
F. Boussaïd
Dan Xu
ViT
25
209
0
06 Mar 2022
Learning Affinity from Attention: End-to-End Weakly-Supervised Semantic Segmentation with Transformers
Lixiang Ru
Yibing Zhan
Baosheng Yu
Bo Du
ViT
36
181
0
05 Mar 2022
Fast Neural Architecture Search for Lightweight Dense Prediction Networks
Lam Huynh
Esa Rahtu
Juan E. Sala Matas
J. Heikkilä
18
2
0
03 Mar 2022
NeW CRFs: Neural Window Fully-connected CRFs for Monocular Depth Estimation
Weihao Yuan
Xiaodong Gu
Zuozhuo Dai
Siyu Zhu
Ping Tan
31
174
0
03 Mar 2022
3D Common Corruptions and Data Augmentation
Oğuzhan Fatih Kar
Teresa Yeo
Andrei Atanov
Amir Zamir
3DPC
35
107
0
02 Mar 2022
OmniFusion: 360 Monocular Depth Estimation via Geometry-Aware Fusion
Yu-yang Li
Yuliang Guo
Zhixin Yan
Xinyu Huang
Ye Duan
Liu Ren
MDE
22
66
0
02 Mar 2022
3DCTN: 3D Convolution-Transformer Network for Point Cloud Classification
Dening Lu
Qian Xie
Linlin Xu
Jonathan Li
3DV
16
67
0
02 Mar 2022
TransKD: Transformer Knowledge Distillation for Efficient Semantic Segmentation
R. Liu
Kailun Yang
Alina Roitberg
Jiaming Zhang
Kunyu Peng
Huayao Liu
Yaonan Wang
Rainer Stiefelhagen
ViT
39
36
0
27 Feb 2022
Supervising Remote Sensing Change Detection Models with 3D Surface Semantics
Isaac Corley
Peyman Najafirad
3DPC
16
6
0
26 Feb 2022
Light Robust Monocular Depth Estimation For Outdoor Environment Via Monochrome And Color Camera Fusion
Hyeonsoo Jang
Yeongmin Ko
Younkwan Lee
M. Jeon
MDE
18
1
0
24 Feb 2022
HiP: Hierarchical Perceiver
João Carreira
Skanda Koppula
Daniel Zoran
Adrià Recasens
Catalin Ionescu
...
M. Botvinick
Oriol Vinyals
Karen Simonyan
Andrew Zisserman
Andrew Jaegle
VLM
28
14
0
22 Feb 2022
Modern Augmented Reality: Applications, Trends, and Future Directions
Shervin Minaee
Xiaodan Liang
Shuicheng Yan
20
25
0
18 Feb 2022
Spatio-Temporal Outdoor Lighting Aggregation on Image Sequences using Transformer Networks
Haebom Lee
Christian Homeyer
R. Herzog
J. Rexilius
Carsten Rother
ViT
14
4
0
18 Feb 2022
Joint Learning of Frequency and Spatial Domains for Dense Predictions
Shaocheng Jia
Wei-Ting Yao
25
0
0
18 Feb 2022
Depth-Cooperated Trimodal Network for Video Salient Object Detection
Yukang Lu
Dingyao Min
Keren Fu
Qijun Zhao
MDE
27
13
0
12 Feb 2022
Previous
1
2
3
...
17
18
19
20
Next