2103.13413
Cited By
Vision Transformers for Dense Prediction
24 March 2021
René Ranftl
Alexey Bochkovskiy
V. Koltun
ViT
MDE
Papers citing "Vision Transformers for Dense Prediction"
50 / 982 papers shown
Title
Leveraging the Third Dimension in Contrastive Learning
Sumukh K Aithal
Anirudh Goyal
Alex Lamb
Yoshua Bengio
Michael C. Mozer
MDE
29
0
0
27 Jan 2023
AI-Based Framework for Understanding Car Following Behaviors of Drivers in A Naturalistic Driving Environment
Armstrong Aboah
Abdul Rashid Mussah
Y. Adu-Gyamfi
22
4
0
23 Jan 2023
FG-Depth: Flow-Guided Unsupervised Monocular Depth Estimation
Junyu Zhu
Lina Liu
Yong Liu
Wanlong Li
Feng Wen
Hongbo Zhang
MDE
23
2
0
20 Jan 2023
Multiview Compressive Coding for 3D Reconstruction
Chaozheng Wu
Justin Johnson
Jitendra Malik
Christoph Feichtenhofer
Georgia Gkioxari
19
71
0
19 Jan 2023
Booster: a Benchmark for Depth from Images of Specular and Transparent Surfaces
Pierluigi Zama Ramirez
Alex Costanzino
Fabio Tosi
Matteo Poggi
Samuele Salti
S. Mattoccia
Luigi Di Stefano
MDE
19
21
0
19 Jan 2023
SwinDepth: Unsupervised Depth Estimation using Monocular Sequences via Swin Transformer and Densely Cascaded Network
D. Shim
H. J. Kim
ViT
MDE
13
20
0
17 Jan 2023
Scene-Aware 3D Multi-Human Motion Capture from a Single Camera
D. Luvizon
Marc Habermann
Vladislav Golyanik
Adam Kortylewski
Christian Theobalt
3DH
HAI
22
18
0
12 Jan 2023
ViTs for SITS: Vision Transformers for Satellite Image Time Series
Michail Tarasiou
Erik Chavez
S. Zafeiriou
ViT
11
48
0
12 Jan 2023
CARD: Semantic Segmentation with Efficient Class-Aware Regularized Decoder
Ye Huang
Di Kang
Liang Chen
W. Jia
Xiangjian He
Lixin Duan
Xuefei Zhe
Linchao Bao
34
2
0
11 Jan 2023
DeMT: Deformable Mixer Transformer for Multi-Task Learning of Dense Prediction
Yang Yang
Yibo Yang
L. Zhang
ViT
25
51
0
09 Jan 2023
Deep Planar Parallax for Monocular Depth Estimation
H. Liang
Zhichao Li
Y. Yang
Naiyan Wang
MDE
26
0
0
09 Jan 2023
A Study on the Generality of Neural Network Structures for Monocular Depth Estimation
Ji-Hoon Bae
K. Hwang
Sunghoon Im
MDE
24
7
0
09 Jan 2023
All in Tokens: Unifying Output Space of Visual Tasks via Soft Token
Jia Ning
Chen Li
Zheng-Wei Zhang
Zigang Geng
Qi Dai
Kun He
Han Hu
33
44
0
05 Jan 2023
TiG-BEV: Multi-view BEV 3D Object Detection via Target Inner-Geometry Learning
Pei-Kai Huang
L. Liu
Renrui Zhang
Song Zhang
Xin Xu
Bai-Qi Wang
G. Liu
3DPC
MDE
34
42
0
28 Dec 2022
Representation Separation for Semantic Segmentation with Vision Transformers
Yuanduo Hong
Huihui Pan
Weichao Sun
Xinghu Yu
Huijun Gao
ViT
28
5
0
28 Dec 2022
Shakes on a Plane: Unsupervised Depth Estimation from Unstabilized Photography
Ilya Chugunov
Yuxuan Zhang
Felix Heide
MDE
33
9
0
22 Dec 2022
GOOD: Exploring Geometric Cues for Detecting Objects in an Open World
Haiwen Huang
Andreas Geiger
Dan Zhang
VLM
ObjD
21
11
0
22 Dec 2022
DuAT: Dual-Aggregation Transformer Network for Medical Image Segmentation
Feilong Tang
Q. Huang
Jinfeng Wang
Xianxu Hou
Jionglong Su
Jingxin Liu
ViT
MedIm
27
49
0
21 Dec 2022
MaskingDepth: Masked Consistency Regularization for Semi-supervised Monocular Depth Estimation
Jongbeom Baek
Gyeongnyeon Kim
Seonghoon Park
Honggyu An
Matteo Poggi
Seung Wook Kim
MDE
29
0
0
21 Dec 2022
DAG: Depth-Aware Guidance with Denoising Diffusion Probabilistic Models
Gyeongnyeon Kim
Wooseok Jang
Gyuseong Lee
Susung Hong
Junyoung Seo
Seung Wook Kim
VLM
DiffM
32
11
0
17 Dec 2022
NoPe-NeRF: Optimising Neural Radiance Field with No Pose Prior
Wenjing Bian
Zirui Wang
Kejie Li
Jiawang Bian
V. Prisacariu
30
236
0
14 Dec 2022
EgoLoc: Revisiting 3D Object Localization from Egocentric Videos with Visual Queries
Jinjie Mai
Abdullah Hamdi
Silvio Giancola
Chen Zhao
Bernard Ghanem
EgoV
30
14
0
14 Dec 2022
Towards Practical Plug-and-Play Diffusion Models
Hyojun Go
Yunsung Lee
Jin-Young Kim
Seunghyun Lee
Myeongho Jeong
Hyun Seung Lee
Seungtaek Choi
DiffM
27
16
0
12 Dec 2022
ROIFormer: Semantic-Aware Region of Interest Transformer for Efficient Self-Supervised Monocular Depth Estimation
Daitao Xing
Jinglin Shen
C. Ho
Anthony Tzes
ViT
MDE
26
4
0
12 Dec 2022
Source-free Depth for Object Pop-out
Zongwei Wu
D. Paudel
Deng-Ping Fan
Jingjing Wang
Shuo Wang
C. Demonceaux
Radu Timofte
Luc Van Gool
35
44
0
10 Dec 2022
Mind The Edge: Refining Depth Edges in Sparsely-Supervised Monocular Depth Estimation
L. Talker
Aviad Cohen
E. Yosef
Alexandra Dana
Michael Dinerstein
32
6
0
10 Dec 2022
Monocular Camera and Single-Beam Sonar-Based Underwater Collision-Free Navigation with Domain Randomization
Pengzhi Yang
Haowen Liu
Monika Roznere
Alberto Quattrini Li
14
9
0
08 Dec 2022
MIME: Human-Aware 3D Scene Generation
Hongwei Yi
C. Huang
Shashank Tripathi
Lea Hering
Justus Thies
Michael J. Black
3DH
25
48
0
08 Dec 2022
Surround-view Fisheye BEV-Perception for Valet Parking: Dataset, Baseline and Distortion-insensitive Multi-task Framework
Zizhang Wu
Yuanzhu Gan
Xianzhi Li
Yunzhe Wu
Xiaoquan Wang
Tianhao Xu
Fan Wang
23
9
0
08 Dec 2022
NeRDi: Single-View NeRF Synthesis with Language-Guided Diffusion as General Image Priors
Congyue Deng
C. Jiang
C. Qi
Xinchen Yan
Yin Zhou
Leonidas J. Guibas
Drago Anguelov
DiffM
29
161
0
06 Dec 2022
Event-based Monocular Dense Depth Estimation with Recurrent Transformers
Xu Liu
Jianing Li
Xiaopeng Fan
Yonghong Tian
ViT
MDE
64
16
0
06 Dec 2022
Objects as Spatio-Temporal 2.5D points
Paridhi Singh
Gaurav Singh
Arun C. S. Kumar
3DPC
24
0
0
06 Dec 2022
Location-Aware Self-Supervised Transformers for Semantic Segmentation
Mathilde Caron
N. Houlsby
Cordelia Schmid
ViT
21
10
0
05 Dec 2022
Self-supervised AutoFlow
Hsin-Ping Huang
Charles Herrmann
Junhwa Hur
Erika Lu
Kyle Sargent
Austin Stone
Ming Yang
Deqing Sun
29
8
0
04 Dec 2022
Multi-resolution Monocular Depth Map Fusion by Self-supervised Gradient-based Composition
Yaqiao Dai
Renjiao Yi
Chenyang Zhu
Hongjun He
Kai Xu
MDE
14
5
0
03 Dec 2022
BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for BEV 3D Object Detection
Jianing Li
Ming Lu
Jiaming Liu
Yandong Guo
Li Du
Shanghang Zhang
40
6
0
01 Dec 2022
NeuralLift-360: Lifting An In-the-wild 2D Photo to A 3D Object with 360° Views
Dejia Xu
Yifan Jiang
Peihao Wang
Zhiwen Fan
Yi Wang
Zhangyang Wang
DiffM
35
143
0
29 Nov 2022
Leveraging Image Matching Toward End-to-End Relative Camera Pose Regression
Fadi Khatib
Yuval Margalit
Meirav Galun
Ronen Basri
25
2
0
27 Nov 2022
Lite-Mono: A Lightweight CNN and Transformer Architecture for Self-Supervised Monocular Depth Estimation
Ning Zhang
F. Nex
G. Vosselman
N. Kerle
MDE
36
153
0
23 Nov 2022
Event Transformer+. A multi-purpose solution for efficient event data processing
Alberto Sabater
Luis Montesano
Ana C. Murillo
ViT
31
8
0
22 Nov 2022
Hybrid Transformer Based Feature Fusion for Self-Supervised Monocular Depth Estimation
S. Tomar
Maitreya Suin
A. N. Rajagopalan
ViT
MDE
16
4
0
20 Nov 2022
A Practical Stereo Depth System for Smart Glasses
Jialiang Wang
D. Scharstein
Akash Bapat
Kevin Blackburn-Matzen
Matthew Yu
...
Jan-Michael Frahm
Zijian He
Peter Vajda
Michael F. Cohen
M. Uyttendaele
MDE
28
5
0
19 Nov 2022
CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical Flow
Philippe Weinzaepfel
Thomas Lucas
Vincent Leroy
Yohann Cabon
Vaibhav Arora
Romain Brégier
G. Csurka
L. Antsfeld
Boris Chidlovskii
Jérôme Revaud
ViT
20
81
0
18 Nov 2022
Estimating more camera poses for ego-centric videos is essential for VQ3D
Jinjie Mai
Chen Zhao
Abdullah Hamdi
Silvio Giancola
Bernard Ghanem
EgoV
14
4
0
18 Nov 2022
LightDepth: A Resource Efficient Depth Estimation Approach for Dealing with Ground Truth Sparsity via Curriculum Learning
Fatemeh Karimi
Amir Mehrpanah
Reza Rawassizadeh
17
1
0
16 Nov 2022
YORO -- Lightweight End to End Visual Grounding
Chih-Hui Ho
Srikar Appalaraju
Bhavan A. Jasani
R. Manmatha
Nuno Vasconcelos
ObjD
21
21
0
15 Nov 2022
3D Scene Inference from Transient Histograms
Sacha Jungerman
Atul Ingle
Yin Li
Mohit Gupta
16
6
0
09 Nov 2022
3DFill:Reference-guided Image Inpainting by Self-supervised 3D Image Alignment
Liang Zhao
Xinyuan Zhao
Hailong Ma
Xinyu Zhang
Long Zeng
33
3
0
09 Nov 2022
Realistic Bokeh Effect Rendering on Mobile GPUs, Mobile AI & AIM 2022 challenge: Report
Andrey D. Ignatov
Radu Timofte
Jin Zhang
Feng Zhang
G. Yu
...
Mingyang Qian
Huixin Ma
Yanan Li
Xiaotao Wang
Lei Lei
15
10
0
07 Nov 2022
SC-DepthV3: Robust Self-supervised Monocular Depth Estimation for Dynamic Scenes
Libo Sun
Jiawang Bian
Huangying Zhan
Wei Yin
Ian Reid
Chunhua Shen
MDE
21
55
0
07 Nov 2022