ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.13413
  4. Cited By
Vision Transformers for Dense Prediction

Vision Transformers for Dense Prediction

24 March 2021
René Ranftl
Alexey Bochkovskiy
V. Koltun
    ViT
    MDE
ArXivPDFHTML

Papers citing "Vision Transformers for Dense Prediction"

50 / 982 papers shown
Title
Leveraging the Third Dimension in Contrastive Learning
Leveraging the Third Dimension in Contrastive Learning
Sumukh K Aithal
Anirudh Goyal
Alex Lamb
Yoshua Bengio
Michael C. Mozer
MDE
29
0
0
27 Jan 2023
AI-Based Framework for Understanding Car Following Behaviors of Drivers
  in A Naturalistic Driving Environment
AI-Based Framework for Understanding Car Following Behaviors of Drivers in A Naturalistic Driving Environment
Armstrong Aboah
Abdul Rashid Mussah
Y. Adu-Gyamfi
22
4
0
23 Jan 2023
FG-Depth: Flow-Guided Unsupervised Monocular Depth Estimation
FG-Depth: Flow-Guided Unsupervised Monocular Depth Estimation
Junyu Zhu
Lina Liu
Yong Liu
Wanlong Li
Feng Wen
Hongbo Zhang
MDE
23
2
0
20 Jan 2023
Multiview Compressive Coding for 3D Reconstruction
Multiview Compressive Coding for 3D Reconstruction
Chaozheng Wu
Justin Johnson
Jitendra Malik
Christoph Feichtenhofer
Georgia Gkioxari
19
71
0
19 Jan 2023
Booster: a Benchmark for Depth from Images of Specular and Transparent
  Surfaces
Booster: a Benchmark for Depth from Images of Specular and Transparent Surfaces
Pierluigi Zama Ramirez
Alex Costanzino
Fabio Tosi
Matteo Poggi
Samuele Salti
S. Mattoccia
Luigi Di Stefano
MDE
19
21
0
19 Jan 2023
SwinDepth: Unsupervised Depth Estimation using Monocular Sequences via
  Swin Transformer and Densely Cascaded Network
SwinDepth: Unsupervised Depth Estimation using Monocular Sequences via Swin Transformer and Densely Cascaded Network
D. Shim
H. J. Kim
ViT
MDE
13
20
0
17 Jan 2023
Scene-Aware 3D Multi-Human Motion Capture from a Single Camera
Scene-Aware 3D Multi-Human Motion Capture from a Single Camera
D. Luvizon
Marc Habermann
Vladislav Golyanik
Adam Kortylewski
Christian Theobalt
3DH
HAI
22
18
0
12 Jan 2023
ViTs for SITS: Vision Transformers for Satellite Image Time Series
ViTs for SITS: Vision Transformers for Satellite Image Time Series
Michail Tarasiou
Erik Chavez
S. Zafeiriou
ViT
11
48
0
12 Jan 2023
CARD: Semantic Segmentation with Efficient Class-Aware Regularized
  Decoder
CARD: Semantic Segmentation with Efficient Class-Aware Regularized Decoder
Ye Huang
Di Kang
Liang Chen
W. Jia
Xiangjian He
Lixin Duan
Xuefei Zhe
Linchao Bao
34
2
0
11 Jan 2023
DeMT: Deformable Mixer Transformer for Multi-Task Learning of Dense
  Prediction
DeMT: Deformable Mixer Transformer for Multi-Task Learning of Dense Prediction
Yang Yang
Yibo Yang
L. Zhang
ViT
25
51
0
09 Jan 2023
Deep Planar Parallax for Monocular Depth Estimation
Deep Planar Parallax for Monocular Depth Estimation
H. Liang
Zhichao Li
Y. Yang
Naiyan Wang
MDE
26
0
0
09 Jan 2023
A Study on the Generality of Neural Network Structures for Monocular
  Depth Estimation
A Study on the Generality of Neural Network Structures for Monocular Depth Estimation
Ji-Hoon Bae
K. Hwang
Sunghoon Im
MDE
24
7
0
09 Jan 2023
All in Tokens: Unifying Output Space of Visual Tasks via Soft Token
All in Tokens: Unifying Output Space of Visual Tasks via Soft Token
Jia Ning
Chen Li
Zheng-Wei Zhang
Zigang Geng
Qi Dai
Kun He
Han Hu
33
44
0
05 Jan 2023
TiG-BEV: Multi-view BEV 3D Object Detection via Target Inner-Geometry
  Learning
TiG-BEV: Multi-view BEV 3D Object Detection via Target Inner-Geometry Learning
Pei-Kai Huang
L. Liu
Renrui Zhang
Song Zhang
Xin Xu
Bai-Qi Wang
G. Liu
3DPC
MDE
34
42
0
28 Dec 2022
Representation Separation for Semantic Segmentation with Vision
  Transformers
Representation Separation for Semantic Segmentation with Vision Transformers
Yuanduo Hong
Huihui Pan
Weichao Sun
Xinghu Yu
Huijun Gao
ViT
28
5
0
28 Dec 2022
Shakes on a Plane: Unsupervised Depth Estimation from Unstabilized
  Photography
Shakes on a Plane: Unsupervised Depth Estimation from Unstabilized Photography
Ilya Chugunov
Yuxuan Zhang
Felix Heide
MDE
33
9
0
22 Dec 2022
GOOD: Exploring Geometric Cues for Detecting Objects in an Open World
GOOD: Exploring Geometric Cues for Detecting Objects in an Open World
Haiwen Huang
Andreas Geiger
Dan Zhang
VLM
ObjD
21
11
0
22 Dec 2022
DuAT: Dual-Aggregation Transformer Network for Medical Image
  Segmentation
DuAT: Dual-Aggregation Transformer Network for Medical Image Segmentation
Feilong Tang
Q. Huang
Jinfeng Wang
Xianxu Hou
Jionglong Su
Jingxin Liu
ViT
MedIm
27
49
0
21 Dec 2022
MaskingDepth: Masked Consistency Regularization for Semi-supervised
  Monocular Depth Estimation
MaskingDepth: Masked Consistency Regularization for Semi-supervised Monocular Depth Estimation
Jongbeom Baek
Gyeongnyeon Kim
Seonghoon Park
Honggyu An
Matteo Poggi
Seung Wook Kim
MDE
29
0
0
21 Dec 2022
DAG: Depth-Aware Guidance with Denoising Diffusion Probabilistic Models
DAG: Depth-Aware Guidance with Denoising Diffusion Probabilistic Models
Gyeongnyeon Kim
Wooseok Jang
Gyuseong Lee
Susung Hong
Junyoung Seo
Seung Wook Kim
VLM
DiffM
32
11
0
17 Dec 2022
NoPe-NeRF: Optimising Neural Radiance Field with No Pose Prior
NoPe-NeRF: Optimising Neural Radiance Field with No Pose Prior
Wenjing Bian
Zirui Wang
Kejie Li
Jiawang Bian
V. Prisacariu
30
236
0
14 Dec 2022
EgoLoc: Revisiting 3D Object Localization from Egocentric Videos with
  Visual Queries
EgoLoc: Revisiting 3D Object Localization from Egocentric Videos with Visual Queries
Jinjie Mai
Abdullah Hamdi
Silvio Giancola
Chen Zhao
Bernard Ghanem
EgoV
30
14
0
14 Dec 2022
Towards Practical Plug-and-Play Diffusion Models
Towards Practical Plug-and-Play Diffusion Models
Hyojun Go
Yunsung Lee
Jin-Young Kim
Seunghyun Lee
Myeongho Jeong
Hyun Seung Lee
Seungtaek Choi
DiffM
27
16
0
12 Dec 2022
ROIFormer: Semantic-Aware Region of Interest Transformer for Efficient
  Self-Supervised Monocular Depth Estimation
ROIFormer: Semantic-Aware Region of Interest Transformer for Efficient Self-Supervised Monocular Depth Estimation
Daitao Xing
Jinglin Shen
C. Ho
Anthony Tzes
ViT
MDE
26
4
0
12 Dec 2022
Source-free Depth for Object Pop-out
Source-free Depth for Object Pop-out
Zongwei Wu
D. Paudel
Deng-Ping Fan
Jingjing Wang
Shuo Wang
C. Demonceaux
Radu Timofte
Luc Van Gool
35
44
0
10 Dec 2022
Mind The Edge: Refining Depth Edges in Sparsely-Supervised Monocular
  Depth Estimation
Mind The Edge: Refining Depth Edges in Sparsely-Supervised Monocular Depth Estimation
L. Talker
Aviad Cohen
E. Yosef
Alexandra Dana
Michael Dinerstein
32
6
0
10 Dec 2022
Monocular Camera and Single-Beam Sonar-Based Underwater Collision-Free
  Navigation with Domain Randomization
Monocular Camera and Single-Beam Sonar-Based Underwater Collision-Free Navigation with Domain Randomization
Pengzhi Yang
Haowen Liu
Monika Roznere
Alberto Quattrini Li
14
9
0
08 Dec 2022
MIME: Human-Aware 3D Scene Generation
MIME: Human-Aware 3D Scene Generation
Hongwei Yi
C. Huang
Shashank Tripathi
Lea Hering
Justus Thies
Michael J. Black
3DH
25
48
0
08 Dec 2022
Surround-view Fisheye BEV-Perception for Valet Parking: Dataset,
  Baseline and Distortion-insensitive Multi-task Framework
Surround-view Fisheye BEV-Perception for Valet Parking: Dataset, Baseline and Distortion-insensitive Multi-task Framework
Zizhang Wu
Yuanzhu Gan
Xianzhi Li
Yunzhe Wu
Xiaoquan Wang
Tianhao Xu
Fan Wang
23
9
0
08 Dec 2022
NeRDi: Single-View NeRF Synthesis with Language-Guided Diffusion as
  General Image Priors
NeRDi: Single-View NeRF Synthesis with Language-Guided Diffusion as General Image Priors
Congyue Deng
C. Jiang
C. Qi
Xinchen Yan
Yin Zhou
Leonidas J. Guibas
Drago Anguelov
DiffM
29
161
0
06 Dec 2022
Event-based Monocular Dense Depth Estimation with Recurrent Transformers
Event-based Monocular Dense Depth Estimation with Recurrent Transformers
Xu Liu
Jianing Li
Xiaopeng Fan
Yonghong Tian
ViT
MDE
64
16
0
06 Dec 2022
Objects as Spatio-Temporal 2.5D points
Objects as Spatio-Temporal 2.5D points
Paridhi Singh
Gaurav Singh
Arun C. S. Kumar
3DPC
24
0
0
06 Dec 2022
Location-Aware Self-Supervised Transformers for Semantic Segmentation
Location-Aware Self-Supervised Transformers for Semantic Segmentation
Mathilde Caron
N. Houlsby
Cordelia Schmid
ViT
21
10
0
05 Dec 2022
Self-supervised AutoFlow
Self-supervised AutoFlow
Hsin-Ping Huang
Charles Herrmann
Junhwa Hur
Erika Lu
Kyle Sargent
Austin Stone
Ming Yang
Deqing Sun
29
8
0
04 Dec 2022
Multi-resolution Monocular Depth Map Fusion by Self-supervised
  Gradient-based Composition
Multi-resolution Monocular Depth Map Fusion by Self-supervised Gradient-based Composition
Yaqiao Dai
Renjiao Yi
Chenyang Zhu
Hongjun He
Kai Xu
MDE
14
5
0
03 Dec 2022
BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for
  BEV 3D Object Detection
BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for BEV 3D Object Detection
Jianing Li
Ming Lu
Jiaming Liu
Yandong Guo
Li Du
Shanghang Zhang
40
6
0
01 Dec 2022
NeuralLift-360: Lifting An In-the-wild 2D Photo to A 3D Object with
  360° Views
NeuralLift-360: Lifting An In-the-wild 2D Photo to A 3D Object with 360° Views
Dejia Xu
Yifan Jiang
Peihao Wang
Zhiwen Fan
Yi Wang
Zhangyang Wang
DiffM
35
143
0
29 Nov 2022
Leveraging Image Matching Toward End-to-End Relative Camera Pose
  Regression
Leveraging Image Matching Toward End-to-End Relative Camera Pose Regression
Fadi Khatib
Yuval Margalit
Meirav Galun
Ronen Basri
25
2
0
27 Nov 2022
Lite-Mono: A Lightweight CNN and Transformer Architecture for
  Self-Supervised Monocular Depth Estimation
Lite-Mono: A Lightweight CNN and Transformer Architecture for Self-Supervised Monocular Depth Estimation
Ning Zhang
F. Nex
G. Vosselman
N. Kerle
MDE
36
153
0
23 Nov 2022
Event Transformer+. A multi-purpose solution for efficient event data
  processing
Event Transformer+. A multi-purpose solution for efficient event data processing
Alberto Sabater
Luis Montesano
Ana C. Murillo
ViT
31
8
0
22 Nov 2022
Hybrid Transformer Based Feature Fusion for Self-Supervised Monocular
  Depth Estimation
Hybrid Transformer Based Feature Fusion for Self-Supervised Monocular Depth Estimation
S. Tomar
Maitreya Suin
A. N. Rajagopalan
ViT
MDE
16
4
0
20 Nov 2022
A Practical Stereo Depth System for Smart Glasses
A Practical Stereo Depth System for Smart Glasses
Jialiang Wang
D. Scharstein
Akash Bapat
Kevin Blackburn-Matzen
Matthew Yu
...
Jan-Michael Frahm
Zijian He
Peter Vajda
Michael F. Cohen
M. Uyttendaele
MDE
28
5
0
19 Nov 2022
CroCo v2: Improved Cross-view Completion Pre-training for Stereo
  Matching and Optical Flow
CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical Flow
Philippe Weinzaepfel
Thomas Lucas
Vincent Leroy
Yohann Cabon
Vaibhav Arora
Romain Brégier
G. Csurka
L. Antsfeld
Boris Chidlovskii
Jérôme Revaud
ViT
20
81
0
18 Nov 2022
Estimating more camera poses for ego-centric videos is essential for
  VQ3D
Estimating more camera poses for ego-centric videos is essential for VQ3D
Jinjie Mai
Chen Zhao
Abdullah Hamdi
Silvio Giancola
Bernard Ghanem
EgoV
14
4
0
18 Nov 2022
LightDepth: A Resource Efficient Depth Estimation Approach for Dealing
  with Ground Truth Sparsity via Curriculum Learning
LightDepth: A Resource Efficient Depth Estimation Approach for Dealing with Ground Truth Sparsity via Curriculum Learning
Fatemeh Karimi
Amir Mehrpanah
Reza Rawassizadeh
17
1
0
16 Nov 2022
YORO -- Lightweight End to End Visual Grounding
YORO -- Lightweight End to End Visual Grounding
Chih-Hui Ho
Srikar Appalaraju
Bhavan A. Jasani
R. Manmatha
Nuno Vasconcelos
ObjD
21
21
0
15 Nov 2022
3D Scene Inference from Transient Histograms
3D Scene Inference from Transient Histograms
Sacha Jungerman
Atul Ingle
Yin Li
Mohit Gupta
16
6
0
09 Nov 2022
3DFill:Reference-guided Image Inpainting by Self-supervised 3D Image
  Alignment
3DFill:Reference-guided Image Inpainting by Self-supervised 3D Image Alignment
Liang Zhao
Xinyuan Zhao
Hailong Ma
Xinyu Zhang
Long Zeng
33
3
0
09 Nov 2022
Realistic Bokeh Effect Rendering on Mobile GPUs, Mobile AI & AIM 2022
  challenge: Report
Realistic Bokeh Effect Rendering on Mobile GPUs, Mobile AI & AIM 2022 challenge: Report
Andrey D. Ignatov
Radu Timofte
Jin Zhang
Feng Zhang
G. Yu
...
Mingyang Qian
Huixin Ma
Yanan Li
Xiaotao Wang
Lei Lei
15
10
0
07 Nov 2022
SC-DepthV3: Robust Self-supervised Monocular Depth Estimation for
  Dynamic Scenes
SC-DepthV3: Robust Self-supervised Monocular Depth Estimation for Dynamic Scenes
Libo Sun
Jiawang Bian
Huangying Zhan
Wei Yin
Ian Reid
Chunhua Shen
MDE
21
55
0
07 Nov 2022
Previous
123...141516...181920
Next