ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.13413
  4. Cited By
Vision Transformers for Dense Prediction

Vision Transformers for Dense Prediction

24 March 2021
René Ranftl
Alexey Bochkovskiy
V. Koltun
    ViT
    MDE
ArXivPDFHTML

Papers citing "Vision Transformers for Dense Prediction"

50 / 982 papers shown
Title
MonoIndoor++:Towards Better Practice of Self-Supervised Monocular Depth
  Estimation for Indoor Environments
MonoIndoor++:Towards Better Practice of Self-Supervised Monocular Depth Estimation for Indoor Environments
Runze Li
Pan Ji
Yi Tian Xu
B. Bhanu
MDE
13
22
0
18 Jul 2022
MPIB: An MPI-Based Bokeh Rendering Framework for Realistic Partial
  Occlusion Effects
MPIB: An MPI-Based Bokeh Rendering Framework for Realistic Partial Occlusion Effects
Juewen Peng
Jianming Zhang
Xianrui Luo
Hao Lu
Ke Xian
Zhiguo Cao
13
13
0
18 Jul 2022
Egocentric Scene Understanding via Multimodal Spatial Rectifier
Egocentric Scene Understanding via Multimodal Spatial Rectifier
Tien Do
Khiem Vuong
Hyunjong Park
EgoV
25
4
0
14 Jul 2022
Joint Prediction of Monocular Depth and Structure using Planar and
  Parallax Geometry
Joint Prediction of Monocular Depth and Structure using Planar and Parallax Geometry
Hao Xing
Yifan Cao
Maximilian Biber
Mingchuan Zhou
Darius Burschka
3DPC
3DV
MDE
31
10
0
13 Jul 2022
Vision Transformer for NeRF-Based View Synthesis from a Single Input
  Image
Vision Transformer for NeRF-Based View Synthesis from a Single Input Image
Kai-En Lin
Yen-Chen Lin
Wei-Sheng Lai
Tsung-Yi Lin
Yichang Shih
R. Ramamoorthi
ViT
17
111
0
12 Jul 2022
Tracking Objects as Pixel-wise Distributions
Tracking Objects as Pixel-wise Distributions
Zelin Zhao
Ze Wu
Yueqing Zhuang
Boxun Li
Jiaya Jia
VOT
26
54
0
12 Jul 2022
Depthformer : Multiscale Vision Transformer For Monocular Depth
  Estimation With Local Global Information Fusion
Depthformer : Multiscale Vision Transformer For Monocular Depth Estimation With Local Global Information Fusion
Ashutosh Agarwal
Chetan Arora
MDE
ViT
71
38
0
10 Jul 2022
Self-attention on Multi-Shifted Windows for Scene Segmentation
Self-attention on Multi-Shifted Windows for Scene Segmentation
Litao Yu
Zhibin Li
Jian Andrew Zhang
Qiang Wu
SSeg
19
1
0
10 Jul 2022
Transformer based Models for Unsupervised Anomaly Segmentation in Brain
  MR Images
Transformer based Models for Unsupervised Anomaly Segmentation in Brain MR Images
Ahmed Ghorbel
Ahmed Aldahdooh
Shadi Albarqouni
Neuherberg
ViT
MedIm
22
4
0
05 Jul 2022
Interaction Transformer for Human Reaction Generation
Interaction Transformer for Human Reaction Generation
Baptiste Chopin
Hao Tang
N. Otberdout
Mohamed Daoudi
N. Sebe
ViT
25
27
0
04 Jul 2022
BokehMe: When Neural Rendering Meets Classical Rendering
BokehMe: When Neural Rendering Meets Classical Rendering
Juewen Peng
Zhiguo Cao
Xianrui Luo
Hao Lu
Ke Xian
Jianming Zhang
16
44
0
25 Jun 2022
Not Just Streaks: Towards Ground Truth for Single Image Deraining
Not Just Streaks: Towards Ground Truth for Single Image Deraining
Yunhao Ba
Howard Zhang
Ethan Yang
Akira Suzuki
Arnold Pfahnl
...
C. Melo
Suya You
Stefano Soatto
A. Wong
A. Kadambi
19
39
0
22 Jun 2022
Vicinity Vision Transformer
Vicinity Vision Transformer
Weixuan Sun
Zhen Qin
Huiyuan Deng
Jianyuan Wang
Yi Zhang
Kaihao Zhang
Nick Barnes
Stan Birchfield
Lingpeng Kong
Yiran Zhong
ViT
34
31
0
21 Jun 2022
IRISformer: Dense Vision Transformers for Single-Image Inverse Rendering
  in Indoor Scenes
IRISformer: Dense Vision Transformers for Single-Image Inverse Rendering in Indoor Scenes
Rui Zhu
Zhengqin Li
J. Matai
Fatih Porikli
Manmohan Chandraker
ViT
38
45
0
16 Jun 2022
K-Radar: 4D Radar Object Detection for Autonomous Driving in Various
  Weather Conditions
K-Radar: 4D Radar Object Detection for Autonomous Driving in Various Weather Conditions
Dong-Hee Paek
Seung-Hyun Kong
Kevin Tirta Wijaya
27
107
0
16 Jun 2022
SAVi++: Towards End-to-End Object-Centric Learning from Real-World
  Videos
SAVi++: Towards End-to-End Object-Centric Learning from Real-World Videos
Gamaleldin F. Elsayed
Aravindh Mahendran
Sjoerd van Steenkiste
Klaus Greff
Michael C. Mozer
Thomas Kipf
VOS
OCL
47
137
0
15 Jun 2022
Open Challenges in Deep Stereo: the Booster Dataset
Open Challenges in Deep Stereo: the Booster Dataset
Pierluigi Zama Ramirez
Fabio Tosi
Matteo Poggi
Samuele Salti
S. Mattoccia
Luigi Di Stefano
3DV
MDE
11
28
0
09 Jun 2022
SparseFormer: Attention-based Depth Completion Network
SparseFormer: Attention-based Depth Completion Network
Frederik Warburg
Michael Ramamonjisoa
Manuel López-Antequera
MoE
MDE
21
4
0
09 Jun 2022
Dyna-DM: Dynamic Object-aware Self-supervised Monocular Depth Maps
Dyna-DM: Dynamic Object-aware Self-supervised Monocular Depth Maps
Kieran Saunders
George Vogiatzis
Luis J. Manso
MDE
24
5
0
08 Jun 2022
Layered Depth Refinement with Mask Guidance
Layered Depth Refinement with Mask Guidance
S. Kim
Jianming Zhang
Simon Niklaus
Yifei Fan
Simon Chen
Zhe-nan Lin
Munchurl Kim
29
8
0
07 Jun 2022
MonoSDF: Exploring Monocular Geometric Cues for Neural Implicit Surface
  Reconstruction
MonoSDF: Exploring Monocular Geometric Cues for Neural Implicit Surface Reconstruction
Zehao Yu
Songyou Peng
Michael Niemeyer
Torsten Sattler
Andreas Geiger
13
447
0
01 Jun 2022
A Survey on Deep Learning for Skin Lesion Segmentation
A Survey on Deep Learning for Skin Lesion Segmentation
Z. Mirikharaji
Kumar Abhishek
Alceu Bissoto
Catarina Barata
Sandra Avila
Eduardo Valle
M. Celebi
Ghassan Hamarneh
31
82
0
01 Jun 2022
ViT-BEVSeg: A Hierarchical Transformer Network for Monocular
  Birds-Eye-View Segmentation
ViT-BEVSeg: A Hierarchical Transformer Network for Monocular Birds-Eye-View Segmentation
Pramit Dutta
Ganesh Sistu
S. Yogamani
E. López
J. McDonald
ViT
8
16
0
31 May 2022
Decomposing NeRF for Editing via Feature Field Distillation
Decomposing NeRF for Editing via Feature Field Distillation
Sosuke Kobayashi
Eiichi Matsumoto
Vincent Sitzmann
175
328
0
31 May 2022
Self-Supervised Pre-training of Vision Transformers for Dense Prediction
  Tasks
Self-Supervised Pre-training of Vision Transformers for Dense Prediction Tasks
Jaonary Rabarisoa
Velentin Belissen
Florian Chabot
Q. C. Pham
VLM
ViT
SSL
MDE
13
2
0
30 May 2022
Multi-Task Learning with Multi-Query Transformer for Dense Prediction
Multi-Task Learning with Multi-Query Transformer for Dense Prediction
Yangyang Xu
Xiangtai Li
Haobo Yuan
Yibo Yang
Lefei Zhang
ViT
23
45
0
28 May 2022
WT-MVSNet: Window-based Transformers for Multi-view Stereo
WT-MVSNet: Window-based Transformers for Multi-view Stereo
Jinli Liao
Yikang Ding
Yoli Shavit
Dihe Huang
Shihao Ren
Jia Guo
Wensen Feng
Kai Zhang
ViT
9
28
0
28 May 2022
Single-View View Synthesis in the Wild with Learned Adaptive Multiplane
  Images
Single-View View Synthesis in the Wild with Learned Adaptive Multiplane Images
Yuxuan Han
Ruicheng Wang
Jiaolong Yang
3DV
178
65
0
24 May 2022
Deep Digging into the Generalization of Self-Supervised Monocular Depth
  Estimation
Deep Digging into the Generalization of Self-Supervised Monocular Depth Estimation
Ji-Hoon Bae
Sungho Moon
Sunghoon Im
MDE
20
84
0
23 May 2022
Deep Learning for Omnidirectional Vision: A Survey and New Perspectives
Deep Learning for Omnidirectional Vision: A Survey and New Perspectives
Hao Ai
Zidong Cao
Jin Zhu
Haotian Bai
Yucheng Chen
Ling Wang
35
35
0
21 May 2022
Physically-Based Editing of Indoor Scene Lighting from a Single Image
Physically-Based Editing of Indoor Scene Lighting from a Single Image
Zhengqin Li
Jia Shi
Sai Bi
Rui Zhu
Kalyan Sunkavalli
Milovs Havsan
Zexiang Xu
R. Ramamoorthi
Manmohan Chandraker
3DV
26
57
0
19 May 2022
BodyMap: Learning Full-Body Dense Correspondence Map
BodyMap: Learning Full-Body Dense Correspondence Map
A. Ianina
N. Sarafianos
Yuanlu Xu
Ignacio Rocco
Tony Tung
3DH
22
14
0
18 May 2022
Visual Attention-based Self-supervised Absolute Depth Estimation using
  Geometric Priors in Autonomous Driving
Visual Attention-based Self-supervised Absolute Depth Estimation using Geometric Priors in Autonomous Driving
Jie Xiang
Yun Wang
Lifeng An
Haiyang Liu
Zijun Wang
Jian Liu
MDE
13
16
0
18 May 2022
Vision Transformer Adapter for Dense Predictions
Vision Transformer Adapter for Dense Predictions
Zhe Chen
Yuchen Duan
Wenhai Wang
Junjun He
Tong Lu
Jifeng Dai
Yu Qiao
43
541
0
17 May 2022
ColonFormer: An Efficient Transformer based Method for Colon Polyp
  Segmentation
ColonFormer: An Efficient Transformer based Method for Colon Polyp Segmentation
N. Duc
Nguyen Thi Oanh
N. T. Thuy
Trần Minh Triết
V. Dinh
ViT
MedIm
133
117
0
17 May 2022
Transformer Scale Gate for Semantic Segmentation
Transformer Scale Gate for Semantic Segmentation
Hengcan Shi
Munawar Hayat
Jianfei Cai
ViT
32
22
0
14 May 2022
3D Moments from Near-Duplicate Photos
3D Moments from Near-Duplicate Photos
Qianqian Wang
Zhengqi Li
D. Salesin
Noah Snavely
Brian L. Curless
Janne Kontkanen
3DH
DiffM
VGen
35
19
0
12 May 2022
GeoRefine: Self-Supervised Online Depth Refinement for Accurate Dense
  Mapping
GeoRefine: Self-Supervised Online Depth Refinement for Accurate Dense Mapping
Pan Ji
Qingan Yan
Yuxin Ma
Yi Tian Xu
MDE
26
11
0
03 May 2022
SideRT: A Real-time Pure Transformer Architecture for Single Image Depth
  Estimation
SideRT: A Real-time Pure Transformer Architecture for Single Image Depth Estimation
Chang Shu
Zi-Chun Chen
Lei Chen
Kuan Ma
Minghui Wang
Haibing Ren
ViT
16
14
0
29 Apr 2022
Depth Estimation with Simplified Transformer
Depth Estimation with Simplified Transformer
John Yang
Le An
Anurag Dixit
Jinkyu Koo
Su Inn Park
MDE
28
21
0
28 Apr 2022
Stochastic Coherence Over Attention Trajectory For Continuous Learning
  In Video Streams
Stochastic Coherence Over Attention Trajectory For Continuous Learning In Video Streams
Matteo Tiezzi
Simone Marullo
Lapo Faggi
Enrico Meloni
Alessandro Betti
S. Melacci
22
6
0
26 Apr 2022
Revealing Occlusions with 4D Neural Fields
Revealing Occlusions with 4D Neural Fields
Basile Van Hoorick
Purva Tendulkar
Dídac Surís
Dennis Park
Simon Stent
Carl Vondrick
22
16
0
22 Apr 2022
Learning Dynamic View Synthesis With Few RGBD Cameras
Shengze Wang
Y. Kwon
Yuan-Chung Shen
Q. Zhang
A. State
Jia-Bin Huang
Henry Fuchs
21
2
0
22 Apr 2022
CALI: Coarse-to-Fine ALIgnments Based Unsupervised Domain Adaptation of
  Traversability Prediction for Deployable Autonomous Navigation
CALI: Coarse-to-Fine ALIgnments Based Unsupervised Domain Adaptation of Traversability Prediction for Deployable Autonomous Navigation
Zheng Chen
Durgakant Pushp
Lantao Liu
11
6
0
20 Apr 2022
An Energy-Based Prior for Generative Saliency
An Energy-Based Prior for Generative Saliency
Jing Zhang
Jianwen Xie
Nick Barnes
Ping Li
37
3
0
19 Apr 2022
Multi-Frame Self-Supervised Depth with Transformers
Multi-Frame Self-Supervised Depth with Transformers
Vitor Campagnolo Guizilini
Rares Ambrus
Di Chen
Sergey Zakharov
Adrien Gaidon
ViT
MDE
15
84
0
15 Apr 2022
End-to-end Learning for Joint Depth and Image Reconstruction from
  Diffracted Rotation
End-to-end Learning for Joint Depth and Image Reconstruction from Diffracted Rotation
Mazen Mel
M. Siddiqui
Pietro Zanuttigh
MDE
8
6
0
14 Apr 2022
Pyramidal Attention for Saliency Detection
Pyramidal Attention for Saliency Detection
Tanveer Hussain
A. Anwar
Saeed Anwar
L. Petersson
Sungyong Baik
29
18
0
14 Apr 2022
SwinNet: Swin Transformer drives edge-aware RGB-D and RGB-T salient
  object detection
SwinNet: Swin Transformer drives edge-aware RGB-D and RGB-T salient object detection
Zhengyi Liu
Yacheng Tan
Qian He
Yun Xiao
ViT
25
225
0
12 Apr 2022
Stripformer: Strip Transformer for Fast Image Deblurring
Stripformer: Strip Transformer for Fast Image Deblurring
Fu-Jen Tsai
Yan-Tsung Peng
Yen-Yu Lin
Chung-Chi Tsai
Chia-Wen Lin
ViT
11
170
0
10 Apr 2022
Previous
123...1617181920
Next