Vision Transformers for Dense Prediction

IEEE International Conference on Computer Vision (ICCV), 2021

24 March 2021

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)Github (2138★)

Papers citing "Vision Transformers for Dense Prediction"

50 / 1,224 papers shown

Monocular Visual-Inertial Depth EstimationIEEE International Conference on Robotics and Automation (ICRA), 2023

151

21 Mar 2023

Zero-1-to-3: Zero-shot One Image to 3D ObjectIEEE International Conference on Computer Vision (ICCV), 2023

Carl Vondrick

401

1,512

20 Mar 2023

Versatile Depth Estimator Based on Common Relative Depth Estimation and Camera-Specific Relative-to-Metric Depth ConversionJournal of Visual Communication and Image Representation (JVCIR), 2023

190

20 Mar 2023

Boosting Weakly Supervised Object Detection using Fusion and Priors from Hallucinated DepthIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

Cagri Gungor

Adriana Kovashka

MDE

281

20 Mar 2023

MECPformer: Multi-estimations Complementary Patch with CNN-Transformers for Weakly Supervised Semantic Segmentation

363

19 Mar 2023

Just Noticeable Visual Redundancy Forecasting: A Deep Multimodal-driven ApproachAAAI Conference on Artificial Intelligence (AAAI), 2023

18 Mar 2023

Local-to-Global Panorama Inpainting for Locale-Aware Indoor Lighting PredictionIEEE Transactions on Visualization and Computer Graphics (TVCG), 2023

178

18 Mar 2023

Single-view Neural Radiance Fields with Depth Teacher

Yurui Chen

Chun Gu

Feihu Zhang

Li Zhang

168

17 Mar 2023

Exploring Sparse Visual Prompt for Domain Adaptive Dense PredictionAAAI Conference on Artificial Intelligence (AAAI), 2023

Qizhe Zhang

Shanghang Zhang

302

17 Mar 2023

Efficient Computation Sharing for Multi-Task Visual Scene UnderstandingIEEE International Conference on Computer Vision (ICCV), 2023

269

16 Mar 2023

Unifying Top-down and Bottom-up Scanpath Prediction Using TransformersComputer Vision and Pattern Recognition (CVPR), 2023

270

16 Mar 2023

Large Selective Kernel Network for Remote Sensing Object DetectionIEEE International Conference on Computer Vision (ICCV), 2023

Jian Yang

Xiang Li

ObjD

328

465

16 Mar 2023

High-level Feature Guided Decoding for Semantic Segmentation

394

15 Mar 2023

Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D GenerationInternational Conference on Learning Representations (ICLR), 2023

397

155

14 Mar 2023

Adjacent-view Transformers for Supervised Surround-view Depth Estimation

343

14 Mar 2023

Pretrained ViTs Yield Versatile Representations For Medical Images

288

13 Mar 2023

Token Sparsification for Faster Medical Image SegmentationInformation Processing in Medical Imaging (IPMI), 2023

178

11 Mar 2023

3D Cinemagraphy from a Single ImageComputer Vision and Pattern Recognition (CVPR), 2023

Huiqiang Sun

178

10 Mar 2023

Lifelong-MonoDepth: Lifelong Learning for Multi-Domain Monocular Metric Depth EstimationIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023

297

09 Mar 2023

DiffusionDepth: Diffusion Denoising Approach for Monocular Depth EstimationEuropean Conference on Computer Vision (ECCV), 2023

Yiqun Duan

Xianda Guo

Zhengbiao Zhu

DiffM MDE

379

09 Mar 2023

Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models

359

771

08 Mar 2023

Weakly Supervised Caveline Detection For AUV Navigation Inside Underwater CavesIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023

212

07 Mar 2023

DwinFormer: Dual Window Transformers for End-to-End Monocular Depth EstimationIEEE Sensors Journal (IEEE Sens. J.), 2023

Md Awsafur Rahman

S. Fattah

ViT MDE

217

06 Mar 2023

Prismer: A Vision-Language Model with Multi-Task Experts

Linxi Fan

325

04 Mar 2023

Unleashing Text-to-Image Diffusion Models for Visual PerceptionIEEE International Conference on Computer Vision (ICCV), 2023

Wenliang Zhao

Jie Zhou

1.0K

301

03 Mar 2023

Monocular Depth Estimation using Diffusion Models

David J. Fleet

254

105

28 Feb 2023

Autonomous Intelligent Navigation for Flexible Endoscopy Using Monocular Depth Guidance and 3-D Shape PlanningIEEE International Conference on Robotics and Automation (ICRA), 2023

Ruofeng Wei

124

26 Feb 2023

Point Cloud Forecasting as a Proxy for 4D Occupancy ForecastingComputer Vision and Pattern Recognition (CVPR), 2023

418

25 Feb 2023

ZoeDepth: Zero-shot Transfer by Combining Relative and Metric Depth

472

774

23 Feb 2023

Bokeh Rendering Based on Adaptive Depth Calibration NetworkInternational Conference on Communication, Computing & Security (ICCCS), 2023

110

21 Feb 2023

Self-Supervised Monocular Depth Estimation with Self-Reference Distillation and Disparity Offset Refinement

248

20 Feb 2023

Deep Learning for Event-based Vision: A Comprehensive Survey and Benchmarks

Yunfan Lu

Lin Wang

327

119

17 Feb 2023

URCDC-Depth: Uncertainty Rectified Cross-Distillation with CutFlip for Monocular Depth EstimationIEEE transactions on multimedia (IEEE TMM), 2023

299

16 Feb 2023

VQ3D: Learning a 3D-Aware Generative Model on ImageNetIEEE International Conference on Computer Vision (ICCV), 2023

Charles Herrmann

Jiajun Wu

223

14 Feb 2023

VA-DepthNet: A Variational Approach to Single Image Depth PredictionInternational Conference on Learning Representations (ICLR), 2023

Ce Liu

Suryansh Kumar

Shuhang Gu

Radu Timofte

Luc Van Gool

MDE

303

13 Feb 2023

Semantic Image Segmentation: Two Decades of ResearchFoundations and Trends in Computer Graphics and Vision (FTCGV), 2023

273

13 Feb 2023

Scaling Vision Transformers to 22 Billion ParametersInternational Conference on Machine Learning (ICML), 2023

...

409

779

10 Feb 2023

Invariant Slot Attention: Object Discovery with Slot-Centric Reference FramesInternational Conference on Machine Learning (ICML), 2023

Ondrej Biza

Sjoerd van Steenkiste

Mehdi S. M. Sajjadi

Gamaleldin F. Elsayed

Aravindh Mahendran

Thomas Kipf

OCL

353

09 Feb 2023

Semantic Diffusion Network for Semantic SegmentationNeural Information Processing Systems (NeurIPS), 2023

205

04 Feb 2023

TEXTure: Text-Guided Texturing of 3D ShapesInternational Conference on Computer Graphics and Interactive Techniques (SIGGRAPH), 2023

Daniel Cohen-Or

306

334

03 Feb 2023

SceneScape: Text-Driven Consistent Scene GenerationNeural Information Processing Systems (NeurIPS), 2023

430

160

02 Feb 2023

Multi-modal Large Language Model Enhanced Pseudo 3D Perception Framework for Visual Commonsense Reasoning

255

30 Jan 2023

HDPV-SLAM: Hybrid Depth-augmented Panoramic Visual SLAM for Mobile Mapping System with Tilted LiDAR and Panoramic Visual Camera

315

27 Jan 2023

Leveraging the Third Dimension in Contrastive Learning

207

27 Jan 2023

AI-Based Framework for Understanding Car Following Behaviors of Drivers in A Naturalistic Driving Environment

Armstrong Aboah

Abdul Rashid Mussah

Y. Adu-Gyamfi

203

23 Jan 2023

FG-Depth: Flow-Guided Unsupervised Monocular Depth EstimationIEEE International Conference on Robotics and Automation (ICRA), 2023

Yong Liu

144

20 Jan 2023

Multiview Compressive Coding for 3D ReconstructionComputer Vision and Pattern Recognition (CVPR), 2023

Chaozheng Wu

Justin Johnson

Jitendra Malik

Christoph Feichtenhofer

Georgia Gkioxari

286

19 Jan 2023

Booster: a Benchmark for Depth from Images of Specular and Transparent SurfacesIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

Pierluigi Zama Ramirez

Alex Costanzino

Fabio Tosi

Matteo Poggi

223

19 Jan 2023

SwinDepth: Unsupervised Depth Estimation using Monocular Sequences via Swin Transformer and Densely Cascaded NetworkIEEE International Conference on Robotics and Automation (ICRA), 2023

D. Shim

H. J. Kim

ViT MDE

271

17 Jan 2023

Scene-Aware 3D Multi-Human Motion Capture from a Single Camera

347

12 Jan 2023