Vision Transformers for Dense Prediction

IEEE International Conference on Computer Vision (ICCV), 2021

24 March 2021

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)Github (2138★)

Papers citing "Vision Transformers for Dense Prediction"

50 / 1,224 papers shown

GigaSLAM: Large-Scale Monocular SLAM with Hierarchical Gaussian Splats

344

11 Mar 2025

1LoRA: Summation Compression for Very Low-Rank Adaptation

222

11 Mar 2025

LBM: Latent Bridge Matching for Fast Image-to-Image Translation

456

10 Mar 2025

Towards Ambiguity-Free Spatial Foundation Model: Rethinking and Decoupling Depth Ambiguity

Matthew Johnson-Roberson

Xiaonan Huang

3DV

280

08 Mar 2025

EvidMTL: Evidential Multi-Task Learning for Uncertainty-Aware Semantic Surface Mapping from Monocular RGB Images

Rohit Menon

Nils Dengler

Sicong Pan

Gokul Krishna Chenchani

Maren Bennewitz

EDL

470

06 Mar 2025

Underlying Semantic Diffusion for Effective and Efficient In-Context Learning

317

06 Mar 2025

S2Gaussian: Sparse-View Super-Resolution 3D Gaussian SplattingComputer Vision and Pattern Recognition (CVPR), 2025

441

06 Mar 2025

Is Pre-training Applicable to the Decoder for Dense Prediction?

473

05 Mar 2025

COARSE: Collaborative Pseudo-Labeling with Coarse Real Labels for Off-Road Semantic Segmentation

348

05 Mar 2025

SVDC: Consistent Direct Time-of-Flight Video Depth Completion with Frequency Selective FusionComputer Vision and Pattern Recognition (CVPR), 2025

407

03 Mar 2025

MUSt3R: Multi-view Network for Stereo 3D ReconstructionComputer Vision and Pattern Recognition (CVPR), 2025

290

03 Mar 2025

Blind Augmentation: Calibration-free Camera Distortion Model Estimation for Real-time Mixed-reality ConsistencyIEEE Transactions on Visualization and Computer Graphics (TVCG), 2025

200

03 Mar 2025

Bridging Spectral-wise and Multi-spectral Depth Estimation via Geometry-guided Contrastive LearningIEEE International Conference on Robotics and Automation (ICRA), 2025

365

02 Mar 2025

Bring Your Own Grasp Generator: Leveraging Robot Grasp Generation for Prosthetic GraspingIEEE International Conference on Robotics and Automation (ICRA), 2025

Giuseppe Stracquadanio

272

01 Mar 2025

Back to the Future Cyclopean Stereo: a human perception approach combining deep and geometric constraints

Sherlon Almeida da Silva

Davi Geiger

Luiz Velho

Moacir Antonelli Ponti

290

28 Feb 2025

TrackGS: Optimizing COLMAP-Free 3D Gaussian Splatting with Global Track Constraints

328

27 Feb 2025

UniDepthV2: Universal Monocular Metric Depth Estimation Made Simpler

544

27 Feb 2025

LAM: Large Avatar Model for One-shot Animatable Gaussian Head

556

25 Feb 2025

FoREST: Frame of Reference Evaluation in Spatial Reasoning Tasks

Tanawan Premsri

Parisa Kordjamshidi

371

25 Feb 2025

Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain ModelInternational Conference on Learning Representations (ICLR), 2025

562

24 Feb 2025

Challenges of Multi-Modal Coreset Selection for Depth Prediction

Viktor Moskvoretskii

Narek Alvandian

201

20 Feb 2025

L4P: Towards Unified Low-Level 4D Vision Perception

468

18 Feb 2025

FLARE: Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse ViewsComputer Vision and Pattern Recognition (CVPR), 2025

607

17 Feb 2025

NPSim: Nighttime Photorealistic Simulation From Daytime Images With Monocular Inverse Rendering and Ray Tracing

Shutong Zhang

357

15 Feb 2025

CoL3D: Collaborative Learning of Single-view Depth and Camera Intrinsics for Metric 3D Shape RecoveryIEEE International Conference on Robotics and Automation (ICRA), 2025

478

13 Feb 2025

Matrix3D: Large Photogrammetry Model All-in-OneComputer Vision and Pattern Recognition (CVPR), 2025

689

11 Feb 2025

Semantic to Structure: Learning Structural Representations for Infringement DetectionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

233

11 Feb 2025

Revisiting Gradient-based Uncertainty for Monocular Depth EstimationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025

Julia Hornauer

Amir El-Ghoussani

Vasileios Belagiannis

UQCV

285

09 Feb 2025

Edge Attention Module for Object Classification

Santanu Roy

Ashvath Suresh

Archit Gupta

235

05 Feb 2025

Leveraging Stable Diffusion for Monocular Depth Estimation via Image Semantic EncodingTowards Autonomous Robotic Systems (TAROS), 2025

313

01 Feb 2025

MonoDINO-DETR: Depth-Enhanced Monocular 3D Object Detection Using a Vision Foundation Model

412

01 Feb 2025

CheapNVS: Real-Time On-Device Narrow-Baseline Novel View SynthesisIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

320

24 Jan 2025

Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward PassComputer Vision and Pattern Recognition (CVPR), 2025

769

169

23 Jan 2025

Enhancing Monocular Depth Estimation with Multi-Source Auxiliary TasksIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025

302

22 Jan 2025

Continuous 3D Perception Model with Persistent StateComputer Vision and Pattern Recognition (CVPR), 2025

355

236

21 Jan 2025

Survey on Monocular Metric Depth Estimation

Jiuling Zhang

VLM

726

21 Jan 2025

Video Depth Anything: Consistent Depth Estimation for Super-Long VideosComputer Vision and Pattern Recognition (CVPR), 2025

619

114

21 Jan 2025

See In Detail: Enhancing Sparse-view 3D Gaussian Splatting with Local Depth and Semantic RegularizationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

341

20 Jan 2025

FutureDepth: Learning to Predict the Future Improves Video Depth EstimationEuropean Conference on Computer Vision (ECCV), 2024

516

17 Jan 2025

MAMo: Leveraging Memory and Attention for Monocular Video Depth EstimationIEEE International Conference on Computer Vision (ICCV), 2023

600

17 Jan 2025

MonSter++: Unified Stereo Matching, Multi-view Stereo, and Real-time Stereo with Monodepth Priors

...

387

15 Jan 2025

Advancing Semantic Future Prediction through Multimodal Visual Sequence TransformersComputer Vision and Pattern Recognition (CVPR), 2025

392

14 Jan 2025

Kolmogorov-Arnold Network for Remote Sensing Image Semantic Segmentation

242

13 Jan 2025

OneLLM: One Framework to Align All Modalities with LanguageComputer Vision and Pattern Recognition (CVPR), 2023

577

198

10 Jan 2025

Powerful Design of Small Vision Transformer on CIFAR10

Gent Wu

ViT

269

07 Jan 2025

A Novel Vision Transformer for Camera-LiDAR Fusion based Traffic Object SegmentationInternational Conference on Agents and Artificial Intelligence (ICAART), 2025

195

06 Jan 2025

PatchRefiner V2: Fast and Lightweight Real-Domain High-Resolution Metric Depth Estimation

350

03 Jan 2025

TexAVi: Generating Stereoscopic VR Video Clips from Text Descriptions

331

02 Jan 2025

MetricDepth: Enhancing Monocular Depth Estimation with Deep Metric Learning

343

31 Dec 2024

TiGDistill-BEV: Multi-view BEV 3D Object Detection via Target Inner-Geometry Learning Distillation

31 Dec 2024