Vision Transformers for Dense Prediction

IEEE International Conference on Computer Vision (ICCV), 2021

24 March 2021

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)Github (2138★)

Papers citing "Vision Transformers for Dense Prediction"

50 / 1,223 papers shown

StereoAdapter: Adapting Stereo Depth Estimation to Underwater Scenes

210

19 Sep 2025

DACoN: DINO for Anime Paint Bucket Colorization with Any Number of Reference Images

Kazuma Nagata

Naoshi Kaneko

DiffM

221

18 Sep 2025

Depth AnyEvent: A Cross-Modal Distillation Paradigm for Event-Based Monocular Depth Estimation

174

18 Sep 2025

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

...

290

16 Sep 2025

Towards Foundational Models for Single-Chip Radar

191

15 Sep 2025

UnLoc: Leveraging Depth Uncertainties for Floorplan Localization

139

14 Sep 2025

AD-GS: Alternating Densification for Sparse-Input 3D Gaussian Splatting

157

13 Sep 2025

LayerLock: Non-collapsing Representation Learning with Progressive Freezing

167

12 Sep 2025

Loc$^2$: Interpretable Cross-View Localization via Depth-Lifted Local Feature Matching

Loc

^2

: Interpretable Cross-View Localization via Depth-Lifted Local Feature Matching

254

11 Sep 2025

PanoLAM: Large Avatar Model for Gaussian Full-Head Synthesis from One-shot Unposed Image

267

09 Sep 2025

Faster VGGT with Block-Sparse Global Attention

Chung-Shien Brian Wang

116

08 Sep 2025

JRN-Geo: A Joint Perception Network based on RGB and Normal images for Cross-view Geo-localizationIEEE International Conference on Robotics and Automation (ICRA), 2025

127

06 Sep 2025

FlowSeek: Optical Flow Made Easier with Depth Foundation Models and Motion Bases

Matteo Poggi

Fabio Tosi

106

05 Sep 2025

From Editor to Dense Geometry Estimator

212

04 Sep 2025

DUViN: Diffusion-Based Underwater Visual Navigation via Knowledge-Transferred Depth Features

130

03 Sep 2025

Manipulation as in Simulation: Enabling Accurate Geometry Perception in Robots

...

130

02 Sep 2025

RiverScope: High-Resolution River Masking Dataset

...

128

02 Sep 2025

ViSTA-SLAM: Visual SLAM with Symmetric Two-view Association

122

01 Sep 2025

ER-LoRA: Effective-Rank Guided Adaptation for Weather-Generalized Depth Estimation

258

31 Aug 2025

SegDINO: An Efficient Design for Medical and Natural Image Segmentation with DINO-V3

242

31 Aug 2025

FastAvatar: Towards Unified Fast High-Fidelity 3D Avatar Reconstruction with Large Gaussian Reconstruction Transformers

27 Aug 2025

SAIL-Recon: Large SfM by Augmenting Scene Regression with Localization

100

25 Aug 2025

HAMSt3R: Human-Aware Multi-view Stereo 3D Reconstruction

22 Aug 2025

SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse WeatherEuropean Conference on Computer Vision (ECCV), 2025

192

22 Aug 2025

Representation Learning with Adaptive Superpixel Coding

128

21 Aug 2025

SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass

391

21 Aug 2025

Snap-Snap: Taking Two Images to Reconstruct 3D Human Gaussians in Milliseconds

162

20 Aug 2025

GasTwinFormer: A Hybrid Vision Transformer for Livestock Methane Emission Segmentation and Dietary Classification in Optical Gas Imaging

20 Aug 2025

ROVR-Open-Dataset: A Large-Scale Depth Dataset for Autonomous Driving

...

247

19 Aug 2025

PhysGM: Large Physical Gaussian Model for Feed-Forward 4D Synthesis

234

19 Aug 2025

Local Scale Equivariance with Latent Deep Equilibrium Canonicalizer

154

19 Aug 2025

Online 3D Gaussian Splatting Modeling with Novel View SelectionInternational Joint Conference on Artificial Intelligence (IJCAI), 2025

211

19 Aug 2025

STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer

178

14 Aug 2025

Revisiting Efficient Semantic Segmentation: Learning Offsets for Better Spatial and Class Feature Alignment

199

12 Aug 2025

TRIDE: A Text-assisted Radar-Image weather-aware fusion network for Depth Estimation

145

11 Aug 2025

Mem4D: Decoupling Static and Dynamic Memory for Dynamic Scene Reconstruction

200

11 Aug 2025

Learning an Implicit Physics Model for Image-based Fluid Simulation

11 Aug 2025

Matrix-3D: Omnidirectional Explorable 3D World Generation

...

130

11 Aug 2025

VesselRW: Weakly Supervised Subcutaneous Vessel Segmentation via Learned Random Walk Propagation

Ayaan Nooruddin Siddiqui

Mahnoor Zaidi

Ayesha Nazneen Shahbaz

Priyadarshini Chatterjee

Krishnan Menon Iyer

257

09 Aug 2025

Edge Detection for Organ Boundaries via Top Down Refinement and SubPixel Upsampling

278

09 Aug 2025

DualResolution Residual Architecture with Artifact Suppression for Melanocytic Lesion Segmentation

Vikram Singh

Kabir Malhotra

Rohan Desai

Ananya Shankaracharya

Priyadarshini Chatterjee

Krishnan Menon Iyer

MedIm

289

09 Aug 2025

CF3: Compact and Fast 3D Feature Fields

209

07 Aug 2025

EndoMatcher: Generalizable Endoscopic Image Matcher via Multi-Domain Pre-training for Robot-Assisted Surgery

07 Aug 2025

AR as an Evaluation Playground: Bridging Metrics and Visual Perception of Computer Vision Models

Ashkan Ganj

Yiqin Zhao

Tian Guo

103

06 Aug 2025

DET-GS: Depth- and Edge-Aware Regularization for High-Fidelity 3D Gaussian Splatting

172

06 Aug 2025

Extending Foundational Monocular Depth Estimators to Fisheye Cameras with Calibration Tokens

Suchisrit Gangopadhyay

318

06 Aug 2025

Monocular Depth Estimation with Global-Aware Discretization and Local Context Modeling

174

05 Aug 2025

Uni3R: Unified 3D Reconstruction and Semantic Understanding via Generalizable Gaussian Splatting from Unposed Multi-View Images

...

314

05 Aug 2025

Deeply Dual Supervised learning for melanoma recognition

Rujosh Polma

Krishnan Menon Iyer

227

04 Aug 2025

Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images

156

04 Aug 2025