Vision Transformers for Dense Prediction

IEEE International Conference on Computer Vision (ICCV), 2021

24 March 2021

Papers citing "Vision Transformers for Dense Prediction"

50 / 1,228 papers shown

4DLangVGGT: 4D Language-Visual Geometry Grounded Transformer

283

04 Dec 2025

COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence

...

251

04 Dec 2025

ReasonX: MLLM-Guided Intrinsic Image Decomposition

117

03 Dec 2025

Unique Lives, Shared World: Learning from Single-Life Videos

...

236

03 Dec 2025

Label-Efficient Hyperspectral Image Classification via Spectral FiLM Modulation of Low-Level Pretrained Diffusion Features

Yuzhen Hu

Biplab Banerjee

Saurabh Prasad

155

03 Dec 2025

MUT3R: Motion-aware Updating Transformer for Dynamic 3D Reconstruction

235

03 Dec 2025

AVGGT: Rethinking Global Attention for Accelerating VGGT

250

02 Dec 2025

FlashVGGT: Efficient and Scalable Visual Geometry Transformers with Compressed Descriptor Attention

Zipeng Wang

Dan Xu

ViT

177

01 Dec 2025

Lotus-2: Advancing Geometric Dense Prediction with Powerful Image Generative Model

261

30 Nov 2025

Learning What Helps: Task-Aligned Context Selection for Vision Tasks

120

29 Nov 2025

Shoe Style-Invariant and Ground-Aware Learning for Dense Foot Contact Estimation

Daniel Sungho Jung

Kyoung Mu Lee

130

27 Nov 2025

Controllable 3D Object Generation with Single Image Prompt

International Conference on Pattern Recognition (ICPR), 2025

Jaeseok Lee

Jaekoo Lee

DiffM

146

27 Nov 2025

ColonAdapter: Geometry Estimation Through Foundation Model Adaptation for Colonoscopy

IEEE Robotics and Automation Letters (IEEE RA-L), 2025

104

27 Nov 2025

Training-Free Diffusion Priors for Text-to-Image Generation via Optimization-based Visual Inversion

Samuele Dell'Erba

Andrew D. Bagdanov

221

25 Nov 2025

AMB3R: Accurate Feed-forward Metric-scale 3D Reconstruction with Backend

Hengyi Wang

Lourdes Agapito

183

25 Nov 2025

Vision-Language Enhanced Foundation Model for Semi-supervised Medical Image Segmentation

Aggelos K. Katsaggelos

VLM

312

24 Nov 2025

4D-VGGT: A General Foundation Model with SpatioTemporal Awareness for Dynamic Scene Geometry Estimation

154

23 Nov 2025

Frequency-Adaptive Sharpness Regularization for Improving 3D Gaussian Splatting Generalization

Youngsik Yun

Dongjun Gu

Youngjung Uh

203

22 Nov 2025

MuM: Multi-View Masked Image Modeling for 3D Vision

329

21 Nov 2025

NoPe-NeRF++: Local-to-Global Optimization of NeRF with No Pose Prior

163

21 Nov 2025

CylinderDepth: Cylindrical Spatial Attention for Multi-View Consistent Self-Supervised Surround Depth Estimation

175

20 Nov 2025

CuriGS: Curriculum-Guided Gaussian Splatting for Sparse View Synthesis

536

20 Nov 2025

LEGO-SLAM: Language-Embedded Gaussian Optimization SLAM

180

20 Nov 2025

NoPo-Avatar: Generalizable and Animatable Avatars from Sparse Inputs without Human Poses

Jing Wen

Alexander Schwing

Shenlong Wang

153

20 Nov 2025

Upsample Anything: A Simple and Hard to Beat Baseline for Feature Upsampling

Minseok Seo

Mark Hamilton

Changick Kim

289

20 Nov 2025

RoMa v2: Harder Better Faster Denser Feature Matching

607

19 Nov 2025

PAVE: An End-to-End Dataset for Production Autonomous Vehicle Evaluation

134

18 Nov 2025

EGSA-PT: Edge-Guided Spatial Attention with Progressive Training for Monocular Depth Estimation and Segmentation of Transparent Objects

Gbenga Omotara

Ramy M. A. Farag

Seyed Mohamad Ali Tousi

G.N. DeSouza

MDE

330

18 Nov 2025

Geometry Meets Light: Leveraging Geometric Priors for Universal Photometric Stereo under Limited Multi-Illumination Cues

143

17 Nov 2025

Depth Anything 3: Recovering the Visual Space from Any Views

999

162

13 Nov 2025

Navigating the Wild: Pareto-Optimal Visual Decision-Making in Image Space

125

11 Nov 2025

Sparse4DGS: 4D Gaussian Splatting for Sparse-Frame Dynamic Scene Reconstruction

151

10 Nov 2025

FlowFeat: Pixel-Dense Embedding of Motion Profiles

422

10 Nov 2025

MUSE: Multi-Scale Dense Self-Distillation for Nucleus Detection and Classification

...

355

07 Nov 2025

GraspView: Active Perception Scoring and Best-View Optimization for Robotic Grasping in Cluttered Environments

181

06 Nov 2025

Diffusion-Guided Mask-Consistent Paired Mixing for Endoscopic Image Segmentation

279

05 Nov 2025

Densemarks: Learning Canonical Embeddings for Human Heads Images via Point Tracks

297

04 Nov 2025

Terrain-Enhanced Resolution-aware Refinement Attention for Off-Road Segmentation

Seongkyu Choi

Jhonghyun An

136

03 Nov 2025

SPADE: Sparsity Adaptive Depth Estimator for Zero-Shot, Real-Time, Monocular Depth Estimation in Underwater Environments

217

29 Oct 2025

More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models

591

27 Oct 2025

SwiftEmbed: Ultra-Fast Text Embeddings via Static Token Lookup for Real-Time Applications

Edouard Lansiaux

Antoine Simonet

Eric Wiel

194

27 Oct 2025

WaveMAE: Wavelet decomposition Masked Auto-Encoder for Remote Sensing

159

26 Oct 2025

Cross-view Localization and Synthesis -- Datasets, Challenges and Opportunities

N. Xu

R. Qin

DiffM

204

26 Oct 2025

IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction

...

526

26 Oct 2025

EndoSfM3D: Learning to 3D Reconstruct Any Endoscopic Surgery Scene using Self-supervised Foundation Model

Changhao Zhang

Matthew J. Clarkson

Mobarak I. Hoque

171

25 Oct 2025

Unveiling the Spatial-temporal Effective Receptive Fields of Spiking Neural Networks

219

24 Oct 2025

S3OD: Towards Generalizable Salient Object Detection with Synthetic Data

Orest Kupyn

Hirokatsu Kataoka

Christian Rupprecht

185

24 Oct 2025

Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views

...

235

21 Oct 2025

PLANA3R: Zero-shot Metric Planar 3D Reconstruction via Feed-Forward Planar Splatting

265

21 Oct 2025

Mapping Hidden Heritage: Self-supervised Pre-training on High-Resolution LiDAR DEM Derivatives for Archaeological Stone Wall Detection

196

20 Oct 2025