Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2103.13413
Cited By

Vision Transformers for Dense Prediction

Vision Transformers for Dense Prediction

IEEE International Conference on Computer Vision (ICCV), 2021

24 March 2021

Alexey Bochkovskiy

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)Github (2138★)

Papers citing "Vision Transformers for Dense Prediction"

50 / 1,223 papers shown

Elucidating the Role of Feature Normalization in IJEPA

Elucidating the Role of Feature Normalization in IJEPA

103

0

0

04 Aug 2025

Qwen-Image Technical Report

Qwen-Image Technical Report

...

340

239

0

04 Aug 2025

No Pose at All: Self-Supervised Pose-Free 3D Gaussian Splatting from Sparse Views

No Pose at All: Self-Supervised Pose-Free 3D Gaussian Splatting from Sparse Views

Krystian Mikolajczyk

352

9

0

02 Aug 2025

CoProU-VO: Combining Projected Uncertainty for End-to-End Unsupervised Monocular Visual Odometry

CoProU-VO: Combining Projected Uncertainty for End-to-End Unsupervised Monocular Visual Odometry

Oussema Dhaouadi

120

0

0

01 Aug 2025

Gaussian Splatting Feature Fields for Privacy-Preserving Visual Localization

Gaussian Splatting Feature Fields for Privacy-Preserving Visual LocalizationComputer Vision and Pattern Recognition (CVPR), 2025

Maxime Pietrantoni

Torsten Sattler

265

1

0

31 Jul 2025

MonoFusion: Sparse-View 4D Reconstruction via Monocular Fusion

MonoFusion: Sparse-View 4D Reconstruction via Monocular Fusion

Tarasha Khurana

127

4

0

31 Jul 2025

Unleashing the Power of Motion and Depth: A Selective Fusion Strategy for RGB-D Video Salient Object Detection

Unleashing the Power of Motion and Depth: A Selective Fusion Strategy for RGB-D Video Salient Object Detection

149

0

0

29 Jul 2025

PanoSplatt3R: Leveraging Perspective Pretraining for Generalized Unposed Wide-Baseline Panorama Reconstruction

PanoSplatt3R: Leveraging Perspective Pretraining for Generalized Unposed Wide-Baseline Panorama Reconstruction

125

1

0

29 Jul 2025

Ov3R: Open-Vocabulary Semantic 3D Reconstruction from RGB Videos

Ov3R: Open-Vocabulary Semantic 3D Reconstruction from RGB Videos

532

1

0

29 Jul 2025

SAMwave: Wavelet-Driven Feature Enrichment for Effective Adaptation of Segment Anything Model

SAMwave: Wavelet-Driven Feature Enrichment for Effective Adaptation of Segment Anything Model

Koteswar Rao Jerripothula

165

0

0

27 Jul 2025

UniCT Depth: Event-Image Fusion Based Monocular Depth Estimation with Convolution-Compensated ViT Dual SA Block

UniCT Depth: Event-Image Fusion Based Monocular Depth Estimation with Convolution-Compensated ViT Dual SA BlockInternational Joint Conference on Artificial Intelligence (IJCAI), 2025

252

0

0

26 Jul 2025

DepthFlow: Exploiting Depth-Flow Structural Correlations for Unsupervised Video Object Segmentation

DepthFlow: Exploiting Depth-Flow Structural Correlations for Unsupervised Video Object Segmentation

152

0

0

26 Jul 2025

Event-Based De-Snowing for Autonomous Driving

Event-Based De-Snowing for Autonomous Driving

Manasi Muglikar

Nico Messikommer

Davide Scaramuzza

96

1

0

25 Jul 2025

LONG3R: Long Sequence Streaming 3D Reconstruction

LONG3R: Long Sequence Streaming 3D Reconstruction

166

14

0

24 Jul 2025

DepthDark: Robust Monocular Depth Estimation for Low-Light Environments

DepthDark: Robust Monocular Depth Estimation for Low-Light Environments

236

3

0

24 Jul 2025

Dens3R: A Foundation Model for 3D Geometry Prediction

Dens3R: A Foundation Model for 3D Geometry Prediction

...

271

8

0

22 Jul 2025

Sparse-View 3D Reconstruction: Recent Advances and Open Challenges

Sparse-View 3D Reconstruction: Recent Advances and Open Challenges

201

1

0

22 Jul 2025

A Practical Investigation of Spatially-Controlled Image Generation with Transformers

A Practical Investigation of Spatially-Controlled Image Generation with Transformers

Harleen Hanspal

Petru-Daniel Tudosiu

210

0

0

21 Jul 2025

DAViD: Data-efficient and Accurate Vision Models from Synthetic Data

DAViD: Data-efficient and Accurate Vision Models from Synthetic Data

Antonio Criminisi

T. Baltrušaitis

157

1

0

21 Jul 2025

An Evaluation of DUSt3R/MASt3R/VGGT 3D Reconstruction on Photogrammetric Aerial Blocks

An Evaluation of DUSt3R/MASt3R/VGGT 3D Reconstruction on Photogrammetric Aerial Blocks

239

0

0

20 Jul 2025

Region-aware Depth Scale Adaptation with Sparse Measurements

Region-aware Depth Scale Adaptation with Sparse Measurements

201

0

0

20 Jul 2025

Advances in Feed-Forward 3D Reconstruction and View Synthesis: A Survey

Advances in Feed-Forward 3D Reconstruction and View Synthesis: A Survey

...

Hanspeter Pfister

Fangneng Zhan

641

8

0

19 Jul 2025

PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations

PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations

199

1

0

18 Jul 2025

Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation

Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation

...

181

3

0

15 Jul 2025

Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion

Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion

Aleksandar Jevtić

Christoph Reich

Christian Rupprecht

312

2

0

08 Jul 2025

Point3R: Streaming 3D Reconstruction with Explicit Spatial Pointer Memory

Point3R: Streaming 3D Reconstruction with Explicit Spatial Pointer Memory

137

26

0

03 Jul 2025

SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment

SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment

272

3

0

03 Jul 2025

DepthART: Monocular Depth Estimation as Autoregressive Refinement Task

DepthART: Monocular Depth Estimation as Autoregressive Refinement Task

Bulat Gabdullin

Nina Konovalova

Nikolay Patakin

Dmitry Senushkin

376

2

0

01 Jul 2025

WAFT: Warping-Alone Field Transforms for Optical Flow

WAFT: Warping-Alone Field Transforms for Optical Flow

233

2

0

26 Jun 2025

StereoDiff: Stereo-Diffusion Synergy for Video Depth Estimation

StereoDiff: Stereo-Diffusion Synergy for Video Depth Estimation

Kostas Daniilidis

247

3

0

25 Jun 2025

Light of Normals: Unified Feature Representation for Universal Photometric Stereo

Light of Normals: Unified Feature Representation for Universal Photometric Stereo

...

Satoshi Ikehata

329

5

0

23 Jun 2025

Pixel-Optimization-Free Patch Attack on Stereo Depth Estimation

Pixel-Optimization-Free Patch Attack on Stereo Depth Estimation

218

0

0

21 Jun 2025

RaCalNet: Radar Calibration Network for Sparse-Supervised Metric Depth Estimation

RaCalNet: Radar Calibration Network for Sparse-Supervised Metric Depth Estimation

308

1

0

18 Jun 2025

RA-NeRF: Robust Neural Radiance Field Reconstruction with Accurate Camera Pose Estimation under Complex Trajectories

RA-NeRF: Robust Neural Radiance Field Reconstruction with Accurate Camera Pose Estimation under Complex Trajectories

Qiang-qiang Wang

258

1

0

18 Jun 2025

DepthSeg: Depth prompting in remote sensing semantic segmentation

DepthSeg: Depth prompting in remote sensing semantic segmentation

148

0

0

17 Jun 2025

Vid-CamEdit: Video Camera Trajectory Editing with Generative Rendering from Estimated Geometry

Vid-CamEdit: Video Camera Trajectory Editing with Generative Rendering from Estimated Geometry

...

Takashi Shibuya

224

3

0

16 Jun 2025

GS-2DGS: Geometrically Supervised 2DGS for Reflective Object Reconstruction

GS-2DGS: Geometrically Supervised 2DGS for Reflective Object ReconstructionComputer Vision and Pattern Recognition (CVPR), 2025

Chuong H. Nguyen

216

8

0

16 Jun 2025

TR2M: Transferring Monocular Relative Depth to Metric Depth with Language Descriptions and Scale-Oriented Contrast

TR2M: Transferring Monocular Relative Depth to Metric Depth with Language Descriptions and Scale-Oriented Contrast

209

0

0

16 Jun 2025

ViLLa: A Neuro-Symbolic approach for Animal Monitoring

ViLLa: A Neuro-Symbolic approach for Animal Monitoring

102

0

0

12 Jun 2025

DGS-LRM: Real-Time Deformable 3D Gaussian Reconstruction From Monocular Videos

DGS-LRM: Real-Time Deformable 3D Gaussian Reconstruction From Monocular Videos

Thu Nguyen-Phuoc

...

Ming-Hsuan Yang

Richard Newcombe

281

2

0

11 Jun 2025

ScaleLSD: Scalable Deep Line Segment Detection StreamlinedComputer Vision and Pattern Recognition (CVPR), 2025

189

1

0

11 Jun 2025

3DGeoDet: General-purpose Geometry-aware Image-based 3D Object DetectionIEEE transactions on multimedia (TMM), 2025

323

2

0

11 Jun 2025

UniForward: Unified 3D Scene and Semantic Field Reconstruction via Feed-Forward Gaussian Splatting from Only Sparse-View Images

230

4

0

11 Jun 2025

UFM: A Simple Path towards Unified Dense Correspondence with Flow

Nikhil Varma Keetha

...

Sebastian A. Scherer

188

0

0

10 Jun 2025

JAFAR: Jack up Any Feature at Any Resolution

JAFAR: Jack up Any Feature at Any Resolution

Jean-Emmanuel Haugeard

498

6

0

10 Jun 2025

GoTrack: Generic 6DoF Object Pose Refinement and Tracking

GoTrack: Generic 6DoF Object Pose Refinement and Tracking

Van Nguyen Nguyen

Christian Forster

Sindi Shkodrani

Vincent Lepetit

196

1

0

08 Jun 2025

THU-Warwick Submission for EPIC-KITCHEN Challenge 2025: Semi-Supervised Video Object Segmentation

THU-Warwick Submission for EPIC-KITCHEN Challenge 2025: Semi-Supervised Video Object Segmentation

143

0

0

07 Jun 2025

Token Transforming: A Unified and Training-Free Token Compression Framework for Vision Transformer Acceleration

Token Transforming: A Unified and Training-Free Token Compression Framework for Vision Transformer Acceleration

166

6

0

06 Jun 2025

NTIRE 2025 Challenge on HR Depth from Images of Specular and Transparent Surfaces

NTIRE 2025 Challenge on HR Depth from Images of Specular and Transparent Surfaces

Pierluigi Zama Ramirez

Luigi Di Stefano

Alex Costanzino

...

247

25

0

06 Jun 2025

Deep Learning Reforms Image Matching: A Survey and Outlook

336

2

0

05 Jun 2025

1 2 3 4 5...23 24 25