Vision Transformers for Dense Prediction

IEEE International Conference on Computer Vision (ICCV), 2021
24 March 2021
René Ranftl
Alexey Bochkovskiy
V. Koltun
    ViT MDE

Papers citing "Vision Transformers for Dense Prediction"

50 / 1,224 papers shown
Perception Encoder: The best visual embeddings are not at the output of the network
Daniel Bolya
Po-Yao (Bernie) Huang
Peize Sun
Jang Hyun Cho
Andrea Madotto
...
Shiyu Dong
Nikhila Ravi
Daniel Li
Piotr Dollár
Christoph Feichtenhofer
ObjD VOS
678
118
0
17 Apr 2025
SAR Object Detection with Self-Supervised Pretraining and Curriculum-Aware Sampling
Yasin Almalioglu
Andrzej Kucik
Geoffrey French
Dafni Antotsiou
Alexander Adam
Cedric Archambeau
310
1
0
17 Apr 2025
Regist3R: Incremental Registration with Stereo Foundation Model
Sidun Liu
Wenyu Li
Peng Qiao
Yong Dou
3DV
434
7
0
16 Apr 2025
TacoDepth: Towards Efficient Radar-Camera Depth Estimation with One-stage Fusion
Computer Vision and Pattern Recognition (CVPR), 2025
Yanjie Wang
Jiajian Li
Chaoyi Hong
Ruibo Li
Liusheng Sun
Xiao-yang Song
Zhe Wang
Zhiguo Cao
Guosheng Lin
MDE
287
4
0
16 Apr 2025
Metric-Solver: Sliding Anchored Metric Depth Estimation from a Single Image
Tao Wen
Jiadong Wang
Yuxiao Chen
Shugong Xu
Fangqiu Yi
Xuelong Li
MDE
360
1
0
16 Apr 2025
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
International Conference on Learning Representations (ICLR), 2025
Ziqi Pang
Xin Xu
Yu-Xiong Wang
DiffM
491
1
0
15 Apr 2025
SARFormer -- An Acquisition Parameter Aware Vision Transformer for Synthetic Aperture Radar Data
Jonathan Prexl
M. Recla
M. Schmitt
255
2
0
11 Apr 2025
PMNI: Pose-free Multi-view Normal Integration for Reflective and Textureless Surface Reconstruction
Computer Vision and Pattern Recognition (CVPR), 2025
Mingzhi Pei
Xu Cao
Xiangyi Wang
Heng Guo
Zhanyu Ma
3DV
310
0
0
11 Apr 2025
Novel Pooling-based VGG-Lite for Pneumonia and Covid-19 Detection from Imbalanced Chest X-Ray Datasets
IEEE Transactions on Emerging Topics in Computational Intelligence (TETCI), 2025
Santanu Roy
Ashvath Suresh
Palak Sahu
Tulika Rudra Gupta
355
0
0
10 Apr 2025
FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution
Gene Chou
Wenqi Xian
Guandao Yang
Mohamed Abdelfattah
Bharath Hariharan
Noah Snavely
Ning Yu
P. Debevec
MDE
466
6
0
09 Apr 2025
MonoPlace3D: Learning 3D-Aware Object Placement for 3D Monocular Detection
Computer Vision and Pattern Recognition (CVPR), 2025
Rishubh Parihar
Srinjay Sarkar
Sarthak Vora
Jogendra Nath Kundu
R. V. Babu
1.0K
1
0
09 Apr 2025
D$^2$USt3R: Enhancing 3D Reconstruction for Dynamic Scenes
Jisang Han
Honggyu An
Jaewoo Jung
Takuya Narihira
Junyoung Seo
Kazumi Fukuda
Chaehyun Kim
Sunghwan Hong
Yuki Mitsufuji
Seungryong Kim
297
8
0
08 Apr 2025
POMATO: Marrying Pointmap Matching with Temporal Motion for Dynamic 3D Reconstruction
Songyan Zhang
Yongtao Ge
Jinyuan Tian
Guangkai Xu
Hao Chen
Chen Lv
Chunhua Shen
3DPC
322
14
0
08 Apr 2025
Window Token Concatenation for Efficient Visual Large Language Models
Jiayi Zhang
Wentao Bao
Botao Ye
Zhen Tan
Tianlong Chen
Huan Liu
Yu Kong
VLM
277
1
0
05 Apr 2025
Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation
Computer Vision and Pattern Recognition (CVPR), 2025
Xin Zhang
Robby T. Tan
Mamba
305
3
0
04 Apr 2025
Optimizing 4D Gaussians for Dynamic Scene Video from Single Landscape Images
International Conference on Learning Representations (ICLR), 2025
In-Hwan Jin
Haesoo Choo
Seong-Hun Jeong
Heemoon Park
Junghwan Kim
Oh-joon Kwon
Kyeongbo Kong
3DGS
337
1
0
04 Apr 2025
PicoPose: Progressive Pixel-to-Pixel Correspondence Learning for Novel Object Pose Estimation
Lihua Liu
Jiehong Lin
Zhenxin Liu
Kui Jia
311
1
0
03 Apr 2025
Monocular and Generalizable Gaussian Talking Head Animation
Computer Vision and Pattern Recognition (CVPR), 2025
Shengjie Gong
Haoyang Li
Jiapeng Tang
Dongming Hu
Shuangping Huang
Hao Chen
Tianshui Chen
Zhuoman Liu
3DGS
234
7
0
01 Apr 2025
ADGaussian: Generalizable Gaussian Splatting for Autonomous Driving with Multi-modal Inputs
Qi Song
Chenghong Li
Haotong Lin
Sida Peng
Rui Huang
3DGS
389
3
0
01 Apr 2025
Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views
Computer Vision and Pattern Recognition (CVPR), 2025
Chong Bao
Xiyu Zhang
Zehao Yu
Jiale Shi
Guofeng Zhang
Songyou Peng
Zhaopeng Cui
3DGS 3DV
226
5
0
31 Mar 2025
Enhancing Image Resolution of Solar Magnetograms: A Latent Diffusion Model Approach
Francesco P. Ramunno
Paolo Massa
Vitaliy Kinakh
Brandon Panos
A. Csillaghy
Slava Voloshynovskiy
DiffM
272
0
0
31 Mar 2025
Easi3R: Estimating Disentangled Motion from DUSt3R Without Training
Xingyu Chen
Yue Chen
Yuliang Xiu
Andreas Geiger
Anpei Chen
3DPC VGen
401
47
0
31 Mar 2025
NeoARCADE: Robust Calibration for Distance Estimation to Support Assistive Drones for the Visually Impaired
Suman Raj
Bhavani A Madhabhavi
Madhav Kumar
Prabhav Gupta
Yogesh Simmhan
341
1
0
31 Mar 2025
BoundMatch: Boundary detection applied to semi-supervised segmentation
IEEE Access (IEEE Access), 2025
Haruya Ishikawa
Yoshimitsu Aoki
580
0
0
30 Mar 2025
Boosting Omnidirectional Stereo Matching with a Pre-trained Depth Foundation Model
Jannik Endres
Oliver Hahn
Charles Corbière
Simone Schaub-Meyer
Stefan Roth
Alexandre Alahi
MDE
386
0
0
30 Mar 2025
One Look is Enough: Seamless Patchwise Refinement for Zero-Shot Monocular Depth Estimation on High-Resolution Images
Byeongjun Kwon
Munchurl Kim
VLM MDE
361
0
0
28 Mar 2025
MVSAnywhere: Zero-Shot Multi-View Stereo
Computer Vision and Pattern Recognition (CVPR), 2025
Sergio Izquierdo
Mohamed Sayed
Michael Firman
Guillermo Garcia-Hernando
Daniyar Turmukhambetov
Javier Civera
Oisin Mac Aodha
Gabriel J. Brostow
Jamie Watson
3DV
374
13
0
28 Mar 2025
Deep Depth Estimation from Thermal Image: Dataset, Benchmark, and Challenges
Ukcheol Shin
Jinsun Park
3DV MDE
257
0
0
28 Mar 2025
DuckSegmentation: A segmentation model based on the AnYue Hemp Duck Dataset
Ling Feng
Tianyu Xie
Wei Ma
Ruijie Fu
Yujiao Shi
Jun Li
Bei Zhou
181
0
0
27 Mar 2025
The Coralscapes Dataset: Semantic Scene Understanding in Coral Reefs
Jonathan Sauder
Viktor Domazetoski
G. Banc-Prandi
Gabriela Perna
Anders Meibom
D. Tuia
297
6
0
25 Mar 2025
FUSE: Label-Free Image-Event Joint Monocular Depth Estimation via Frequency-Decoupled Alignment and Degradation-Robust Fusion
Pihai Sun
Junjun Jiang
Yuanqi Yao
Youyu Chen
Wenbo Zhao
Kui Jiang
Xianming Liu
MDE
473
0
0
25 Mar 2025
Semi-SMD: Semi-Supervised Metric Depth Estimation via Surrounding Cameras for Autonomous Driving
Yusen Xie
Zhengmin Huang
Shaojie Shen
Jun Ma
428
1
0
25 Mar 2025
Co-SemDepth: Fast Joint Semantic Segmentation and Depth Estimation on Aerial Images
Yara AlaaEldin
Francesca Odone
MDE
383
0
0
23 Mar 2025
ClaraVid: A Holistic Scene Reconstruction Benchmark From Aerial Perspective With Delentropy-Based Complexity Profiling
Radu Beche
Sergiu Nedevschi
955
1
0
22 Mar 2025
Co-op: Correspondence-based Novel Object Pose Estimation
Computer Vision and Pattern Recognition (CVPR), 2025
Sungphill Moon
Hyeontae Son
Dongcheol Hur
Sangwook Kim
3DH
254
5
0
22 Mar 2025
UniCon: Unidirectional Information Flow for Effective Control of Large-Scale Diffusion Models
International Conference on Learning Representations (ICLR), 2025
Fanghua Yu
Jinjin Gu
Jinfan Hu
Zheyuan Li
Chao Dong
DiffM
488
1
0
21 Mar 2025
Pow3R: Empowering Unconstrained 3D Reconstruction with Camera and Scene Priors
Computer Vision and Pattern Recognition (CVPR), 2025
Wonbong Jang
Philippe Weinzaepfel
Vincent Leroy
Lourdes Agapito
Jérôme Revaud
278
34
0
21 Mar 2025
Radar-Guided Polynomial Fitting for Metric Depth Estimation
Patrick Rim
Hyoungseob Park
Vadim Ezhov
Jeffrey Moon
Alex Wong
MDE
373
3
0
21 Mar 2025
QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge
Computer Vision and Pattern Recognition (CVPR), 2025
Xuan Shen
Weize Ma
Jing Liu
Changdi Yang
Rui Ding
...
Wei Niu
Yanzhi Wang
Pu Zhao
Jun Lin
Jiuxiang Gu
MQ
355
7
0
20 Mar 2025
Learning to Efficiently Adapt Foundation Models for Self-Supervised Endoscopic 3D Scene Reconstruction from Any Cameras
Beilei Cui
Long Bai
Mobarakol Islam
An-Chi Wang
Tianhao Shen
...
Feng Li
Daming Gao
Zhongliang Jiang
Nassir Navab
Hongliang Ren
MedIm
245
0
0
20 Mar 2025
GIVEPose: Gradual Intra-class Variation Elimination for RGB-based Category-Level Object Pose Estimation
Computer Vision and Pattern Recognition (CVPR), 2025
Zinqin Huang
Gu Wang
Chenyangguang Zhang
Ruida Zhang
Xiu Li
Xiangyang Ji
314
4
0
19 Mar 2025
Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene
Computer Vision and Pattern Recognition (CVPR), 2025
Shengqiong Wu
Hao Fei
Jingkang Yang
Xiaochen Li
Juncheng Li
Hao Zhang
Tat-Seng Chua
314
4
0
19 Mar 2025
Bolt3D: Generating 3D Scenes in Seconds
Stanislaw Szymanowicz
Jason Y. Zhang
P. Srinivasan
Ruiqi Gao
Arthur Brussee
Aleksander Holynski
Ricardo Martín Brualla
Jonathan T. Barron
Philipp Henzler
408
27
0
18 Mar 2025
Learning Efficient Fuse-and-Refine for Feed-Forward 3D Gaussian Splatting
Yiming Wang
Lucy Chai
Xuan Luo
Michael Niemeyer
Manuel Lagunas
Stephen Lombardi
Siyu Tang
Tiancheng Sun
3DGS
542
1
0
18 Mar 2025
Deblur Gaussian Splatting SLAM
Francesco Girlanda
D. Rozumnyi
Marc Pollefeys
Martin R. Oswald
3DGS
271
0
0
16 Mar 2025
VGGT: Visual Geometry Grounded Transformer
Computer Vision and Pattern Recognition (CVPR), 2025
Jianyuan Wang
Minghao Chen
Nikita Karaev
Andrea Vedaldi
Christian Rupprecht
David Novotny
ViT
521
550
0
14 Mar 2025
Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations
Computer Vision and Pattern Recognition (CVPR), 2025
Xunzhi Zheng
Dan Xu
AI4CE
274
3
0
13 Mar 2025
Self-Ensembling Gaussian Splatting for Few-Shot Novel View Synthesis
Chen Zhao
Xuan Wang
Tong Zhang
Saqib Javed
Mathieu Salzmann
3DGS
1.3K
3
0
13 Mar 2025
VicaSplat: A Single Run is All You Need for 3D Gaussian Splatting and Camera Estimation from Unposed Video Frames
Zhiqi Li
Chengrui Dong
Yiming Chen
Zhangchi Huang
Peidong Liu
3DGS ViT
272
7
0
13 Mar 2025
Knowledge Consultation for Semi-Supervised Semantic Segmentation
Thuan Than
Nhat-Anh Nguyen-Dang
Dung Nguyen
Salwa K. Al Khatib
Ahmed Elhagry
Hai T. Phan
Yihui He
Zhiqiang Shen
Marios Savvides
Dang T. Huynh
VLM
391
2
0
12 Mar 2025