ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.13413
  4. Cited By
Vision Transformers for Dense Prediction

Vision Transformers for Dense Prediction

IEEE International Conference on Computer Vision (ICCV), 2021
24 March 2021
René Ranftl
Alexey Bochkovskiy
V. Koltun
    ViTMDE
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)Github (2138★)

Papers citing "Vision Transformers for Dense Prediction"

50 / 1,223 papers shown
Elucidating the Role of Feature Normalization in IJEPA
Elucidating the Role of Feature Normalization in IJEPA
Adam Colton
103
0
0
04 Aug 2025
Qwen-Image Technical Report
Qwen-Image Technical Report
Chenfei Wu
Jiahao Nick Li
Jingren Zhou
Junyang Lin
Kaiyuan Gao
...
Yichang Zhang
Yongqiang Zhu
Y. Wu
Yuxuan Cai
Zenan Liu
DiffMVLM
340
239
0
04 Aug 2025
No Pose at All: Self-Supervised Pose-Free 3D Gaussian Splatting from Sparse Views
No Pose at All: Self-Supervised Pose-Free 3D Gaussian Splatting from Sparse Views
Ranran Huang
Krystian Mikolajczyk
3DGS
352
9
0
02 Aug 2025
CoProU-VO: Combining Projected Uncertainty for End-to-End Unsupervised Monocular Visual Odometry
CoProU-VO: Combining Projected Uncertainty for End-to-End Unsupervised Monocular Visual Odometry
Jingchao Xie
Oussema Dhaouadi
Weirong Chen
Johannes Meier
Jacques Kaiser
Daniel Cremers
120
0
0
01 Aug 2025
Gaussian Splatting Feature Fields for Privacy-Preserving Visual Localization
Gaussian Splatting Feature Fields for Privacy-Preserving Visual LocalizationComputer Vision and Pattern Recognition (CVPR), 2025
Maxime Pietrantoni
G. Csurka
Torsten Sattler
265
1
0
31 Jul 2025
MonoFusion: Sparse-View 4D Reconstruction via Monocular Fusion
MonoFusion: Sparse-View 4D Reconstruction via Monocular Fusion
Zihan Wang
Jeff Tan
Tarasha Khurana
Neehar Peri
Deva Ramanan
127
4
0
31 Jul 2025
Unleashing the Power of Motion and Depth: A Selective Fusion Strategy for RGB-D Video Salient Object Detection
Unleashing the Power of Motion and Depth: A Selective Fusion Strategy for RGB-D Video Salient Object Detection
Jiahao He
Daerji Suolang
Keren Fu
Qijun Zhao
149
0
0
29 Jul 2025
PanoSplatt3R: Leveraging Perspective Pretraining for Generalized Unposed Wide-Baseline Panorama Reconstruction
PanoSplatt3R: Leveraging Perspective Pretraining for Generalized Unposed Wide-Baseline Panorama Reconstruction
Jiahui Ren
Mochu Xiang
Jiajun Zhu
Yuchao Dai
125
1
0
29 Jul 2025
Ov3R: Open-Vocabulary Semantic 3D Reconstruction from RGB Videos
Ov3R: Open-Vocabulary Semantic 3D Reconstruction from RGB Videos
Ziren Gong
Xiaohan Li
Fabio Tosi
Jiawei Han
S. Mattoccia
Jianfei Cai
Matteo Poggi
3DPC
532
1
0
29 Jul 2025
SAMwave: Wavelet-Driven Feature Enrichment for Effective Adaptation of Segment Anything Model
SAMwave: Wavelet-Driven Feature Enrichment for Effective Adaptation of Segment Anything Model
Saurabh Yadav
Avi Gupta
Koteswar Rao Jerripothula
VLM
165
0
0
27 Jul 2025
UniCT Depth: Event-Image Fusion Based Monocular Depth Estimation with Convolution-Compensated ViT Dual SA Block
UniCT Depth: Event-Image Fusion Based Monocular Depth Estimation with Convolution-Compensated ViT Dual SA BlockInternational Joint Conference on Artificial Intelligence (IJCAI), 2025
Luoxi Jing
Dianxi Shi
Zhe Liu
Songchang Jin
Chunping Qiu
Ziteng Qiao
Yuxian Li
Jianqiang Xia
ViTMDE
252
0
0
26 Jul 2025
DepthFlow: Exploiting Depth-Flow Structural Correlations for Unsupervised Video Object Segmentation
DepthFlow: Exploiting Depth-Flow Structural Correlations for Unsupervised Video Object Segmentation
Suhwan Cho
Minhyeok Lee
Jungho Lee
Donghyeong Kim
Sangyoun Lee
VOS
152
0
0
26 Jul 2025
Event-Based De-Snowing for Autonomous Driving
Event-Based De-Snowing for Autonomous Driving
Manasi Muglikar
Nico Messikommer
Marco Cannici
Davide Scaramuzza
96
1
0
25 Jul 2025
LONG3R: Long Sequence Streaming 3D Reconstruction
LONG3R: Long Sequence Streaming 3D Reconstruction
Zhuoguang Chen
Minghui Qin
Tianyuan Yuan
Zhe Liu
Hang Zhao
166
14
0
24 Jul 2025
DepthDark: Robust Monocular Depth Estimation for Low-Light Environments
DepthDark: Robust Monocular Depth Estimation for Low-Light Environments
Longjian Zeng
Zunjie Zhu
Rongfeng Lu
Ming Lu
Bolun Zheng
C. Yan
Anke Xue
VLMMDE
236
3
0
24 Jul 2025
Dens3R: A Foundation Model for 3D Geometry Prediction
Dens3R: A Foundation Model for 3D Geometry Prediction
X. Fang
Jingnan Gao
Zhe Wang
Zhuo Chen
X. Ren
...
Qiaomu Ren
Zhonglei Yang
Xiaokang Yang
Manwen Liao
Chengfei Lyu
3DVAI4CE
271
8
0
22 Jul 2025
Sparse-View 3D Reconstruction: Recent Advances and Open Challenges
Sparse-View 3D Reconstruction: Recent Advances and Open Challenges
Tanveer Younis
Zhanglin Cheng
3DGS
201
1
0
22 Jul 2025
A Practical Investigation of Spatially-Controlled Image Generation with Transformers
A Practical Investigation of Spatially-Controlled Image Generation with Transformers
Guoxuan Xia
Harleen Hanspal
Petru-Daniel Tudosiu
Shifeng Zhang
Sarah Parisot
210
0
0
21 Jul 2025
DAViD: Data-efficient and Accurate Vision Models from Synthetic Data
DAViD: Data-efficient and Accurate Vision Models from Synthetic Data
F. Saleh
S. Aliakbarian
Charlie Hewitt
Lohit Petikam
Xiao-Xian
Antonio Criminisi
T. Cashman
T. Baltrušaitis
157
1
0
21 Jul 2025
An Evaluation of DUSt3R/MASt3R/VGGT 3D Reconstruction on Photogrammetric Aerial Blocks
An Evaluation of DUSt3R/MASt3R/VGGT 3D Reconstruction on Photogrammetric Aerial Blocks
Xinyi Wu
S. Landgraf
Markus Ulrich
R. Qin
3DGS3DV
239
0
0
20 Jul 2025
Region-aware Depth Scale Adaptation with Sparse Measurements
Region-aware Depth Scale Adaptation with Sparse Measurements
Rizhao Fan
Tianfang Ma
Zhigen Li
Ning An
Jian Cheng
201
0
0
20 Jul 2025
Advances in Feed-Forward 3D Reconstruction and View Synthesis: A Survey
Advances in Feed-Forward 3D Reconstruction and View Synthesis: A Survey
Jiahui Zhang
Yuelei Li
Anpei Chen
Muyu Xu
Kunhao Liu
...
Hanspeter Pfister
Paul Liang
Shijian Lu
Fangneng Zhan
Fangneng Zhan
641
8
0
19 Jul 2025
PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations
PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations
Yu Wei
Jiahui Zhang
Xiaoqin Zhang
Ling Shao
Shijian Lu
3DV
199
1
0
18 Jul 2025
Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation
Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation
Zhen Xu
Hongyu Zhou
Sida Peng
Haotong Lin
Haoyu Guo
...
Yue Wang
Ruizhen Hu
Yiyi Liao
Xiaowei Zhou
Hujun Bao
VLM
181
3
0
15 Jul 2025
Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion
Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion
Aleksandar Jevtić
Christoph Reich
Felix Wimbauer
Oliver Hahn
Christian Rupprecht
Stefan Roth
Daniel Cremers
312
2
0
08 Jul 2025
Point3R: Streaming 3D Reconstruction with Explicit Spatial Pointer Memory
Point3R: Streaming 3D Reconstruction with Explicit Spatial Pointer Memory
Yuqi Wu
Wenzhao Zheng
Jie Zhou
Jiwen Lu
137
26
0
03 Jul 2025
SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment
SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment
Qi Xu
Dongxu Wei
Lingzhe Zhao
Wenpu Li
Zhangchi Huang
Shunping Ji
Peidong Liu
3DV
272
3
0
03 Jul 2025
DepthART: Monocular Depth Estimation as Autoregressive Refinement Task
DepthART: Monocular Depth Estimation as Autoregressive Refinement Task
Bulat Gabdullin
Nina Konovalova
Nikolay Patakin
Dmitry Senushkin
Anton Konushin
MDE
376
2
0
01 Jul 2025
WAFT: Warping-Alone Field Transforms for Optical Flow
WAFT: Warping-Alone Field Transforms for Optical Flow
Yihan Wang
Gaowen Liu
233
2
0
26 Jun 2025
StereoDiff: Stereo-Diffusion Synergy for Video Depth Estimation
StereoDiff: Stereo-Diffusion Synergy for Video Depth Estimation
Haodong Li
Chen Wang
Jiahui Lei
Kostas Daniilidis
Lingjie Liu
DiffMVGenMDE
247
3
0
25 Jun 2025
Light of Normals: Unified Feature Representation for Universal Photometric Stereo
Light of Normals: Unified Feature Representation for Universal Photometric Stereo
Hong Li
Houyuan Chen
Chongjie Ye
Zhaoxi Chen
Bohan Li
...
Baochang Zhang
Satoshi Ikehata
Boxin Shi
Anyi Rao
Hao Zhao
329
5
0
23 Jun 2025
Pixel-Optimization-Free Patch Attack on Stereo Depth Estimation
Pixel-Optimization-Free Patch Attack on Stereo Depth Estimation
Hangcheng Liu
Xu Kuang
Xingshuo Han
Xingwan Wu
Haoran Ou
Shangwei Guo
Xingyi Huang
Tao Xiang
Tianwei Zhang
AAML
218
0
0
21 Jun 2025
RaCalNet: Radar Calibration Network for Sparse-Supervised Metric Depth Estimation
RaCalNet: Radar Calibration Network for Sparse-Supervised Metric Depth Estimation
Xingrui Qin
Wentao Zhao
Chuan Cao
Yihe Niu
Houcheng Jiang
Houcheng Jiang
Rui Guo
Jingchuan Wang
308
1
0
18 Jun 2025
RA-NeRF: Robust Neural Radiance Field Reconstruction with Accurate Camera Pose Estimation under Complex Trajectories
RA-NeRF: Robust Neural Radiance Field Reconstruction with Accurate Camera Pose Estimation under Complex Trajectories
Qingsong Yan
Qiang-qiang Wang
Kaiyong Zhao
Jie Chen
Bo Li
Xiaowen Chu
Fei Deng
258
1
0
18 Jun 2025
DepthSeg: Depth prompting in remote sensing semantic segmentation
DepthSeg: Depth prompting in remote sensing semantic segmentation
Ning Zhou
Shanxiong Chen
Mingting Zhou
Haigang Sui
Lieyun Hu
Han Li
Li Hua
Qiming Zhou
VLMMDE
148
0
0
17 Jun 2025
Vid-CamEdit: Video Camera Trajectory Editing with Generative Rendering from Estimated Geometry
Vid-CamEdit: Video Camera Trajectory Editing with Generative Rendering from Estimated Geometry
Junyoung Seo
Jisang Han
Jaewoo Jung
Siyoon Jin
Joungbin Lee
...
Takashi Shibuya
Donghoon Ahn
Shoukang Hu
Seungryong Kim
Yuki Mitsufuji
VGen
224
3
0
16 Jun 2025
GS-2DGS: Geometrically Supervised 2DGS for Reflective Object Reconstruction
GS-2DGS: Geometrically Supervised 2DGS for Reflective Object ReconstructionComputer Vision and Pattern Recognition (CVPR), 2025
Jinguang Tong
Xuesong Li
F. A. Maken
Sundaram Muthu
Lars Petersson
Chuong H. Nguyen
Hongdong Li
3DGS
216
8
0
16 Jun 2025
TR2M: Transferring Monocular Relative Depth to Metric Depth with Language Descriptions and Scale-Oriented Contrast
TR2M: Transferring Monocular Relative Depth to Metric Depth with Language Descriptions and Scale-Oriented Contrast
Beilei Cui
Yiming Huang
Long Bai
Hongliang Ren
209
0
0
16 Jun 2025
ViLLa: A Neuro-Symbolic approach for Animal Monitoring
ViLLa: A Neuro-Symbolic approach for Animal Monitoring
Harsha Koduri
102
0
0
12 Jun 2025
DGS-LRM: Real-Time Deformable 3D Gaussian Reconstruction From Monocular Videos
DGS-LRM: Real-Time Deformable 3D Gaussian Reconstruction From Monocular Videos
C. Lin
Zhaoyang Lv
Songyin Wu
Zhen Xu
Thu Nguyen-Phuoc
...
Ming-Hsuan Yang
Yuheng Ren
Richard Newcombe
Zhao Dong
Zhengqin Li
3DGS
281
2
0
11 Jun 2025
ScaleLSD: Scalable Deep Line Segment Detection StreamlinedComputer Vision and Pattern Recognition (CVPR), 2025
Zeran Ke
Bin Tan
Xianwei Zheng
Yujun Shen
Tianfu Wu
Nan Xue
189
1
0
11 Jun 2025
3DGeoDet: General-purpose Geometry-aware Image-based 3D Object DetectionIEEE transactions on multimedia (TMM), 2025
Yi Zhang
Y. X. R. Wang
Yawen Cui
Lap-Pui Chau
3DPC
323
2
0
11 Jun 2025
UniForward: Unified 3D Scene and Semantic Field Reconstruction via Feed-Forward Gaussian Splatting from Only Sparse-View Images
Qijian Tian
Xin Tan
Jingyu Gong
Yuan Xie
Lizhuang Ma
3DGS
230
4
0
11 Jun 2025
UFM: A Simple Path towards Unified Dense Correspondence with Flow
Yuchen Zhang
Nikhil Varma Keetha
Chenwei Lyu
Bhuvan Jhamb
Yutian Chen
...
Shreyas Jha
Yaoyu Hu
Deva Ramanan
Sebastian A. Scherer
Wenshan Wang
188
0
0
10 Jun 2025
JAFAR: Jack up Any Feature at Any Resolution
JAFAR: Jack up Any Feature at Any Resolution
Paul Couairon
Loick Chambon
Louis Serrano
Jean-Emmanuel Haugeard
Matthieu Cord
Nicolas Thome
MDE
498
6
0
10 Jun 2025
GoTrack: Generic 6DoF Object Pose Refinement and Tracking
GoTrack: Generic 6DoF Object Pose Refinement and Tracking
Van Nguyen Nguyen
Christian Forster
Sindi Shkodrani
Vincent Lepetit
Bugra Tekin
Cem Keskin
Tomás Hodan
VOT
196
1
0
08 Jun 2025
THU-Warwick Submission for EPIC-KITCHEN Challenge 2025: Semi-Supervised Video Object Segmentation
THU-Warwick Submission for EPIC-KITCHEN Challenge 2025: Semi-Supervised Video Object Segmentation
Mingqi Gao
Haoran Duan
Tianlu Zhang
Jungong Han
143
0
0
07 Jun 2025
Token Transforming: A Unified and Training-Free Token Compression Framework for Vision Transformer Acceleration
Token Transforming: A Unified and Training-Free Token Compression Framework for Vision Transformer Acceleration
Fanhu Zeng
Deli Yu
Zhenglun Kong
Hao Tang
ViT
166
6
0
06 Jun 2025
NTIRE 2025 Challenge on HR Depth from Images of Specular and Transparent Surfaces
NTIRE 2025 Challenge on HR Depth from Images of Specular and Transparent Surfaces
Pierluigi Zama Ramirez
Fabio Tosi
Luigi Di Stefano
Radu Timofte
Alex Costanzino
...
Jing Cao
Shenyi Li
Kui Jiang
Junjun Jiang
Y. Huang
MDE
247
25
0
06 Jun 2025
Deep Learning Reforms Image Matching: A Survey and Outlook
Shihua Zhang
Zizhuo Li
Kaining Zhang
Yifan Lu
Yuxin Deng
Linfeng Tang
Xingyu Jiang
Jiayi Ma
3DV
336
2
0
05 Jun 2025
Previous
12345...232425
Next