ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.13413
  4. Cited By
Vision Transformers for Dense Prediction

Vision Transformers for Dense Prediction

IEEE International Conference on Computer Vision (ICCV), 2021
24 March 2021
René Ranftl
Alexey Bochkovskiy
V. Koltun
    ViTMDE
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)Github (2138★)

Papers citing "Vision Transformers for Dense Prediction"

50 / 1,223 papers shown
JointSplat: Probabilistic Joint Flow-Depth Optimization for Sparse-View Gaussian Splatting
JointSplat: Probabilistic Joint Flow-Depth Optimization for Sparse-View Gaussian Splatting
Yang Xiao
Guoan Xu
Qiang Wu
Wenjing Jia
3DGS
238
1
0
04 Jun 2025
HumanRAM: Feed-forward Human Reconstruction and Animation Model using Transformers
HumanRAM: Feed-forward Human Reconstruction and Animation Model using Transformers
Zhiyuan Yu
Zhe Li
Hujun Bao
Can Yang
Xiaowei Zhou
3DH
254
3
0
03 Jun 2025
Generative Perception of Shape and Material from Differential Motion
Generative Perception of Shape and Material from Differential Motion
Xinran Nicole Han
Ko Nishino
T. Zickler
DiffMVGen
373
0
0
03 Jun 2025
Towards In-the-wild 3D Plane Reconstruction from a Single Image
Towards In-the-wild 3D Plane Reconstruction from a Single ImageComputer Vision and Pattern Recognition (CVPR), 2025
Jiachen Liu
Jingbo Xia
Sili Chen
Sharon X. Huang
Hengkai Guo
3DV
215
5
0
03 Jun 2025
SAB3R: Semantic-Augmented Backbone in 3D Reconstruction
SAB3R: Semantic-Augmented Backbone in 3D Reconstruction
Xuweiyi Chen
Tian Xia
Sihan Xu
Jianing Yang
Joyce Chai
Zezhou Cheng
314
2
0
02 Jun 2025
Rig3R: Rig-Aware Conditioning for Learned 3D Reconstruction
Rig3R: Rig-Aware Conditioning for Learned 3D Reconstruction
Samuel Li
Pujith Kachana
Prajwal Chidananda
Saurabh Nair
Yasutaka Furukawa
Matthew Brown
245
5
0
02 Jun 2025
Flying Co-Stereo: Enabling Long-Range Aerial Dense Mapping via Collaborative Stereo Vision of Dynamic-Baseline
Flying Co-Stereo: Enabling Long-Range Aerial Dense Mapping via Collaborative Stereo Vision of Dynamic-Baseline
Zhaoying Wang
Xingxing Zuo
Wei Dong
153
0
0
31 May 2025
UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation
UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation
Yang-tian Sun
Xin Yu
Zehuan Huang
Yi-Hua Huang
Yuan-Chen Guo
Ziyi Yang
Yan-Pei Cao
Xiaojuan Qi
DiffMVGenMDE
215
4
0
30 May 2025
MaskAdapt: Unsupervised Geometry-Aware Domain Adaptation Using Multimodal Contextual Learning and RGB-Depth Masking
MaskAdapt: Unsupervised Geometry-Aware Domain Adaptation Using Multimodal Contextual Learning and RGB-Depth Masking
Numair Nadeem
Muhammad Asad
Saeed Anwar
Abdul Bais
198
2
0
29 May 2025
SpatialSplat: Efficient Semantic 3D from Sparse Unposed Images
SpatialSplat: Efficient Semantic 3D from Sparse Unposed Images
Yu Sheng
Jiajun Deng
Xinran Zhang
Yu Zhang
Bei Hua
Yanyong Zhang
Jianmin Ji
3DGS
278
6
0
29 May 2025
Bridging Geometric and Semantic Foundation Models for Generalized Monocular Depth Estimation
Bridging Geometric and Semantic Foundation Models for Generalized Monocular Depth Estimation
Sanggyun Ma
Wonjoon Choi
Jihun Park
Jaeyeul Kim
Seunghun Lee
Jiwan Seo
S. Im
243
0
0
29 May 2025
Learning Fine-Grained Geometry for Sparse-View Splatting via Cascade Depth Loss
Learning Fine-Grained Geometry for Sparse-View Splatting via Cascade Depth Loss
Wenjun Lu
Haodong Chen
Anqi Yi
Yuk Ying Chung
Zhiyong Wang
Kun Hu
168
0
0
28 May 2025
RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination
RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination
Chong Zeng
Yue Dong
Pieter Peers
Hongzhi Wu
Xin Tong
189
5
0
28 May 2025
CAST: Contrastive Adaptation and Distillation for Semi-Supervised Instance Segmentation
CAST: Contrastive Adaptation and Distillation for Semi-Supervised Instance Segmentation
Pardis Taghavi
Tian Liu
Renjie Li
Reza Langari
Zhengzhong Tu
ISeg
503
0
0
28 May 2025
Styl3R: Instant 3D Stylized Reconstruction for Arbitrary Scenes and Styles
Styl3R: Instant 3D Stylized Reconstruction for Arbitrary Scenes and Styles
Peng Wang
Xiang Liu
Peidong Liu
238
2
0
27 May 2025
DepthMatch: Semi-Supervised RGB-D Scene Parsing through Depth-Guided Regularization
DepthMatch: Semi-Supervised RGB-D Scene Parsing through Depth-Guided RegularizationIEEE Signal Processing Letters (IEEE SPL), 2025
Jianxin Huang
Jiahang Li
S. Vityazev
Alexander Dvorkovich
Rui Fan
3DVMDE
250
4
0
26 May 2025
OmniGenBench: A Benchmark for Omnipotent Multimodal Generation across 50+ Tasks
OmniGenBench: A Benchmark for Omnipotent Multimodal Generation across 50+ Tasks
Jiayu Wang
Yang Jiao
Yue Yu
Tianwen Qian
Shaoxiang Chen
Yue Yu
Yu Jiang
MLLMLM&MAELM
259
0
0
24 May 2025
Semantic segmentation with reward
Semantic segmentation with reward
Xie Ting
Ye Huang
Zhilin Liu
Lixin Duan
506
0
0
23 May 2025
EMRA-proxy: Enhancing Multi-Class Region Semantic Segmentation in Remote Sensing Images with Attention Proxy
EMRA-proxy: Enhancing Multi-Class Region Semantic Segmentation in Remote Sensing Images with Attention Proxy
Yichun Yu
Yuqing Lan
Zhihuan Xing
Xiaoyi Yang
Tingyue Tang
Dan Yu
136
0
0
23 May 2025
MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models
MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation ModelsComputer Vision and Pattern Recognition (CVPR), 2025
Yifan Liu
Keyu Fan
Weihao Yu
Chenxin Li
Hao Lu
Yixuan Yuan
3DGSMDE
357
8
0
21 May 2025
Diving into the Fusion of Monocular Priors for Generalized Stereo Matching
Diving into the Fusion of Monocular Priors for Generalized Stereo Matching
Chengtang Yao
Lidong Yu
Zhidan Liu
Jiaxi Zeng
Yuwei Wu
Yunde Jia
MDE
373
3
0
20 May 2025
Intra-class Patch Swap for Self-Distillation
Intra-class Patch Swap for Self-Distillation
Hongjun Choi
Eun Som Jeon
Ankita Shukla
Pavan Turaga
283
0
0
20 May 2025
3D Visual Illusion Depth Estimation
3D Visual Illusion Depth Estimation
Chengtang Yao
Zhidan Liu
Jiaxi Zeng
Lidong Yu
Yuwei Wu
Yunde Jia
MDE
633
1
0
19 May 2025
VGGT-SLAM: Dense RGB SLAM Optimized on the SL(4) Manifold
VGGT-SLAM: Dense RGB SLAM Optimized on the SL(4) Manifold
Dominic Maggio
Hyungtae Lim
Luca Carlone
381
41
0
18 May 2025
Always Clear Depth: Robust Monocular Depth Estimation under Adverse Weather
Always Clear Depth: Robust Monocular Depth Estimation under Adverse WeatherInternational Joint Conference on Artificial Intelligence (IJCAI), 2025
Kui Jiang
Jing Cao
Zhaocheng Yu
Junjun Jiang
Jingchun Zhou
MDE
285
2
0
18 May 2025
FlowDreamer: A RGB-D World Model with Flow-based Motion Representations for Robot Manipulation
FlowDreamer: A RGB-D World Model with Flow-based Motion Representations for Robot Manipulation
Jun Guo
Xiaojian Ma
Yikai Wang
Min Yang
Huaping Liu
Qing Li
VGen
288
8
0
15 May 2025
FreeDriveRF: Monocular RGB Dynamic NeRF without Poses for Autonomous Driving via Point-Level Dynamic-Static Decoupling
FreeDriveRF: Monocular RGB Dynamic NeRF without Poses for Autonomous Driving via Point-Level Dynamic-Static DecouplingIEEE International Conference on Robotics and Automation (ICRA), 2025
Yue Wen
Liang Song
Yi Liu
Siting Zhu
Yanzi Miao
Lijun Han
Hesheng Wang
350
3
0
14 May 2025
Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image AnalysisIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Bingxin Ke
Kevin Qu
Tianfu Wang
Nando Metzger
Shengyu Huang
Bo Li
Anton Obukhov
Konrad Schindler
DiffMVLM
378
31
0
14 May 2025
MELLM: A Flow-Guided Large Language Model for Micro-Expression Understanding
MELLM: A Flow-Guided Large Language Model for Micro-Expression Understanding
Zhengye Zhang
Zhengye Zhang
Shifeng Liu
Xinglong Mao
Xinglong Mao
Tong Xu
Tong Xu
Enhong Chen
MLLM
388
3
0
11 May 2025
Camera-Only Bird's Eye View Perception: A Neural Approach to LiDAR-Free Environmental Mapping for Autonomous Vehicles
Camera-Only Bird's Eye View Perception: A Neural Approach to LiDAR-Free Environmental Mapping for Autonomous Vehicles
Anupkumar Bochare
105
0
0
09 May 2025
DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion
DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint DiffusionComputer Vision and Pattern Recognition (CVPR), 2025
Qitao Zhao
Amy Lin
Jeff Tan
Jason Y. Zhang
Deva Ramanan
Shubham Tulsiani
VGen
402
7
0
08 May 2025
VGLD: Visually-Guided Linguistic Disambiguation for Monocular Depth Scale Recovery
VGLD: Visually-Guided Linguistic Disambiguation for Monocular Depth Scale Recovery
Bojin Wu
Jing Chen
MDE
531
0
0
05 May 2025
Pixel3DMM: Versatile Screen-Space Priors for Single-Image 3D Face Reconstruction
Pixel3DMM: Versatile Screen-Space Priors for Single-Image 3D Face Reconstruction
Simon Giebenhain
Tobias Kirschstein
Martin Rünz
Lourdes Agapito
Matthias Nießner
CVBM3DH
376
7
0
01 May 2025
JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers
JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers
Kwon Byung-Ki
Jingdong Sun
Lee Hyoseok
Chong Luo
Tae-Hyun Oh
652
4
0
01 May 2025
Adept: Annotation-Denoising Auxiliary Tasks with Discrete Cosine Transform Map and Keypoint for Human-Centric Pretraining
Adept: Annotation-Denoising Auxiliary Tasks with Discrete Cosine Transform Map and Keypoint for Human-Centric Pretraining
Xun Guo
Yunfeng Yan
Weizhen He
Yiheng Deng
Yangyang Zhong
Pengxin Luo
Donglian Qi
VLM
415
1
0
29 Apr 2025
Joint Optimization of Neural Radiance Fields and Continuous Camera Motion from a Monocular Video
Joint Optimization of Neural Radiance Fields and Continuous Camera Motion from a Monocular VideoComputer Vision and Pattern Recognition (CVPR), 2025
Hoang Chuong Nguyen
Wei Mao
Jose M. Alvarez
Miaomiao Liu
234
0
0
28 Apr 2025
Category-Level and Open-Set Object Pose Estimation for Robotics
Category-Level and Open-Set Object Pose Estimation for Robotics
Peter Honig
Matthias Hirschmanner
Markus Vincze
200
0
0
28 Apr 2025
Leveraging Multi-Modal Saliency and Fusion for Gaze Target Detection
Leveraging Multi-Modal Saliency and Fusion for Gaze Target Detection
Athul M. Mathew
Arshad Ali Khan
Thariq Khalid
Faroq AL-Tam
R. Souissi
400
2
0
27 Apr 2025
Examining the Impact of Optical Aberrations to Image Classification and Object Detection Models
Examining the Impact of Optical Aberrations to Image Classification and Object Detection ModelsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Patrick Müller
Alexander Braun
Margret Keuper
278
2
0
25 Apr 2025
The Fourth Monocular Depth Estimation Challenge
The Fourth Monocular Depth Estimation Challenge
Anton Obukhov
Matteo Poggi
Fabio Tosi
Ripudaman Singh Arora
Jaime Spencer
...
Tuan-Anh Yang
Minh-Quang Nguyen
T. Tran
Albert Luginov
Muhammad Shahzad
MDE
971
4
0
24 Apr 2025
Federated EndoViT: Pretraining Vision Transformers via Federated Learning on Endoscopic Image Collections
Federated EndoViT: Pretraining Vision Transformers via Federated Learning on Endoscopic Image Collections
Max Kirchner
Alexander C. Jenke
S. Bodenstedt
Fiona Kolbinger
Oliver Saldanha
Jakob N. Kather
M. Wagner
Stefanie Speidel
FedMLMedIm
369
4
0
23 Apr 2025
SmallGS: Gaussian Splatting-based Camera Pose Estimation for Small-Baseline Videos
SmallGS: Gaussian Splatting-based Camera Pose Estimation for Small-Baseline Videos
Yuxin Yao
Yan Zhang
Zhening Huang
Joan Lasenby
3DGS
316
2
0
22 Apr 2025
Landmark-Free Preoperative-to-Intraoperative Registration in Laparoscopic Liver Resection
Landmark-Free Preoperative-to-Intraoperative Registration in Laparoscopic Liver ResectionIEEE Transactions on Medical Imaging (IEEE TMI), 2025
Jun Zhou
Bingchen Gao
Kai Wang
Jialun Pei
Pheng-Ann Heng
Jing Qin
MedIm
346
3
0
21 Apr 2025
VistaDepth: Improving far-range Depth Estimation with Spectral Modulation and Adaptive Reweighting
VistaDepth: Improving far-range Depth Estimation with Spectral Modulation and Adaptive Reweighting
Mingxia Zhan
Li Zhang
Xiaomeng Chu
Beibei Wang
Yanyong Zhang
Yanyong Zhang
MDE
574
0
0
21 Apr 2025
Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction
Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction
Weirong Chen
Ganlin Zhang
Felix Wimbauer
Rui Wang
Nikita Araslanov
Andrea Vedaldi
Daniel Cremers
333
10
0
20 Apr 2025
PRISM: A Unified Framework for Photorealistic Reconstruction and Intrinsic Scene Modeling
PRISM: A Unified Framework for Photorealistic Reconstruction and Intrinsic Scene Modeling
Alara Dirik
Tuanfeng Y. Wang
Duygu Ceylan
Stefanos Zafeiriou
Anna Frühstück
DiffM
242
5
0
19 Apr 2025
Visual Consensus Prompting for Co-Salient Object Detection
Visual Consensus Prompting for Co-Salient Object DetectionComputer Vision and Pattern Recognition (CVPR), 2025
Jinqiao Wang
Nana Yu
Zihao Zhang
Yahong Han
219
2
0
19 Apr 2025
Mono3R: Exploiting Monocular Cues for Geometric 3D Reconstruction
Mono3R: Exploiting Monocular Cues for Geometric 3D Reconstruction
Wenyu Li
Sidun Liu
Peng Qiao
Yong Dou
352
3
0
18 Apr 2025
LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models
LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models
Haiwen Huang
Anpei Chen
Volodymyr Havrylov
Andreas Geiger
Dan Zhang
215
9
0
18 Apr 2025
St4RTrack: Simultaneous 4D Reconstruction and Tracking in the World
St4RTrack: Simultaneous 4D Reconstruction and Tracking in the World
Haiwen Feng
Junyi Zhang
Qianqian Wang
Yufei Ye
Pengcheng Yu
Michael J. Black
Trevor Darrell
Angjoo Kanazawa
VGen3DV
328
38
0
17 Apr 2025
Previous
123456...232425
Next