ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.13413
  4. Cited By
Vision Transformers for Dense Prediction

Vision Transformers for Dense Prediction

IEEE International Conference on Computer Vision (ICCV), 2021
24 March 2021
René Ranftl
Alexey Bochkovskiy
V. Koltun
    ViTMDE
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)Github (2138★)

Papers citing "Vision Transformers for Dense Prediction"

50 / 1,223 papers shown
Title
StereoAdapter: Adapting Stereo Depth Estimation to Underwater Scenes
StereoAdapter: Adapting Stereo Depth Estimation to Underwater Scenes
Zhengri Wu
Yiran Wang
Yu Wen
Zeyu Zhang
Biao Wu
Hao Tang
MDE
198
3
0
19 Sep 2025
DACoN: DINO for Anime Paint Bucket Colorization with Any Number of Reference Images
DACoN: DINO for Anime Paint Bucket Colorization with Any Number of Reference Images
Kazuma Nagata
Naoshi Kaneko
DiffM
200
0
0
18 Sep 2025
Depth AnyEvent: A Cross-Modal Distillation Paradigm for Event-Based Monocular Depth Estimation
Depth AnyEvent: A Cross-Modal Distillation Paradigm for Event-Based Monocular Depth Estimation
Luca Bartolomei
Enrico Mannocci
Fabio Tosi
Matteo Poggi
S. Mattoccia
MDEVGen
149
0
0
18 Sep 2025
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Nikhil Varma Keetha
Norman Muller
Johannes Schönberger
Lorenzo Porzi
Yuchen Zhang
...
Samuel Rota Buló
Christian Richardt
Deva Ramanan
Sebastian A. Scherer
Peter Kontschieder
256
47
0
16 Sep 2025
Towards Foundational Models for Single-Chip Radar
Towards Foundational Models for Single-Chip Radar
Tianshu Huang
Akarsh Prabhakara
Chuhan Chen
Jay Karhade
Deva Ramanan
Matthew O'Toole
Anthony G. Rowe
145
1
0
15 Sep 2025
UnLoc: Leveraging Depth Uncertainties for Floorplan Localization
UnLoc: Leveraging Depth Uncertainties for Floorplan Localization
Matthias Wüest
Francis Engelmann
O. Mikšík
Marc Pollefeys
Dániel Baráth
120
1
0
14 Sep 2025
AD-GS: Alternating Densification for Sparse-Input 3D Gaussian Splatting
AD-GS: Alternating Densification for Sparse-Input 3D Gaussian Splatting
Gurutva Patle
Nilay Girgaonkar
Nagabhushan Somraj
R. Soundararajan
3DGS
139
0
0
13 Sep 2025
LayerLock: Non-collapsing Representation Learning with Progressive Freezing
LayerLock: Non-collapsing Representation Learning with Progressive Freezing
Goker Erdogan
Nikhil Parthasarathy
Catalin Ionescu
Drew A. Hudson
Alexander Lerchner
Andrew Zisserman
Mehdi S. M. Sajjadi
João Carreira
112
0
0
12 Sep 2025
Loc$^2$: Interpretable Cross-View Localization via Depth-Lifted Local Feature Matching
Loc2^22: Interpretable Cross-View Localization via Depth-Lifted Local Feature Matching
Zimin Xia
Chenghao Xu
Alexandre Alahi
MDE
232
0
0
11 Sep 2025
PanoLAM: Large Avatar Model for Gaussian Full-Head Synthesis from One-shot Unposed Image
PanoLAM: Large Avatar Model for Gaussian Full-Head Synthesis from One-shot Unposed Image
Peng Li
Yisheng He
Yingdong Hu
Yuan Dong
Weihao Yuan
Yuan Liu
Siyu Zhu
Yike Guo
Z. Dong
Wenhan Luo
3DGS
223
0
0
09 Sep 2025
Faster VGGT with Block-Sparse Global Attention
Faster VGGT with Block-Sparse Global Attention
Chung-Shien Brian Wang
Christian Schmidt
Jens Piekenbrinck
Bastian Leibe
ViT
110
6
0
08 Sep 2025
JRN-Geo: A Joint Perception Network based on RGB and Normal images for Cross-view Geo-localization
JRN-Geo: A Joint Perception Network based on RGB and Normal images for Cross-view Geo-localizationIEEE International Conference on Robotics and Automation (ICRA), 2025
Hongyu Zhou
Y. Zhang
Tingsong Huang
Fawei Ge
Man Qi
X. Zhang
Y. Zhang
104
0
0
06 Sep 2025
FlowSeek: Optical Flow Made Easier with Depth Foundation Models and Motion Bases
FlowSeek: Optical Flow Made Easier with Depth Foundation Models and Motion Bases
Matteo Poggi
Fabio Tosi
100
2
0
05 Sep 2025
From Editor to Dense Geometry Estimator
From Editor to Dense Geometry Estimator
Jiyuan Wang
Chunyu Lin
Lei-huan Sun
Rongying Liu
Lang Nie
Mingxing Li
K. Liao
Xiangxiang Chu
Yao-Min Zhao
DiffMMDE
202
6
0
04 Sep 2025
DUViN: Diffusion-Based Underwater Visual Navigation via Knowledge-Transferred Depth Features
DUViN: Diffusion-Based Underwater Visual Navigation via Knowledge-Transferred Depth Features
Jinghe Yang
Minh-Quan Le
Mingming Gong
Ye Pu
112
1
0
03 Sep 2025
Manipulation as in Simulation: Enabling Accurate Geometry Perception in Robots
Manipulation as in Simulation: Enabling Accurate Geometry Perception in Robots
Minghuan Liu
Zhengbang Zhu
Xiaoshen Han
Peng Hu
Haotong Lin
...
Xinghang Li
Yong Yu
Weinan Zhang
Tao Kong
Bingyi Kang
112
2
0
02 Sep 2025
RiverScope: High-Resolution River Masking Dataset
RiverScope: High-Resolution River Masking Dataset
Rangel Daroya
Taylor Rowley
Jonathan Flores
Elisa Friedmann
Fiona Bennitt
...
Thomas E. Howard
Yanqi Ye
Audrey Turcotte
Colin J. Gleason
Subhransu Maji
88
1
0
02 Sep 2025
ViSTA-SLAM: Visual SLAM with Symmetric Two-view Association
ViSTA-SLAM: Visual SLAM with Symmetric Two-view Association
Ganlin Zhang
Shenhan Qian
Xi Wang
Daniel Cremers
112
3
0
01 Sep 2025
ER-LoRA: Effective-Rank Guided Adaptation for Weather-Generalized Depth Estimation
ER-LoRA: Effective-Rank Guided Adaptation for Weather-Generalized Depth Estimation
Weilong Yan
Xin Zhang
Robby T. Tan
MDE
243
0
0
31 Aug 2025
SegDINO: An Efficient Design for Medical and Natural Image Segmentation with DINO-V3
SegDINO: An Efficient Design for Medical and Natural Image Segmentation with DINO-V3
Sicheng Yang
Hongqiu Wang
Zhaohu Xing
Sixiang Chen
Lei Zhu
188
3
0
31 Aug 2025
FastAvatar: Towards Unified Fast High-Fidelity 3D Avatar Reconstruction with Large Gaussian Reconstruction Transformers
FastAvatar: Towards Unified Fast High-Fidelity 3D Avatar Reconstruction with Large Gaussian Reconstruction Transformers
Yue Wu
Yufan Wu
Wen Li
Y. Lu
Kairui Feng
Xuanhong Chen
3DGS
68
1
0
27 Aug 2025
SAIL-Recon: Large SfM by Augmenting Scene Regression with Localization
SAIL-Recon: Large SfM by Augmenting Scene Regression with Localization
Junyuan Deng
Heng Li
Tao Xie
Weiqiang Ren
Qian Zhang
Ping Tan
Xiaoyang Guo
ViT
80
2
0
25 Aug 2025
HAMSt3R: Human-Aware Multi-view Stereo 3D Reconstruction
HAMSt3R: Human-Aware Multi-view Stereo 3D Reconstruction
Sara Rojas
M. Armando
Bernard Ghamen
Philippe Weinzaepfel
Vincent Leroy
Grégory Rogez
3DH
65
3
0
22 Aug 2025
SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse Weather
SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse WeatherEuropean Conference on Computer Vision (ECCV), 2025
Edoardo Palladin
Roland Dietze
Praveen Narayanan
Mario Bijelic
Felix Heide
172
7
0
22 Aug 2025
Representation Learning with Adaptive Superpixel Coding
Representation Learning with Adaptive Superpixel Coding
Mahmoud Khalil
Ahmad Khalil
A. Ngom
ViT
108
0
0
21 Aug 2025
SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass
SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass
Yanxu Meng
Haoning Wu
Ya Zhang
Weidi Xie
VGen
352
8
0
21 Aug 2025
Snap-Snap: Taking Two Images to Reconstruct 3D Human Gaussians in Milliseconds
Snap-Snap: Taking Two Images to Reconstruct 3D Human Gaussians in Milliseconds
Jia Lu
Taoran Yi
Jiemin Fang
Chen-Ning Yang
Chuiyun Wu
Wei Shen
Wenyu Liu
Qi Tian
X. Wang
3DH3DGS
145
1
0
20 Aug 2025
GasTwinFormer: A Hybrid Vision Transformer for Livestock Methane Emission Segmentation and Dietary Classification in Optical Gas Imaging
GasTwinFormer: A Hybrid Vision Transformer for Livestock Methane Emission Segmentation and Dietary Classification in Optical Gas Imaging
Toqi Tahamid Sarker
M. Embaby
Taminul Islam
A. AbuGhazaleh
Khaled R Ahmed
68
0
0
20 Aug 2025
ROVR-Open-Dataset: A Large-Scale Depth Dataset for Autonomous Driving
ROVR-Open-Dataset: A Large-Scale Depth Dataset for Autonomous Driving
Xianda Guo
Ruijun Zhang
Yiqun Duan
Ruilin Wang
Matteo Poggi
...
Mike Horton
Yuan Si
Long Chen
Hao Zhao
Long Chen
3DVMDE
230
0
0
19 Aug 2025
PhysGM: Large Physical Gaussian Model for Feed-Forward 4D Synthesis
PhysGM: Large Physical Gaussian Model for Feed-Forward 4D Synthesis
Chunji Lv
Zequn Chen
Donglin Di
Weinan Zhang
Hao Li
Wei Chen
Changsheng Li
Changsheng Li
3DGSAI4CE
224
1
0
19 Aug 2025
Local Scale Equivariance with Latent Deep Equilibrium Canonicalizer
Local Scale Equivariance with Latent Deep Equilibrium Canonicalizer
Md Ashiqur Rahman
Chiao-An Yang
Michael N. Cheng
Lim Jun Hao
Jeremiah Jiang
Teck-Yian Lim
Raymond A. Yeh
BDL
136
0
0
19 Aug 2025
Online 3D Gaussian Splatting Modeling with Novel View Selection
Online 3D Gaussian Splatting Modeling with Novel View SelectionInternational Joint Conference on Artificial Intelligence (IJCAI), 2025
Byeonggwon Lee
Junkyu Park
Khang Truong Giang
Soohwan Song
3DGS3DV
173
1
0
19 Aug 2025
STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer
STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer
Yushi Lan
Yihang Luo
Fangzhou Hong
Shangchen Zhou
Honghua Chen
Zhaoyang Lyu
Shuai Yang
Bo Dai
Chen Change Loy
Xingang Pan
ViT3DPC3DGS
141
11
0
14 Aug 2025
Revisiting Efficient Semantic Segmentation: Learning Offsets for Better Spatial and Class Feature Alignment
Revisiting Efficient Semantic Segmentation: Learning Offsets for Better Spatial and Class Feature Alignment
Shi-Chen Zhang
Yunheng Li
Yu-Huan Wu
Qibin Hou
Ming-Ming Cheng
SSeg
184
1
0
12 Aug 2025
TRIDE: A Text-assisted Radar-Image weather-aware fusion network for Depth Estimation
TRIDE: A Text-assisted Radar-Image weather-aware fusion network for Depth Estimation
Huawei Sun
Zixu Wang
Hao Feng
Julius Ott
Lorenzo Servadei
Robert Wille
132
1
0
11 Aug 2025
Mem4D: Decoupling Static and Dynamic Memory for Dynamic Scene Reconstruction
Mem4D: Decoupling Static and Dynamic Memory for Dynamic Scene Reconstruction
Xudong Cai
S. Wang
Peng Wang
Yongcai Wang
Zhaoxin Fan
Zhaoxin Fan
T. Zhang
Jianrong Tao
Yeying Jin
Deying Li
177
2
0
11 Aug 2025
Learning an Implicit Physics Model for Image-based Fluid Simulation
Learning an Implicit Physics Model for Image-based Fluid Simulation
Emily Yue-Ting Jia
Jiageng Mao
Zhiyuan Gao
Yajie Zhao
Yue Wang
3DHVGenAI4CE
61
0
0
11 Aug 2025
Matrix-3D: Omnidirectional Explorable 3D World Generation
Matrix-3D: Omnidirectional Explorable 3D World Generation
Zhongqi Yang
Wenhang Ge
Yuqi Li
J. Chen
Haoyuan Li
...
Eric Li
Yang Liu
Yikai Wang
Hao Guo
Yahui Zhou
VGen
107
9
0
11 Aug 2025
VesselRW: Weakly Supervised Subcutaneous Vessel Segmentation via Learned Random Walk Propagation
VesselRW: Weakly Supervised Subcutaneous Vessel Segmentation via Learned Random Walk Propagation
Ayaan Nooruddin Siddiqui
Mahnoor Zaidi
Ayesha Nazneen Shahbaz
Priyadarshini Chatterjee
Krishnan Menon Iyer
241
0
0
09 Aug 2025
Edge Detection for Organ Boundaries via Top Down Refinement and SubPixel Upsampling
Edge Detection for Organ Boundaries via Top Down Refinement and SubPixel Upsampling
Aarav Mehta
Priya Deshmukh
Vikram Singh
Siddharth Malhotra
Krishnan Menon Iyer
Tanvi Iyer
MedIm
252
0
0
09 Aug 2025
DualResolution Residual Architecture with Artifact Suppression for Melanocytic Lesion Segmentation
DualResolution Residual Architecture with Artifact Suppression for Melanocytic Lesion Segmentation
Vikram Singh
Kabir Malhotra
Rohan Desai
Ananya Shankaracharya
Priyadarshini Chatterjee
Krishnan Menon Iyer
MedIm
280
0
0
09 Aug 2025
CF3: Compact and Fast 3D Feature Fields
CF3: Compact and Fast 3D Feature Fields
Hyunjoon Lee
Joonkyu Min
Jaesik Park
3DGS
173
1
0
07 Aug 2025
EndoMatcher: Generalizable Endoscopic Image Matcher via Multi-Domain Pre-training for Robot-Assisted Surgery
EndoMatcher: Generalizable Endoscopic Image Matcher via Multi-Domain Pre-training for Robot-Assisted Surgery
Bingyu Yang
Qingyao Tian
Yimeng Geng
Huai Liao
Xinyan Huang
Jiebo Luo
Hongbin Liu
MedIm
61
0
0
07 Aug 2025
AR as an Evaluation Playground: Bridging Metrics and Visual Perception of Computer Vision Models
AR as an Evaluation Playground: Bridging Metrics and Visual Perception of Computer Vision Models
Ashkan Ganj
Yiqin Zhao
Tian Guo
84
0
0
06 Aug 2025
DET-GS: Depth- and Edge-Aware Regularization for High-Fidelity 3D Gaussian Splatting
DET-GS: Depth- and Edge-Aware Regularization for High-Fidelity 3D Gaussian Splatting
Zexu Huang
Min Xu
Stuart Perry
3DGS
159
0
0
06 Aug 2025
Extending Foundational Monocular Depth Estimators to Fisheye Cameras with Calibration Tokens
Extending Foundational Monocular Depth Estimators to Fisheye Cameras with Calibration Tokens
Suchisrit Gangopadhyay
Jung-Hee Kim
Xien Chen
Patrick Rim
Hyoungseob Park
Alex Wong
MDE
309
1
0
06 Aug 2025
Monocular Depth Estimation with Global-Aware Discretization and Local Context Modeling
Monocular Depth Estimation with Global-Aware Discretization and Local Context Modeling
Heng Wu
Qian Zhang
Guixu Zhang
MDE
152
0
0
05 Aug 2025
Uni3R: Unified 3D Reconstruction and Semantic Understanding via Generalizable Gaussian Splatting from Unposed Multi-View Images
Uni3R: Unified 3D Reconstruction and Semantic Understanding via Generalizable Gaussian Splatting from Unposed Multi-View Images
Xiangyu Sun
Haoyi Jiang
Liu Liu
Seungtae Nam
Gyeongjin Kang
...
Wei Sui
Zhizhong Su
Wenyu Liu
Xinggang Wang
Eunbyung Park
3DGS
266
5
0
05 Aug 2025
Deeply Dual Supervised learning for melanoma recognition
Deeply Dual Supervised learning for melanoma recognition
Rujosh Polma
Krishnan Menon Iyer
207
0
0
04 Aug 2025
Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images
Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images
Philipp Wulff
Felix Wimbauer
Dominik Muhle
Daniel Cremers
MDE
144
0
0
04 Aug 2025
Previous
123456...232425
Next