ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2201.00520
  4. Cited By
Vision Transformer with Deformable Attention

Vision Transformer with Deformable Attention

3 January 2022
Zhuofan Xia
Xuran Pan
S. Song
Li Erran Li
Gao Huang
    ViT
ArXivPDFHTML

Papers citing "Vision Transformer with Deformable Attention"

50 / 50 papers shown
Title
QiMeng-Xpiler: Transcompiling Tensor Programs for Deep Learning Systems with a Neural-Symbolic Approach
QiMeng-Xpiler: Transcompiling Tensor Programs for Deep Learning Systems with a Neural-Symbolic Approach
Shouyang Dong
Yuanbo Wen
Jun Bi
Di Huang
Jiaming Guo
...
Yifan Hao
Xuehai Zhou
Tianshi Chen
Qi Guo
Yunji Chen
20
0
0
04 May 2025
SC3EF: A Joint Self-Correlation and Cross-Correspondence Estimation Framework for Visible and Thermal Image Registration
SC3EF: A Joint Self-Correlation and Cross-Correspondence Estimation Framework for Visible and Thermal Image Registration
Xi Tong
Xing Luo
Jiangxin Yang
Yanpeng Cao
24
0
0
17 Apr 2025
FASR-Net: Unsupervised Shadow Removal Leveraging Inherent Frequency Priors
FASR-Net: Unsupervised Shadow Removal Leveraging Inherent Frequency Priors
Tao Lin
Qingwang Wang
Qiwei Liang
Minghua Tang
Yuxuan Sun
CVBM
52
0
0
08 Apr 2025
MamBEV: Enabling State Space Models to Learn Birds-Eye-View Representations
MamBEV: Enabling State Space Models to Learn Birds-Eye-View Representations
Hongyu Ke
Jack Morris
K. Oguchi
Xiaofei Cao
Yongkang Liu
Haoxin Wang
Yi Ding
Mamba
71
0
0
18 Mar 2025
Geometric Distortion Guided Transformer for Omnidirectional Image Super-Resolution
Geometric Distortion Guided Transformer for Omnidirectional Image Super-Resolution
Cuixin Yang
Rongkang Dong
Jun Xiao
Cong Zhang
Kin-Man Lam
Fei Zhou
Guoping Qiu
81
1
0
17 Jan 2025
Breaking the Low-Rank Dilemma of Linear Attention
Breaking the Low-Rank Dilemma of Linear Attention
Qihang Fan
Huaibo Huang
Ran He
28
0
0
12 Nov 2024
Towards Global Localization using Multi-Modal Object-Instance Re-Identification
Towards Global Localization using Multi-Modal Object-Instance Re-Identification
Aneesh Chavan
Vaibhav Agrawal
Vineeth Bhat
Sarthak Chittawar
Siddharth Srivastava
Chetan Arora
K. M. Krishna
80
0
0
18 Sep 2024
MambaST: A Plug-and-Play Cross-Spectral Spatial-Temporal Fuser for
  Efficient Pedestrian Detection
MambaST: A Plug-and-Play Cross-Spectral Spatial-Temporal Fuser for Efficient Pedestrian Detection
Xiangbo Gao
A. Kanu-Asiegbu
Xiaoxiao Du
Mamba
26
0
0
02 Aug 2024
SACNet: A Spatially Adaptive Convolution Network for 2D Multi-organ Medical Segmentation
SACNet: A Spatially Adaptive Convolution Network for 2D Multi-organ Medical Segmentation
Lin Zhang
Wenbo Gao
Jie Yi
Yunyun Yang
38
0
0
14 Jul 2024
DeformTime: Capturing Variable Dependencies with Deformable Attention for Time Series Forecasting
DeformTime: Capturing Variable Dependencies with Deformable Attention for Time Series Forecasting
Yuxuan Shu
Vasileios Lampos
AI4TS
AI4CE
58
0
0
11 Jun 2024
TwinS: Revisiting Non-Stationarity in Multivariate Time Series
  Forecasting
TwinS: Revisiting Non-Stationarity in Multivariate Time Series Forecasting
Jiaxi Hu
Qingsong Wen
Sijie Ruan
Li Liu
Yuxuan Liang
AI4TS
28
5
0
06 Jun 2024
Sharing Key Semantics in Transformer Makes Efficient Image Restoration
Sharing Key Semantics in Transformer Makes Efficient Image Restoration
Bin Ren
Yawei Li
Jingyun Liang
Rakesh Ranjan
Mengyuan Liu
Rita Cucchiara
Luc Van Gool
Ming-Hsuan Yang
N. Sebe
30
3
0
30 May 2024
Vision Transformer with Sparse Scan Prior
Vision Transformer with Sparse Scan Prior
Qihang Fan
Huaibo Huang
Mingrui Chen
Ran He
ViT
36
5
0
22 May 2024
DeVOS: Flow-Guided Deformable Transformer for Video Object Segmentation
DeVOS: Flow-Guided Deformable Transformer for Video Object Segmentation
Volodymyr Fedynyak
Yaroslav Romanus
Bohdan Hlovatskyi
Bohdan Sydor
Oles Dobosevych
Igor Babin
Roman Riazantsev
VOS
40
3
0
11 May 2024
Enhancing Efficiency in Vision Transformer Networks: Design Techniques
  and Insights
Enhancing Efficiency in Vision Transformer Networks: Design Techniques and Insights
Moein Heidari
Reza Azad
Sina Ghorbani Kolahi
René Arimond
Leon Niggemeier
...
Afshin Bozorgpour
Ehsan Khodapanah Aghdam
A. Kazerouni
I. Hacihaliloglu
Dorit Merhof
41
7
0
28 Mar 2024
Multi-Human Mesh Recovery with Transformers
Multi-Human Mesh Recovery with Transformers
Zeyu Wang
Zhenzhen Weng
Serena Yeung-Levy
3DH
27
1
0
26 Feb 2024
FViT: A Focal Vision Transformer with Gabor Filter
FViT: A Focal Vision Transformer with Gabor Filter
Yulong Shi
Mingwei Sun
Yongshuai Wang
Rui Wang
47
4
0
17 Feb 2024
Self-supervised Video Object Segmentation with Distillation Learning of
  Deformable Attention
Self-supervised Video Object Segmentation with Distillation Learning of Deformable Attention
Quang-Trung Truong
Duc Thanh Nguyen
Binh-Son Hua
Sai-Kit Yeung
VOS
34
1
0
25 Jan 2024
SCHEME: Scalable Channel Mixer for Vision Transformers
SCHEME: Scalable Channel Mixer for Vision Transformers
Deepak Sridhar
Yunsheng Li
Nuno Vasconcelos
18
0
0
01 Dec 2023
EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention
EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention
Yulong Shi
Mingwei Sun
Yongshuai Wang
Hui Sun
Zengqiang Chen
29
4
0
10 Oct 2023
Low-Resolution Self-Attention for Semantic Segmentation
Low-Resolution Self-Attention for Semantic Segmentation
Yu-Huan Wu
Shi-Chen Zhang
Yun-Hai Liu
Le Zhang
Xin Zhan
Daquan Zhou
Jiashi Feng
Ming-Ming Cheng
Liangli Zhen
ViT
32
3
0
08 Oct 2023
Addressing Data Misalignment in Image-LiDAR Fusion on Point Cloud
  Segmentation
Addressing Data Misalignment in Image-LiDAR Fusion on Point Cloud Segmentation
Wei-Jong Yang
Guan-Cheng Lee
11
0
0
26 Sep 2023
UniBEV: Multi-modal 3D Object Detection with Uniform BEV Encoders for
  Robustness against Missing Sensor Modalities
UniBEV: Multi-modal 3D Object Detection with Uniform BEV Encoders for Robustness against Missing Sensor Modalities
Shiming Wang
Holger Caesar
Liangliang Nan
Julian F. P. Kooij
38
11
0
25 Sep 2023
3D Transformer based on deformable patch location for differential
  diagnosis between Alzheimer's disease and Frontotemporal dementia
3D Transformer based on deformable patch location for differential diagnosis between Alzheimer's disease and Frontotemporal dementia
H. Nguyen
Michael Clement
Boris Mansencal
Pierrick Coupé
MedIm
21
0
0
06 Sep 2023
UGCANet: A Unified Global Context-Aware Transformer-based Network with
  Feature Alignment for Endoscopic Image Analysis
UGCANet: A Unified Global Context-Aware Transformer-based Network with Feature Alignment for Endoscopic Image Analysis
Pham Vu Hung
N. Manh
Nguyen Thi Oanh
N. T. Thuy
D. V. Sang
ViT
MedIm
19
3
0
12 Jul 2023
Lightweight Vision Transformer with Bidirectional Interaction
Lightweight Vision Transformer with Bidirectional Interaction
Qihang Fan
Huaibo Huang
Xiaoqiang Zhou
Ran He
ViT
29
27
0
01 Jun 2023
DarSwin: Distortion Aware Radial Swin Transformer
DarSwin: Distortion Aware Radial Swin Transformer
Akshay Athwale
Ichrak Shili
Émile Bergeron
Arman Afrasiyabi
Justin Lague
Ola Ahmad
Jean-François Lalonde
21
6
0
19 Apr 2023
Revisiting the Evaluation of Image Synthesis with GANs
Revisiting the Evaluation of Image Synthesis with GANs
Mengping Yang
Ceyuan Yang
Yichi Zhang
Qingyan Bai
Yujun Shen
Bo Dai
EGVM
17
7
0
04 Apr 2023
Vision Transformer with Quadrangle Attention
Vision Transformer with Quadrangle Attention
Qiming Zhang
Jing Zhang
Yufei Xu
Dacheng Tao
ViT
19
38
0
27 Mar 2023
Grid-Centric Traffic Scenario Perception for Autonomous Driving: A
  Comprehensive Review
Grid-Centric Traffic Scenario Perception for Autonomous Driving: A Comprehensive Review
Yining Shi
Kun Jiang
Jiusi Li
Zelin Qian
Jun Wen
Mengmeng Yang
Ke Wang
Diange Yang
76
25
0
02 Mar 2023
Efficiency 360: Efficient Vision Transformers
Efficiency 360: Efficient Vision Transformers
Badri N. Patro
Vijay Srinivas Agneeswaran
19
6
0
16 Feb 2023
FGAHOI: Fine-Grained Anchors for Human-Object Interaction Detection
FGAHOI: Fine-Grained Anchors for Human-Object Interaction Detection
Shuailei Ma
Yuefeng Wang
Shanze Wang
Ying-yu Wei
24
33
0
08 Jan 2023
Rethinking Vision Transformers for MobileNet Size and Speed
Rethinking Vision Transformers for MobileNet Size and Speed
Yanyu Li
Ju Hu
Yang Wen
Georgios Evangelidis
Kamyar Salahi
Yanzhi Wang
Sergey Tulyakov
Jian Ren
ViT
21
157
0
15 Dec 2022
GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group
  Propagation
GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation
Chenhongyi Yang
Jiarui Xu
Shalini De Mello
Elliot J. Crowley
X. Wang
ViT
27
21
0
13 Dec 2022
ROIFormer: Semantic-Aware Region of Interest Transformer for Efficient
  Self-Supervised Monocular Depth Estimation
ROIFormer: Semantic-Aware Region of Interest Transformer for Efficient Self-Supervised Monocular Depth Estimation
Daitao Xing
Jinglin Shen
C. Ho
Anthony Tzes
ViT
MDE
14
4
0
12 Dec 2022
Cross-Modal Learning with 3D Deformable Attention for Action Recognition
Cross-Modal Learning with 3D Deformable Attention for Action Recognition
Sangwon Kim
Dasom Ahn
ByoungChul Ko
ViT
3DPC
25
24
0
12 Dec 2022
Peeling the Onion: Hierarchical Reduction of Data Redundancy for
  Efficient Vision Transformer Training
Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training
Zhenglun Kong
Haoyu Ma
Geng Yuan
Mengshu Sun
Yanyue Xie
...
Tianlong Chen
Xiaolong Ma
Xiaohui Xie
Zhangyang Wang
Yanzhi Wang
ViT
26
22
0
19 Nov 2022
Dynamic Graph Message Passing Networks for Visual Recognition
Dynamic Graph Message Passing Networks for Visual Recognition
Li Zhang
Mohan Chen
Anurag Arnab
Xiangyang Xue
Philip H. S. Torr
GNN
18
1
0
20 Sep 2022
Doubly Deformable Aggregation of Covariance Matrices for Few-shot
  Segmentation
Doubly Deformable Aggregation of Covariance Matrices for Few-shot Segmentation
Zhitong Xiong
Haopeng Li
Xiao Xiang Zhu
22
35
0
30 Jul 2022
Behind Every Domain There is a Shift: Adapting Distortion-aware Vision
  Transformers for Panoramic Semantic Segmentation
Behind Every Domain There is a Shift: Adapting Distortion-aware Vision Transformers for Panoramic Semantic Segmentation
Jiaming Zhang
Kailun Yang
Haowen Shi
Simon Reiß
Kunyu Peng
Chaoxiang Ma
Haodong Fu
Philip H. S. Torr
Kaiwei Wang
Rainer Stiefelhagen
ViT
MDE
24
35
0
25 Jul 2022
HiFormer: Hierarchical Multi-scale Representations Using Transformers
  for Medical Image Segmentation
HiFormer: Hierarchical Multi-scale Representations Using Transformers for Medical Image Segmentation
Moein Heidari
A. Kazerouni
Milad Soltany Kadarvish
Reza Azad
Ehsan Khodapanah Aghdam
Julien Cohen-Adad
Dorit Merhof
MedIm
ViT
25
171
0
18 Jul 2022
EATFormer: Improving Vision Transformer Inspired by Evolutionary
  Algorithm
EATFormer: Improving Vision Transformer Inspired by Evolutionary Algorithm
Jiangning Zhang
Xiangtai Li
Yabiao Wang
Chengjie Wang
Yibo Yang
Yong Liu
Dacheng Tao
ViT
28
32
0
19 Jun 2022
VSA: Learning Varied-Size Window Attention in Vision Transformers
VSA: Learning Varied-Size Window Attention in Vision Transformers
Qiming Zhang
Yufei Xu
Jing Zhang
Dacheng Tao
22
53
0
18 Apr 2022
Glance and Focus Networks for Dynamic Visual Recognition
Glance and Focus Networks for Dynamic Visual Recognition
Gao Huang
Yulin Wang
Kangchen Lv
Haojun Jiang
Wenhui Huang
Pengfei Qi
S. Song
3DH
66
49
0
09 Jan 2022
CMT: Convolutional Neural Networks Meet Vision Transformers
CMT: Convolutional Neural Networks Meet Vision Transformers
Jianyuan Guo
Kai Han
Han Wu
Yehui Tang
Chunjing Xu
Yunhe Wang
Chang Xu
ViT
337
629
0
13 Jul 2021
Visual Parser: Representing Part-whole Hierarchies with Transformers
Visual Parser: Representing Part-whole Hierarchies with Transformers
Shuyang Sun
Xiaoyu Yue
S. Bai
Philip H. S. Torr
50
27
0
13 Jul 2021
Transformer in Transformer
Transformer in Transformer
Kai Han
An Xiao
Enhua Wu
Jianyuan Guo
Chunjing Xu
Yunhe Wang
ViT
282
1,518
0
27 Feb 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction
  without Convolutions
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
263
3,604
0
24 Feb 2021
Bottleneck Transformers for Visual Recognition
Bottleneck Transformers for Visual Recognition
A. Srinivas
Tsung-Yi Lin
Niki Parmar
Jonathon Shlens
Pieter Abbeel
Ashish Vaswani
SLR
270
973
0
27 Jan 2021
Semantic Understanding of Scenes through the ADE20K Dataset
Semantic Understanding of Scenes through the ADE20K Dataset
Bolei Zhou
Hang Zhao
Xavier Puig
Tete Xiao
Sanja Fidler
Adela Barriuso
Antonio Torralba
SSeg
249
1,821
0
18 Aug 2016
1