Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2106.04263
Cited By
v1
v2
v3
v4
v5 (latest)
On the Connection between Local Attention and Dynamic Depth-wise Convolution
International Conference on Learning Representations (ICLR), 2021
8 June 2021
Qi Han
Zejia Fan
Jingdong Sun
Lei-huan Sun
Ming-Ming Cheng
Jiaying Liu
Jingdong Wang
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Github (184★)
Papers citing
"On the Connection between Local Attention and Dynamic Depth-wise Convolution"
50 / 56 papers shown
Alias-Free ViT: Fractional Shift Invariance via Linear Attention
H. Michaeli
Daniel Soudry
228
1
0
26 Oct 2025
IONext: Unlocking the Next Era of Inertial Odometry
Shanshan Zhang
Qi Zhang
Siyue Wang
Tianshui Wen
Liqin Wu
Ziheng Zhou
Xuemin Hong
Ao Peng
Lingxiang Zheng
Yu Yang
227
1
0
23 Jul 2025
Adaptive Dual-domain Learning for Underwater Image Enhancement
AAAI Conference on Artificial Intelligence (AAAI), 2025
Lingtao Peng
Liheng Bian
392
8
0
27 Apr 2025
RCCFormer: A Robust Crowd Counting Network Based on Transformer
Peng Liu
Heng-Chao Li
Sen Lei
Nanqing Liu
Bin Feng
Xiao Wu
233
2
0
07 Apr 2025
VMamba: Visual State Space Model
Neural Information Processing Systems (NeurIPS), 2024
Yue Liu
Yunjie Tian
Yuzhong Zhao
Hongtian Yu
Lingxi Xie
Yaowei Wang
Qixiang Ye
Jianbin Jiao
Yunfan Liu
Mamba
1.4K
2,039
0
31 Dec 2024
Router-Tuning: A Simple and Effective Approach for Enabling Dynamic-Depth in Transformers
Shwai He
Tao Ge
Zheyu Shen
Bowei Tian
Xiaoyang Wang
Ang Li
MoE
540
6
0
17 Oct 2024
big.LITTLE Vision Transformer for Efficient Visual Recognition
He Guo
Yulong Wang
Zixuan Ye
Jifeng Dai
Yuwen Xiong
ViT
295
4
0
14 Oct 2024
Unifying Dimensions: A Linear Adaptive Approach to Lightweight Image Super-Resolution
Zhenyu Hu
Wanjie Sun
259
1
0
26 Sep 2024
MALT: Multi-scale Action Learning Transformer for Online Action Detection
Zhipeng Yang
Ruoyu Wang
Yang Tan
Liping Xie
OffRL
270
7
0
31 May 2024
Demystify Mamba in Vision: A Linear Attention Perspective
Dongchen Han
Ziyi Wang
Zhuofan Xia
Yizeng Han
Yifan Pu
Chunjiang Ge
Jun Song
Shiji Song
Bo Zheng
Gao Huang
Mamba
417
200
0
26 May 2024
Partial Large Kernel CNNs for Efficient Super-Resolution
Dongheon Lee
Seokju Yun
Youngmin Ro
SupR
255
8
0
18 Apr 2024
Enhancing Efficiency in Vision Transformer Networks: Design Techniques and Insights
Moein Heidari
Reza Azad
Sina Ghorbani Kolahi
René Arimond
Leon Niggemeier
...
Afshin Bozorgpour
Ehsan Khodapanah Aghdam
Amirhossein Kazerouni
Ilker Hacihaliloglu
Dorit Merhof
348
14
0
28 Mar 2024
HIRI-ViT: Scaling Vision Transformer with High Resolution Inputs
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Ting Yao
Yehao Li
Yingwei Pan
Tao Mei
ViT
221
41
0
18 Mar 2024
Frequency-Adaptive Dilated Convolution for Semantic Segmentation
Computer Vision and Pattern Recognition (CVPR), 2024
Linwei Chen
Lin Gu
Ying Fu
843
117
0
08 Mar 2024
Multi-step Temporal Modeling for UAV Tracking
Xiaoying Yuan
Tingfa Xu
Xincong Liu
Ying Wang
Haolin Qin
Yuqiang Fang
Jianan Li
257
25
0
07 Mar 2024
ConvTimeNet: A Deep Hierarchical Fully Convolutional Model for Multivariate Time Series Analysis
Mingyue Cheng
Jiqian Yang
Tingyue Pan
Qi Liu
Zhi Li
AI4TS
285
47
0
03 Mar 2024
How Do Humans Write Code? Large Models Do It the Same Way Too
Long Li
Xuzheng He
LRM
214
0
0
24 Feb 2024
Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications
Computer Vision and Pattern Recognition (CVPR), 2024
Yuwen Xiong
Zhiqi Li
Yuntao Chen
Feng Wang
Xizhou Zhu
...
Jiaming Song
Yu Qiao
Lewei Lu
Jie Zhou
Jifeng Dai
212
179
0
11 Jan 2024
Factorization Vision Transformer: Modeling Long Range Dependency with Local Window Cost
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Haolin Qin
Daquan Zhou
Tingfa Xu
Ziyang Bian
Jianan Li
255
20
0
14 Dec 2023
TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Meng Lou
Hong-Yu Zhou
Sibei Yang
Yizhou Yu
Chuan Wu
Yizhou Yu
ViT
656
118
0
30 Oct 2023
Interpret Vision Transformers as ConvNets with Dynamic Convolutions
Chong Zhou
Chen Change Loy
Bo Dai
ViT
310
1
0
19 Sep 2023
DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
Zhuofan Xia
Xuran Pan
Shiji Song
Li Erran Li
Gao Huang
ViT
356
43
0
04 Sep 2023
SPANet: Frequency-balancing Token Mixer using Spectral Pooling Aggregation Modulation
IEEE International Conference on Computer Vision (ICCV), 2023
Guhnoo Yun
J. Yoo
Kijung Kim
Jeongho Lee
Dong Hwan Kim
MoE
373
34
0
22 Aug 2023
SCSC: Spatial Cross-scale Convolution Module to Strengthen both CNNs and Transformers
Xijun Wang
Xiaojie Chu
Chunrui Han
Xiangyu Zhang
ViT
166
1
0
14 Aug 2023
Dual Aggregation Transformer for Image Super-Resolution
IEEE International Conference on Computer Vision (ICCV), 2023
Zheng Chen
Yulun Zhang
Jinjin Gu
Lingyu Kong
Yunbo Wang
Feng Yu
ViT
380
334
0
07 Aug 2023
Frequency Disentangled Features in Neural Image Compression
International Conference on Information Photonics (ICIP), 2023
Ali Zafari
Atefeh Khoshkhahtinat
P. Mehta
Mohammad Saeed Ebrahimi Saadabadi
Mohammad Akyash
Nasser M. Nasrabadi
269
19
0
04 Aug 2023
Recent Advances of Local Mechanisms in Computer Vision: A Survey and Outlook of Recent Work
Qiangchang Wang
Yilong Yin
352
1
0
02 Jun 2023
Implicit Temporal Modeling with Learnable Alignment for Video Recognition
IEEE International Conference on Computer Vision (ICCV), 2023
S. Tu
Jingdong Sun
Zuxuan Wu
Zhi-Qi Cheng
Hang-Rui Hu
Yu-Gang Jiang
366
63
0
20 Apr 2023
Transformer-Based Visual Segmentation: A Survey
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Xiangtai Li
Henghui Ding
Haobo Yuan
Wenwei Zhang
Jiangmiao Pang
Guangliang Cheng
Kai-xiang Chen
Ziwei Liu
Chen Change Loy
ViT
MedIm
579
281
0
19 Apr 2023
InceptionNeXt: When Inception Meets ConvNeXt
Computer Vision and Pattern Recognition (CVPR), 2023
Weihao Yu
Pan Zhou
Shuicheng Yan
Xinchao Wang
655
310
0
29 Mar 2023
Transformers in Speech Processing: A Survey
S. Latif
Aun Zaidi
Heriberto Cuayáhuitl
Fahad Shamshad
Moazzam Shoukat
Muhammad Usama
Junaid Qadir
521
76
0
21 Mar 2023
KBNet: Kernel Basis Network for Image Restoration
Yuanhang Zhang
Dasong Li
Xiaoyu Shi
Dailan He
Kangning Song
Xiaogang Wang
Hongwei Qin
Jiaming Song
287
80
0
06 Mar 2023
DilateFormer: Multi-Scale Dilated Transformer for Visual Recognition
IEEE transactions on multimedia (IEEE TMM), 2023
Jiayu Jiao
Yuyao Tang
Kun-Li Channing Lin
Yipeng Gao
Jinhua Ma
Yaowei Wang
Wei-Shi Zheng
MedIm
ViT
345
281
0
03 Feb 2023
DLGSANet: Lightweight Dynamic Local and Global Self-Attention Networks for Image Super-Resolution
IEEE International Conference on Computer Vision (ICCV), 2023
Xiang Li
Jin-shan Pan
Jinhui Tang
Jiangxin Dong
215
66
0
05 Jan 2023
Adaptively Clustering Neighbor Elements for Image-Text Generation
Zihua Wang
Xu Yang
Hanwang Zhang
Haiyang Xu
Mingshi Yan
Feisi Huang
Yu Zhang
VLM
631
0
0
05 Jan 2023
A Close Look at Spatial Modeling: From Attention to Convolution
Xu Ma
Huan Wang
Can Qin
Kunpeng Li
Xing Zhao
Jie Fu
Yun Fu
ViT
3DPC
202
13
0
23 Dec 2022
Reversible Column Networks
International Conference on Learning Representations (ICLR), 2022
Yuxuan Cai
Yi Zhou
Qi Han
Jianjian Sun
Xiangwen Kong
Jun Yu Li
Xiangyu Zhang
VLM
344
90
0
22 Dec 2022
Rethinking Vision Transformers for MobileNet Size and Speed
IEEE International Conference on Computer Vision (ICCV), 2022
Yanyu Li
Ju Hu
Yang Wen
Georgios Evangelidis
Kamyar Salahi
Yanzhi Wang
Sergey Tulyakov
Jian Ren
ViT
463
296
0
15 Dec 2022
FsaNet: Frequency Self-attention for Semantic Segmentation
IEEE Transactions on Image Processing (IEEE TIP), 2022
Fengyu Zhang
Ashkan Panahi
Guangjun Gao
AI4TS
327
57
0
28 Nov 2022
Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Qibin Hou
Cheng Lu
Mingg-Ming Cheng
Jiashi Feng
ViT
310
239
0
22 Nov 2022
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
Computer Vision and Pattern Recognition (CVPR), 2022
Wenhai Wang
Jifeng Dai
Zhe Chen
Zhenhang Huang
Zhiqi Li
...
Tong Lu
Lewei Lu
Jiaming Song
Xiaogang Wang
Yu Qiao
VLM
668
1,058
0
10 Nov 2022
Demystify Transformers & Convolutions in Modern Image Deep Networks
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Jifeng Dai
Min Shi
Weiyun Wang
Sitong Wu
Linjie Xing
...
Lewei Lu
Jie Zhou
Xiaogang Wang
Botian Shi
Xiao-hua Hu
ViT
360
11
0
10 Nov 2022
MetaFormer Baselines for Vision
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Weihao Yu
Chenyang Si
Pan Zhou
Mi Luo
Yichen Zhou
Jiashi Feng
Shuicheng Yan
Xinchao Wang
MoE
329
304
0
24 Oct 2022
Understanding the Covariance Structure of Convolutional Filters
International Conference on Learning Representations (ICLR), 2022
Asher Trockman
Devin Willmott
J. Zico Kolter
344
18
0
07 Oct 2022
DMFormer: Closing the Gap Between CNN and Vision Transformers
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Zimian Wei
H. Pan
Lujun Li
Menglong Lu
Xin-Yi Niu
Peijie Dong
Dongsheng Li
ViT
408
7
0
16 Sep 2022
An Efficient Spatio-Temporal Pyramid Transformer for Action Detection
European Conference on Computer Vision (ECCV), 2022
Yuetian Weng
Zizheng Pan
Mingfei Han
Xiaojun Chang
Bohan Zhuang
ViT
230
31
0
21 Jul 2022
Rethinking Attention Mechanism in Time Series Classification
Information Sciences (Inf. Sci.), 2022
Bowen Zhao
Huanlai Xing
Xinhan Wang
Fuhong Song
Zhiwen Xiao
AI4TS
227
52
0
14 Jul 2022
LargeKernel3D: Scaling up Kernels in 3D Sparse CNNs
Computer Vision and Pattern Recognition (CVPR), 2022
Yukang Chen
Jianhui Liu
Xinming Zhang
Xiaojuan Qi
Jiaya Jia
334
139
0
21 Jun 2022
EfficientFormer: Vision Transformers at MobileNet Speed
Neural Information Processing Systems (NeurIPS), 2022
Yanyu Li
Geng Yuan
Yang Wen
Eric Hu
Georgios Evangelidis
Sergey Tulyakov
Yanzhi Wang
Jian Ren
ViT
868
576
0
02 Jun 2022
Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives
Jun Li
Junyu Chen
Yucheng Tang
Ce Wang
Bennett A. Landman
S. K. Zhou
ViT
OOD
MedIm
547
185
0
02 Jun 2022
1
2
Next
Page 1 of 2