2106.04803
CoAtNet: Marrying Convolution and Attention for All Data Sizes
Neural Information Processing Systems (NeurIPS), 2021
9 June 2021
Zihang Dai
Hanxiao Liu
Quoc V. Le
Mingxing Tan
ViT
Papers citing
"CoAtNet: Marrying Convolution and Attention for All Data Sizes"
50 / 510 papers shown
Title
EVCC: Enhanced Vision Transformer-ConvNeXt-CoAtNet Fusion for Classification
Kazi Reyazul Hasan
M. Rahman
Wasif Jalal
Sadif Ahmed
Shahriar Raj
Mubasshira Musarrat
Muhammad Abdullah Adnan
ViT
72
0
0
24 Nov 2025
GRPO-RM: Fine-Tuning Representation Models via GRPO-Driven Reinforcement Learning
Yanchen Xu
Ziheng Jiao
H. Zhang
Xuelong Li
272
0
0
19 Nov 2025
Multi-refined Feature Enhanced Sentiment Analysis Using Contextual Instruction
Peter Atandoh
Jie Zou
Weikang Guo
Jiwei Wei
Zheng Wang
154
0
0
01 Nov 2025
Kernelized Sparse Fine-Tuning with Bi-level Parameter Competition for Vision Models
Shufan Shen
Junshu Sun
Shuhui Wang
Qingming Huang
136
0
0
28 Oct 2025
Attentive Convolution: Unifying the Expressivity of Self-Attention with Convolutional Efficiency
Hao Yu
H. G. Chen
Yan Jiang
Wei Peng
Zhaodong Sun
Samuel Kaski
Guoying Zhao
133
0
0
23 Oct 2025
Translution: Unifying Self-attention and Convolution for Adaptive and Relative Modeling
Hehe Fan
Yi Yang
Mohan S. Kankanhalli
Fei Wu
ViT
92
0
0
11 Oct 2025
Universal Neural Architecture Space: Covering ConvNets, Transformers and Everything in Between
Ondřej Týbl
Lukáš Neumann
AI4CE
184
0
0
07 Oct 2025
Hierarchical Deep Fusion Framework for Multi-dimensional Facial Forgery Detection - The 2024 Global Deepfake Image Detection Challenge
Kohou Wang
Huan Hu
Xiang Liu
Z. Chen
Ping Chen
Zhaoxiang Liu
Shiguo Lian
92
0
0
16 Sep 2025
SAGA: Selective Adaptive Gating for Efficient and Expressive Linear Attention
Yuan Cao
Dong Wang
84
0
0
16 Sep 2025
LEGO: Spatial Accelerator Generation and Optimization for Tensor Applications
International Symposium on High-Performance Computer Architecture (HPCA), 2025
Yujun Lin
Zhekai Zhang
Song Han
128
1
0
15 Sep 2025
CoAtNeXt: An Attention-Enhanced ConvNeXtV2-Transformer Hybrid Model for Gastric Tissue Classification
Mustafa Yurdakul
Şakir Tasdemir
60
0
0
11 Sep 2025
Leveraging Transfer Learning and Mobile-enabled Convolutional Neural Networks for Improved Arabic Handwritten Character Recognition
IEEE Access (IEEE Access), 2025
Mohsine El Khayati
Ayyad Maafiri
Yassine Himeur
Hamzah Ali Alkhazaleh
Shadi Atalla
Wathiq Mansoor
101
0
0
05 Sep 2025
VCMamba: Bridging Convolutions with Multi-Directional Mamba for Efficient Visual Representation
Mustafa Munir
Alex Zhang
R. Marculescu
Mamba
212
0
0
04 Sep 2025
Image Quality Assessment for Machines: Paradigm, Large-scale Database, and Models
Xiaoqi Wang
Yun Zhang
Weisi Lin
124
0
0
27 Aug 2025
The Maximum Coverage Model and Recommendation System for UAV Vertiports Location Planning
Chunliang Hua
Xiao Hu
Jiayang Sun
Zeyuan Yang
148
0
0
18 Aug 2025
Learning Spatial Decay for Vision Transformers
Yuxin Mao
Zhen Qin
Jinxing Zhou
Bin Fan
Jing Zhang
Yiran Zhong
Yuchao Dai
100
1
0
13 Aug 2025
Topological Structure Description for Artcode Detection Using the Shape of Orientation Histogram
Liming Xu
Dave Towey
Andrew P French
Steve Benford
92
0
0
13 Aug 2025
Calibration Attention: Instance-wise Temperature Scaling for Vision Transformers
Wenhao Liang
Wei Emma Zhang
Lin Yue
Miao Xu
Olaf Maennel
L. Yao
128
0
0
12 Aug 2025
A Guide to Robust Generalization: The Impact of Architecture, Pre-training, and Optimization Strategy
M. Heuillet
Rishika Bhagwatkar
Jonas Ngnawé
Y. Pequignot
Alexandre Larouche
Christian Gagné
Irina Rish
Ola Ahmad
Audrey Durand
OOD
AAML
VLM
144
1
0
12 Aug 2025
UniConvNet: Expanding Effective Receptive Field while Maintaining Asymptotically Gaussian Distribution for ConvNets of Any Scale
Yuhao Wang
Wei Xi
192
1
0
12 Aug 2025
GVCCS: A Dataset for Contrail Identification and Tracking on Visible Whole Sky Camera Sequences
Gabriel Jarry
Ramon Dalmau
Philippe Very
Franck Ballerini
Stephania-Denisa Bocu
198
1
0
24 Jul 2025
Iwin Transformer: Hierarchical Vision Transformer using Interleaved Windows
Simin Huo
Ning Li
ViT
228
0
0
24 Jul 2025
Vision Transformers for End-to-End Quark-Gluon Jet Classification from Calorimeter Images
Md Abrar Jahin
Shahriar Soudeep
Arian Rahman Aditta
M. F. Mridha
Nafiz Fahad
Md. Jakir Hossen
ViT
129
2
0
17 Jun 2025
Uncertainty-Aware Remaining Lifespan Prediction from Images
Tristan Kenneweg
Philip Kenneweg
Barbara Hammer
MedIm
182
0
0
16 Jun 2025
DuoFormer: Leveraging Hierarchical Representations by Local and Global Attention Vision Transformer
Xiaoya Tang
Bodong Zhang
M. M. Ho
Beatrice Knudsen
Tolga Tasdizen
ViT
MedIm
141
0
0
15 Jun 2025
DeepTraverse: A Depth-First Search Inspired Network for Algorithmic Visual Understanding
Bin Guo
John H.L. Hansen
214
1
0
11 Jun 2025
Bayesian Neural Scaling Law Extrapolation with Prior-Data Fitted Networks
Dongwoo Lee
Dong Bok Lee
Steven Adriaensen
Juho Lee
Sung Ju Hwang
Frank Hutter
Seon Joo Kim
Hae Beom Lee
BDL
348
0
0
29 May 2025
Structured Initialization for Vision Transformers
Jianqiao Zheng
Xueqian Li
Hemanth Saratchandran
Simon Lucey
ViT
189
0
0
26 May 2025
PiT: Progressive Diffusion Transformer
Jiafu Wu
Yabiao Wang
Jian Li
Jinlong Peng
Yun Cao
Chengjie Wang
Jiangning Zhang
568
0
0
19 May 2025
Unified Sparse-Matrix Representations for Diverse Neural Architectures
Yuzhou Zhu
156
0
0
11 May 2025
ORXE: Orchestrating Experts for Dynamically Configurable Efficiency
Qingyuan Wang
Guoxin Wang
B. Cardiff
Deepu John
211
0
0
07 May 2025
False Promises in Medical Imaging AI? Assessing Validity of Outperformance Claims
Evangelia Christodoulou
Annika Reinke
Pascaline Andrè
Patrick Godau
P. Kalinowski
...
Amber L. Simpson
A. Kopp-Schneider
Gaël Varoquaux
O. Colliot
Lena Maier-Hein
262
3
0
07 May 2025
DCS-ST for Classification of Breast Cancer Histopathology Images with Limited Annotations
Applied Sciences (AS), 2025
Liu Suxing
Byungwon Min
409
0
0
06 May 2025
Corner Cases: How Size and Position of Objects Challenge ImageNet-Trained Models
Mishal Fatima
Steffen Jung
Margret Keuper
263
1
0
06 May 2025
Making Acoustic Side-Channel Attacks on Noisy Keyboards Viable with LLM-Assisted Spectrograms' "Typo" Correction
Workshop on Offensive Technologies (WOOT), 2025
Seyyed Ali Ayati
Jin Hyun Park
Yichen Cai
Marcus Botacin
116
0
0
15 Apr 2025
GFT: Gradient Focal Transformer
Boris Kriuk
Simranjit Kaur Gill
Shoaib Aslam
Amir Fakhrutdinov
196
0
0
14 Apr 2025
DefMamba: Deformable Visual State Space Model
Computer Vision and Pattern Recognition (CVPR), 2025
Leiye Liu
Miao Zhang
Jihao Yin
Tingwei Liu
Wei Ji
Yongri Piao
Huchuan Lu
Mamba
312
6
0
08 Apr 2025
HGFormer: Topology-Aware Vision Transformer with HyperGraph Learning
IEEE transactions on multimedia (TMM), 2025
Hao Wang
Shuo Zhang
Biao Leng
ViT
568
4
0
03 Apr 2025
Spectral-Adaptive Modulation Networks for Visual Perception
Guhnoo Yun
J. Yoo
Kijung Kim
Jeongho Lee
Paul Hongsuck Seo
Dong Hwan Kim
410
0
0
31 Mar 2025
LSNet: See Large, Focus Small
Computer Vision and Pattern Recognition (CVPR), 2025
Ao Wang
Hui Chen
Zijia Lin
Jiawei Han
Guiguang Ding
279
11
0
29 Mar 2025
vGamba: Attentive State Space Bottleneck for efficient Long-range Dependencies in Visual Recognition
Yunusa Haruna
A. Lawan
Mamba
563
0
0
27 Mar 2025
DVHGNN: Multi-Scale Dilated Vision HGNN for Efficient Vision Recognition
Computer Vision and Pattern Recognition (CVPR), 2025
Caoshuo Li
Tanzhe Li
Xiaobin Hu
Donghao Luo
Taisong Jin
222
4
0
19 Mar 2025
A Comprehensive LLM-powered Framework for Driving Intelligence Evaluation
IEEE International Conference on Robotics and Automation (ICRA), 2025
Shanhe You
Xuewen Luo
Xinhe Liang
Jiashu Yu
Chen Zheng
Jiangtao Gong
223
0
0
07 Mar 2025
TransMamba: Fast Universal Architecture Adaption from Transformers to Mamba
Xiuwei Chen
Sihao Lin
Xiao Dong
Sihao Lin
Meng Cao
Jiawei Han
Yina Zhuang
J. N. Han
Hang Xu
Xiaodan Liang
Mamba
324
4
0
21 Feb 2025
E2ENet: Dynamic Sparse Feature Fusion for Accurate and Efficient 3D Medical Image Segmentation
Neural Information Processing Systems (NeurIPS), 2023
Boqian Wu
Q. Xiao
Shiwei Liu
Lu Yin
Mykola Pechenizkiy
Decebal Constantin Mocanu
M. V. Keulen
Elena Mocanu
MedIm
309
14
0
20 Feb 2025
DFCon: Attention-Driven Supervised Contrastive Learning for Robust Deepfake Detection
MD Sadik Hossain Shanto
Mahir Labib Dihan
Souvik Ghosh
Riad Ahmed Anonto
Hafijul Hoque Chowdhury
...
Rakib Ahsan
Md Tanvir Hassan
MD Roqunuzzaman Sojib
Sheikh Azizul Hakim
M. Saifur Rahman
CVBM
210
1
0
28 Jan 2025
Deep-BrownConrady: Prediction of Camera Calibration and Distortion Parameters Using Deep Learning and Synthetic Data
IEEE Transactions on Automation Science and Engineering (T-ASE), 2025
Faiz Muhammad Chaudhry
Jarno Ralli
Jerome Leudet
Fahad Sohrab
Farhad Pakdaman
Pierre Corbani
Moncef Gabbouj
145
0
0
24 Jan 2025
Parallel Sequence Modeling via Generalized Spatial Propagation Network
Computer Vision and Pattern Recognition (CVPR), 2025
Hongjun Wang
Wonmin Byeon
Jiarui Xu
Liang Feng
Ka Chun Cheung
Xiaolong Wang
Kai Han
Jan Kautz
Sifei Liu
814
3
0
21 Jan 2025
VMamba: Visual State Space Model
Neural Information Processing Systems (NeurIPS), 2024
Yue Liu
Yunjie Tian
Yuzhong Zhao
Hongtian Yu
Lingxi Xie
Yaowei Wang
Qixiang Ye
Jianbin Jiao
Yunfan Liu
Mamba
1.0K
1,474
0
31 Dec 2024
Unity is Strength: Unifying Convolutional and Transformeral Features for Better Person Re-Identification
Yuhao Wang
Pingping Zhang
Xuehu Liu
Zhengzheng Tu
Huchuan Lu
225
7
0
23 Dec 2024