Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2106.04803
Cited By
CoAtNet: Marrying Convolution and Attention for All Data Sizes
9 June 2021
Zihang Dai
Hanxiao Liu
Quoc V. Le
Mingxing Tan
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CoAtNet: Marrying Convolution and Attention for All Data Sizes"
50 / 482 papers shown
Title
ORXE: Orchestrating Experts for Dynamically Configurable Efficiency
Qingyuan Wang
Guoxin Wang
B. Cardiff
Deepu John
35
0
0
07 May 2025
False Promises in Medical Imaging AI? Assessing Validity of Outperformance Claims
Evangelia Christodoulou
Annika Reinke
Pascaline Andrè
Patrick Godau
P. Kalinowski
...
Amber L. Simpson
A. Kopp-Schneider
Gaël Varoquaux
O. Colliot
Lena Maier-Hein
38
0
0
07 May 2025
Corner Cases: How Size and Position of Objects Challenge ImageNet-Trained Models
Mishal Fatima
Steffen Jung
M. Keuper
33
0
0
06 May 2025
DCS-ST for Classification of Breast Cancer Histopathology Images with Limited Annotations
Liu Suxing
Byungwon Min
35
0
0
06 May 2025
Leveraging Depth Maps and Attention Mechanisms for Enhanced Image Inpainting
Jin Hyun Park
Harine Choi
Praewa Pitiphat
45
0
0
29 Apr 2025
Making Acoustic Side-Channel Attacks on Noisy Keyboards Viable with LLM-Assisted Spectrograms' "Typo" Correction
Seyyed Ali Ayati
Jin Hyun Park
Yichen Cai
Marcus Botacin
28
0
0
15 Apr 2025
GFT: Gradient Focal Transformer
Boris Kriuk
Simranjit Kaur Gill
Shoaib Aslam
Amir Fakhrutdinov
31
0
0
14 Apr 2025
DefMamba: Deformable Visual State Space Model
Leiye Liu
Miao Zhang
Jihao Yin
Tingwei Liu
Wei Ji
Yongri Piao
Huchuan Lu
Mamba
55
0
0
08 Apr 2025
HGFormer: Topology-Aware Vision Transformer with HyperGraph Learning
Hao Wang
Shuo Zhang
Biao Leng
ViT
67
0
0
03 Apr 2025
Spectral-Adaptive Modulation Networks for Visual Perception
Guhnoo Yun
J. Yoo
Kijung Kim
Jeongho Lee
Paul Hongsuck Seo
Dong Hwan Kim
37
0
0
31 Mar 2025
LSNet: See Large, Focus Small
Ao Wang
Hui Chen
Zijia Lin
J. Han
Guiguang Ding
37
0
0
29 Mar 2025
vGamba: Attentive State Space Bottleneck for efficient Long-range Dependencies in Visual Recognition
Yunusa Haruna
A. Lawan
Mamba
50
0
0
27 Mar 2025
DVHGNN: Multi-Scale Dilated Vision HGNN for Efficient Vision Recognition
Caoshuo Li
Tanzhe Li
Xiaobin Hu
Donghao Luo
Taisong Jin
53
0
0
19 Mar 2025
A Comprehensive LLM-powered Framework for Driving Intelligence Evaluation
Shanhe You
Xuewen Luo
Xinhe Liang
Jiashu Yu
Chen Zheng
Jiangtao Gong
64
0
0
07 Mar 2025
TransMamba: Fast Universal Architecture Adaption from Transformers to Mamba
Xiuwei Chen
Sihao Lin
Xiao Dong
Z. Chen
Meng Cao
J. Han
Hang Xu
Xiaodan Liang
Mamba
63
0
0
24 Feb 2025
E2ENet: Dynamic Sparse Feature Fusion for Accurate and Efficient 3D Medical Image Segmentation
Boqian Wu
Q. Xiao
Shiwei Liu
Lu Yin
Mykola Pechenizkiy
D. Mocanu
M. V. Keulen
Elena Mocanu
MedIm
53
4
0
20 Feb 2025
DFCon: Attention-Driven Supervised Contrastive Learning for Robust Deepfake Detection
MD Sadik Hossain Shanto
Mahir Labib Dihan
Souvik Ghosh
Riad Ahmed Anonto
Hafijul Hoque Chowdhury
...
Rakib Ahsan
Md Tanvir Hassan
MD Roqunuzzaman Sojib
Sheikh Azizul Hakim
M. Saifur Rahman
CVBM
71
0
0
28 Jan 2025
Parallel Sequence Modeling via Generalized Spatial Propagation Network
Hongjun Wang
Wonmin Byeon
Jiarui Xu
Jinwei Gu
Ka Chun Cheung
Xiaolong Wang
Kai Han
Jan Kautz
Sifei Liu
117
0
0
21 Jan 2025
Vim-F: Visual State Space Model Benefiting from Learning in the Frequency Domain
Juntao Zhang
Kun Bian
Peng Cheng
You Zhou
Jianning Liu
Wenbo An
Jun Zhou
Kun Shao
Mamba
47
2
0
08 Jan 2025
VMamba: Visual State Space Model
Yue Liu
Yunjie Tian
Yuzhong Zhao
Hongtian Yu
Lingxi Xie
Yaowei Wang
Qixiang Ye
Jianbin Jiao
Yunfan Liu
Mamba
113
609
0
31 Dec 2024
Unity is Strength: Unifying Convolutional and Transformeral Features for Better Person Re-Identification
Yuhao Wang
Pingping Zhang
Xuehu Liu
Zhengzheng Tu
Huchuan Lu
42
3
0
23 Dec 2024
Knowledge Migration Framework for Smart Contract Vulnerability Detection
Luqi Wang
Wenbao Jiang
81
0
0
15 Dec 2024
Joint multi-dimensional dynamic attention and transformer for general image restoration
Huan Zhang
Xu Zhang
Nian Cai
Jianglei Di
Yun Zhang
ViT
35
1
0
12 Nov 2024
Breaking the Low-Rank Dilemma of Linear Attention
Qihang Fan
Huaibo Huang
Ran He
33
0
0
12 Nov 2024
AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation
Anil Kag
Huseyin Coskun
Jierun Chen
Junli Cao
Willi Menapace
Aliaksandr Siarohin
Sergey Tulyakov
Jian Ren
46
3
0
07 Nov 2024
Harmformer: Harmonic Networks Meet Transformers for Continuous Roto-Translation Equivariance
Tomáš Karella
Adam Harmanec
J. Kotera
Jan Blažek
F. Šroubek
28
1
0
06 Nov 2024
Cross Feature Fusion of Fundus Image and Generated Lesion Map for Referable Diabetic Retinopathy Classification
Dahyun Mok
Junghyun Bum
Le Duc Tai
Hyunseung Choo
MedIm
29
0
0
06 Nov 2024
Expanding Sparse Tuning for Low Memory Usage
Shufan Shen
Junshu Sun
Xiangyang Ji
Qingming Huang
Shuhui Wang
40
0
0
04 Nov 2024
MLLA-UNet: Mamba-like Linear Attention in an Efficient U-Shape Model for Medical Image Segmentation
Yufeng Jiang
Zongxi Li
Xiangyan Chen
Haoran Xie
Jing Cai
Mamba
37
1
0
31 Oct 2024
TEAM: Topological Evolution-aware Framework for Traffic Forecasting--Extended Version
Duc Kieu
Tung Kieu
Peng Han
Bin Yang
Christian S. Jensen
Bac Le
AI4TS
16
1
0
24 Oct 2024
DCT-HistoTransformer: Efficient Lightweight Vision Transformer with DCT Integration for histopathological image analysis
Mahtab Ranjbar
Mehdi Mohebbi
Mahdi Cherakhloo
Bijan Vosoughi. Vahdat
MedIm
21
0
0
24 Oct 2024
SFB-net for cardiac segmentation: Bridging the semantic gap with attention
Nicolas Portal
Nadjia Kachenoura
Thomas Dietenbeck
Catherine Achard
28
0
0
24 Oct 2024
PETAH: Parameter Efficient Task Adaptation for Hybrid Transformers in a resource-limited Context
Maximilian Augustin
Syed Shakib Sarwar
Mostafa Elhoushi
Sai Qian Zhang
Yuecheng Li
B. D. Salvo
23
0
0
23 Oct 2024
TAS: Distilling Arbitrary Teacher and Student via a Hybrid Assistant
Guopeng Li
Qiang Wang
K. Yan
Shouhong Ding
Yuan Gao
Gui-Song Xia
26
0
0
16 Oct 2024
MoH: Multi-Head Attention as Mixture-of-Head Attention
Peng Jin
Bo Zhu
Li Yuan
Shuicheng Yan
MoE
29
13
0
15 Oct 2024
ED-ViT: Splitting Vision Transformer for Distributed Inference on Edge Devices
Xiang Liu
Yijun Song
Xia Li
Yifei Sun
Huiying Lan
Zemin Liu
Linshan Jiang
Jialin Li
17
1
0
15 Oct 2024
HorGait: A Hybrid Model for Accurate Gait Recognition in LiDAR Point Cloud Planar Projections
Jiaxing Hao
Yanxi Wang
Zhigang Chang
Hongmin Gao
Zihao Cheng
Chen Wu
Xin Zhao
Peiye Fang
Rachmat Muwardi
ViT
21
0
0
11 Oct 2024
QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model
Fei Xie
Weijia Zhang
Zhongdao Wang
Chao Ma
Mamba
24
3
0
09 Oct 2024
Guided Self-attention: Find the Generalized Necessarily Distinct Vectors for Grain Size Grading
Fang Gao
XueTao Li
Jiabao Wang
Shengheng Ma
Jun Yu
28
0
0
08 Oct 2024
ResTNet: Defense against Adversarial Policies via Transformer in Computer Go
Tai-Lin Wu
Ti-Rong Wu
Chung-Chin Shih
Yan-Ru Ju
I-Chen Wu
AAML
23
0
0
07 Oct 2024
Cross Resolution Encoding-Decoding For Detection Transformers
Ashish Kumar
Jaesik Park
ViT
26
0
0
05 Oct 2024
Designing Concise ConvNets with Columnar Stages
Ashish Kumar
Jaesik Park
MQ
23
0
0
05 Oct 2024
Universal Medical Image Representation Learning with Compositional Decoders
Kaini Wang
Ling Yang
Siping Zhou
Guangquan Zhou
Wentao Zhang
Bin Cui
Shuo Li
SSL
MedIm
31
0
0
30 Sep 2024
Mammo-Clustering: A Multi-views Tri-level Information Fusion Context Clustering Framework for Localization and Classification in Mammography
Shilong Yang
Chulong Zhang
Qi Zang
Juan Yu
Liang Zeng
...
Yexuan Xing
Xin Pan
Qi Li
Xiaokun Liang
Yaoqin Xie
40
0
0
23 Sep 2024
SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks
Meng Lou
Yunxiang Fu
Yizhou Yu
Mamba
53
5
0
15 Sep 2024
VFA: Vision Frequency Analysis of Foundation Models and Human
Mohammad Javad Darvishi Bayazi
Md Rifat Arefin
Jocelyn Faubert
Irina Rish
VLM
34
1
0
09 Sep 2024
Efficient Training of Large Vision Models via Advanced Automated Progressive Learning
Changlin Li
Jiawei Zhang
Sihao Lin
Zongxin Yang
Junwei Liang
Xiaodan Liang
Xiaojun Chang
VLM
21
0
0
06 Sep 2024
LowFormer: Hardware Efficient Design for Convolutional Transformer Backbones
Moritz Nottebaum
Matteo Dunnhofer
C. Micheloni
ViT
29
1
0
05 Sep 2024
TBConvL-Net: A Hybrid Deep Learning Architecture for Robust Medical Image Segmentation
Shahzaib Iqbal
Tariq M. Khan
Syed S. Naqvi
Asim Naveed
Erik H. W. Meijering
MedIm
48
6
0
05 Sep 2024
The USTC-NERCSLIP Systems for the CHiME-8 NOTSOFAR-1 Challenge
Shutong Niu
Ruoyu Wang
Jun Du
Gaobin Yang
Yanhui Tu
...
Tian Gao
Genshun Wan
Feng Ma
Jia Pan
Jianqing Gao
34
4
0
03 Sep 2024
1
2
3
4
...
8
9
10
Next