ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.04803
  4. Cited By
CoAtNet: Marrying Convolution and Attention for All Data Sizes
v1v2 (latest)

CoAtNet: Marrying Convolution and Attention for All Data Sizes

Neural Information Processing Systems (NeurIPS), 2021
9 June 2021
Zihang Dai
Hanxiao Liu
Quoc V. Le
Mingxing Tan
    ViT
ArXiv (abs)PDFHTML

Papers citing "CoAtNet: Marrying Convolution and Attention for All Data Sizes"

50 / 510 papers shown
Overparameterization from Computational Constraints
Overparameterization from Computational ConstraintsNeural Information Processing Systems (NeurIPS), 2022
Sanjam Garg
S. Jha
Saeed Mahloujifar
Mohammad Mahmoody
Mingyuan Wang
159
3
0
27 Aug 2022
Efficient Attention-free Video Shift Transformers
Efficient Attention-free Video Shift Transformers
Adrian Bulat
Brais Martínez
Georgios Tzimiropoulos
ViT
211
1
0
23 Aug 2022
Image as a Foreign Language: BEiT Pretraining for All Vision and
  Vision-Language Tasks
Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks
Wenhui Wang
Hangbo Bao
Li Dong
Johan Bjorck
Zhiliang Peng
...
Kriti Aggarwal
O. Mohammed
Saksham Singhal
Subhojit Som
Furu Wei
MLLMVLMViT
609
706
0
22 Aug 2022
TaCo: Textual Attribute Recognition via Contrastive Learning
TaCo: Textual Attribute Recognition via Contrastive LearningAAAI Conference on Artificial Intelligence (AAAI), 2022
Chang Nie
Yiqing Hu
Yanqiu Qu
Hao Liu
Deqiang Jiang
Bo Ren
250
1
0
22 Aug 2022
Conviformers: Convolutionally guided Vision Transformer
Conviformers: Convolutionally guided Vision Transformer
Mohit Vaishnav
Thomas Fel
I. F. Rodriguez
Thomas Serre
ViT
305
2
0
17 Aug 2022
SensorSCAN: Self-Supervised Learning and Deep Clustering for Fault
  Diagnosis in Chemical Processes
SensorSCAN: Self-Supervised Learning and Deep Clustering for Fault Diagnosis in Chemical ProcessesArtificial Intelligence (AIJ), 2022
Maksim Golyadkin
Vitaliy Pozdnyakov
L. Zhukov
Ilya Makarov
165
13
0
17 Aug 2022
In the Eye of Transformer: Global-Local Correlation for Egocentric Gaze
  Estimation
In the Eye of Transformer: Global-Local Correlation for Egocentric Gaze EstimationBritish Machine Vision Conference (BMVC), 2022
Bolin Lai
Miao Liu
Fiona Ryan
James M. Rehg
ViT
239
50
0
08 Aug 2022
Advancing Plain Vision Transformer Towards Remote Sensing Foundation
  Model
Advancing Plain Vision Transformer Towards Remote Sensing Foundation ModelIEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), 2022
Di Wang
Qiming Zhang
Yufei Xu
Jing Zhang
Bo Du
Dacheng Tao
Guang Dai
334
319
0
08 Aug 2022
Combined CNN Transformer Encoder for Enhanced Fine-grained Human Action
  Recognition
Combined CNN Transformer Encoder for Enhanced Fine-grained Human Action Recognition
M. C. Leong
Haosong Zhang
Huibin Tan
Liyuan Li
J. Lim
ViT
186
11
0
03 Aug 2022
A Novel Transformer Network with Shifted Window Cross-Attention for
  Spatiotemporal Weather Forecasting
A Novel Transformer Network with Shifted Window Cross-Attention for Spatiotemporal Weather ForecastingIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (IEEE JSTARS), 2022
Alabi Bojesomo
Hasan Al-Marzouqi
P. Liatsis
210
20
0
02 Aug 2022
HorNet: Efficient High-Order Spatial Interactions with Recursive Gated
  Convolutions
HorNet: Efficient High-Order Spatial Interactions with Recursive Gated ConvolutionsNeural Information Processing Systems (NeurIPS), 2022
Yongming Rao
Wenliang Zhao
Yansong Tang
Jie Zhou
Ser-Nam Lim
Jiwen Lu
ViT
408
332
0
28 Jul 2022
Convolutional Embedding Makes Hierarchical Vision Transformer Stronger
Convolutional Embedding Makes Hierarchical Vision Transformer StrongerEuropean Conference on Computer Vision (ECCV), 2022
Cong Wang
Hongmin Xu
Xiong Zhang
Li Wang
Zhitong Zheng
Haifeng Liu
ViT
101
29
0
27 Jul 2022
TreeSketchNet: From Sketch To 3D Tree Parameters Generation
TreeSketchNet: From Sketch To 3D Tree Parameters GenerationACM Transactions on Intelligent Systems and Technology (ACM TIST), 2022
Gilda Manfredi
N. Capece
U. Erra
M. Gruosso
167
16
0
25 Jul 2022
Online Continual Learning with Contrastive Vision Transformer
Online Continual Learning with Contrastive Vision TransformerEuropean Conference on Computer Vision (ECCV), 2022
Zhen Wang
Liu Liu
Yajing Kong
Jiaxian Guo
Dacheng Tao
CLL
172
42
0
24 Jul 2022
HybMT: Hybrid Meta-Predictor based ML Algorithm for Fast Test Vector
  Generation
HybMT: Hybrid Meta-Predictor based ML Algorithm for Fast Test Vector GenerationAsia and South Pacific Design Automation Conference (ASP-DAC), 2022
Shruti Pandey
J. Jayadeva
S. Sarangi
180
2
0
22 Jul 2022
Cost Aggregation with 4D Convolutional Swin Transformer for Few-Shot
  Segmentation
Cost Aggregation with 4D Convolutional Swin Transformer for Few-Shot SegmentationEuropean Conference on Computer Vision (ECCV), 2022
Sunghwan Hong
Seokju Cho
Jisu Nam
Stephen Lin
Seung Wook Kim
ViT
280
174
0
22 Jul 2022
Weakly Supervised Object Localization via Transformer with Implicit
  Spatial Calibration
Weakly Supervised Object Localization via Transformer with Implicit Spatial CalibrationEuropean Conference on Computer Vision (ECCV), 2022
Haotian Bai
Ruimao Zhang
Jiong Wang
Xiang Wan
WSOL
318
44
0
21 Jul 2022
SplitMixer: Fat Trimmed From MLP-like Models
SplitMixer: Fat Trimmed From MLP-like Models
Ali Borji
Sikun Lin
188
3
0
21 Jul 2022
AutoDiCE: Fully Automated Distributed CNN Inference at the Edge
AutoDiCE: Fully Automated Distributed CNN Inference at the Edge
Xiaotian Guo
A. Pimentel
T. Stefanov
58
2
0
20 Jul 2022
Vision Transformers: From Semantic Segmentation to Dense Prediction
Vision Transformers: From Semantic Segmentation to Dense PredictionInternational Journal of Computer Vision (IJCV), 2022
Li Zhang
Jiachen Lu
Sixiao Zheng
Xinxuan Zhao
Xiatian Zhu
Yanwei Fu
Tao Xiang
Jianfeng Feng
Philip H. S. Torr
ViT
270
16
0
19 Jul 2022
Towards Trustworthy Healthcare AI: Attention-Based Feature Learning for
  COVID-19 Screening With Chest Radiography
Towards Trustworthy Healthcare AI: Attention-Based Feature Learning for COVID-19 Screening With Chest Radiography
Kai Ma
Pengcheng Xi
K. Habashy
Ashkan Ebadi
Stéphane Tremblay
Alexander Wong
ViTMedIm
96
2
0
19 Jul 2022
Parameterization of Cross-Token Relations with Relative Positional
  Encoding for Vision MLP
Parameterization of Cross-Token Relations with Relative Positional Encoding for Vision MLPACM Multimedia (ACM MM), 2022
Zhicai Wang
Y. Hao
Xingyu Gao
Hao Zhang
Shuo Wang
Tingting Mu
Xiangnan He
196
8
0
15 Jul 2022
Next-ViT: Next Generation Vision Transformer for Efficient Deployment in
  Realistic Industrial Scenarios
Next-ViT: Next Generation Vision Transformer for Efficient Deployment in Realistic Industrial Scenarios
Jiashi Li
Xin Xia
W. Li
Huixia Li
Xing Wang
Xuefeng Xiao
Rui Wang
Min Zheng
Xin Pan
ViT
229
199
0
12 Jul 2022
Pure Transformers are Powerful Graph Learners
Pure Transformers are Powerful Graph LearnersNeural Information Processing Systems (NeurIPS), 2022
Jinwoo Kim
Tien Dat Nguyen
Seonwoo Min
Sungjun Cho
Moontae Lee
Honglak Lee
Seunghoon Hong
392
248
0
06 Jul 2022
Softmax-free Linear Transformers
Softmax-free Linear TransformersInternational Journal of Computer Vision (IJCV), 2022
Jiachen Lu
Junge Zhang
Xiatian Zhu
Jianfeng Feng
Tao Xiang
Li Zhang
ViT
211
14
0
05 Jul 2022
FFCNet: Fourier Transform-Based Frequency Learning and Complex
  Convolutional Network for Colon Disease Classification
FFCNet: Fourier Transform-Based Frequency Learning and Complex Convolutional Network for Colon Disease ClassificationInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2022
Kaini Wang
Yuting He
Shuaishuai Zhuang
Juzheng Miao
Xiaopu He
Ping Zhou
Guanyu Yang
Guangquan Zhou
Shuo Li
109
18
0
04 Jul 2022
Rethinking Query-Key Pairwise Interactions in Vision Transformers
Rethinking Query-Key Pairwise Interactions in Vision Transformers
Cheng-rong Li
Yangxin Liu
213
0
0
01 Jul 2022
Measuring Forgetting of Memorized Training Examples
Measuring Forgetting of Memorized Training ExamplesInternational Conference on Learning Representations (ICLR), 2022
Matthew Jagielski
Om Thakkar
Florian Tramèr
Daphne Ippolito
Katherine Lee
...
Eric Wallace
Shuang Song
Abhradeep Thakurta
Nicolas Papernot
Chiyuan Zhang
TDI
364
132
0
30 Jun 2022
Transfer Learning with Deep Tabular Models
Transfer Learning with Deep Tabular ModelsInternational Conference on Learning Representations (ICLR), 2022
Roman Levin
Valeriia Cherepanova
Avi Schwarzschild
Arpit Bansal
C. Bayan Bruss
Tom Goldstein
A. Wilson
Micah Goldblum
OODFedMLLMTD
283
73
0
30 Jun 2022
RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network
RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid NetworkConference on Machine Learning and Systems (MLSys), 2022
Vitaliy Chiley
Vithursan Thangarasa
Abhay Gupta
Anshul Samar
Joel Hestness
D. DeCoste
193
13
0
28 Jun 2022
ZoDIAC: Zoneout Dropout Injection Attention Calculation
ZoDIAC: Zoneout Dropout Injection Attention Calculation
Zanyar Zohourianshahzadi
Terrance Boult
Jugal Kalita
256
0
0
28 Jun 2022
Revisiting Architecture-aware Knowledge Distillation: Smaller Models and
  Faster Search
Revisiting Architecture-aware Knowledge Distillation: Smaller Models and Faster Search
Taehyeon Kim
Heesoo Myeong
Se-Young Yun
164
3
0
27 Jun 2022
Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens
  in 3D Space
Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D SpaceNeural Information Processing Systems (NeurIPS), 2022
Jinghuan Shang
Srijan Das
Michael S. Ryoo
315
14
0
23 Jun 2022
A novel adversarial learning strategy for medical image classification
A novel adversarial learning strategy for medical image classification
Zong Fan
Xiaohui Zhang
Jacob A. Gasienica
Jennifer Potts
S. Ruan
W. Thorstad
Hiram Gay
Pengfei Song
Xiaowei Wang
Hua Li
GANMedIm
243
6
0
23 Jun 2022
Explanation-based Counterfactual Retraining(XCR): A Calibration Method
  for Black-box Models
Explanation-based Counterfactual Retraining(XCR): A Calibration Method for Black-box Models
Liu Zhendong
Wenyu Jiang
Yan Zhang
Chongjun Wang
CML
154
0
0
22 Jun 2022
EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for
  Mobile Vision Applications
EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications
Muhammad Maaz
Abdelrahman M. Shaker
Hisham Cholakkal
Salman Khan
Syed Waqas Zamir
Rao Muhammad Anwer
Fahad Shahbaz Khan
ViT
291
292
0
21 Jun 2022
Vicinity Vision Transformer
Vicinity Vision TransformerIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Weixuan Sun
Zhen Qin
Huiyuan Deng
Jianyuan Wang
Yi Zhang
Kaihao Zhang
Nick Barnes
Stan Birchfield
Lingpeng Kong
Yiran Zhong
ViT
206
45
0
21 Jun 2022
Global Context Vision Transformers
Global Context Vision TransformersInternational Conference on Machine Learning (ICML), 2022
Ali Hatamizadeh
Hongxu Yin
Greg Heinrich
Jan Kautz
Pavlo Molchanov
ViT
465
189
0
20 Jun 2022
Video Capsule Endoscopy Classification using Focal Modulation Guided
  Convolutional Neural Network
Video Capsule Endoscopy Classification using Focal Modulation Guided Convolutional Neural Network
Abhishek Srivastava
Nikhil Kumar Tomar
Ulas Bagci
Debesh Jha
MedIm
127
21
0
16 Jun 2022
Write and Paint: Generative Vision-Language Models are Unified Modal
  Learners
Write and Paint: Generative Vision-Language Models are Unified Modal LearnersInternational Conference on Learning Representations (ICLR), 2022
Shizhe Diao
Wangchunshu Zhou
Xinsong Zhang
Jiawei Wang
MLLMAI4CE
294
19
0
15 Jun 2022
SP-ViT: Learning 2D Spatial Priors for Vision Transformers
SP-ViT: Learning 2D Spatial Priors for Vision TransformersBritish Machine Vision Conference (BMVC), 2022
Yuxuan Zhou
Wangmeng Xiang
Chong Li
Biao Wang
Xihan Wei
Lei Zhang
Margret Keuper
Xia Hua
ViT
117
19
0
15 Jun 2022
Efficient Adaptive Ensembling for Image Classification
Efficient Adaptive Ensembling for Image Classification
A. Bruno
Davide Moroni
M. Martinelli
187
22
0
15 Jun 2022
Differentiable Top-k Classification Learning
Differentiable Top-k Classification LearningInternational Conference on Machine Learning (ICML), 2022
Felix Petersen
Hilde Kuehne
Christian Borgelt
Oliver Deussen
273
40
0
15 Jun 2022
Peripheral Vision Transformer
Peripheral Vision TransformerNeural Information Processing Systems (NeurIPS), 2022
Juhong Min
Yucheng Zhao
Chong Luo
Minsu Cho
ViTMDE
238
35
0
14 Jun 2022
On Data Scaling in Masked Image Modeling
On Data Scaling in Masked Image ModelingComputer Vision and Pattern Recognition (CVPR), 2022
Zhenda Xie
Zheng Zhang
Yue Cao
Yutong Lin
Yixuan Wei
Jingdong Sun
Han Hu
209
68
0
09 Jun 2022
Unveiling Transformers with LEGO: a synthetic reasoning task
Unveiling Transformers with LEGO: a synthetic reasoning task
Yi Zhang
A. Backurs
Sébastien Bubeck
Ronen Eldan
Suriya Gunasekar
Tal Wagner
LRM
417
101
0
09 Jun 2022
From Attribution Maps to Human-Understandable Explanations through
  Concept Relevance Propagation
From Attribution Maps to Human-Understandable Explanations through Concept Relevance PropagationNature Machine Intelligence (Nat. Mach. Intell.), 2022
Reduan Achtibat
Maximilian Dreyer
Ilona Eisenbraun
S. Bosse
Thomas Wiegand
Wojciech Samek
Sebastian Lapuschkin
FAtt
248
197
0
07 Jun 2022
EfficientFormer: Vision Transformers at MobileNet Speed
EfficientFormer: Vision Transformers at MobileNet SpeedNeural Information Processing Systems (NeurIPS), 2022
Yanyu Li
Geng Yuan
Yang Wen
Eric Hu
Georgios Evangelidis
Sergey Tulyakov
Yanzhi Wang
Jian Ren
ViT
712
519
0
02 Jun 2022
Transforming medical imaging with Transformers? A comparative review of
  key properties, current progresses, and future perspectives
Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives
Jun Li
Junyu Chen
Yucheng Tang
Ce Wang
Bennett A. Landman
S. K. Zhou
ViTOODMedIm
433
144
0
02 Jun 2022
HiViT: Hierarchical Vision Transformer Meets Masked Image Modeling
HiViT: Hierarchical Vision Transformer Meets Masked Image Modeling
Xiaosong Zhang
Yunjie Tian
Wei Huang
QiXiang Ye
Jingdong Sun
Lingxi Xie
Qi Tian
255
39
0
30 May 2022
Previous
123...1011789
Next