Pay Attention to MLPs

Neural Information Processing Systems (NeurIPS), 2021

17 May 2021

Papers citing "Pay Attention to MLPs"

50 / 323 papers shown

SpectFormer: Frequency and Attention is what you need in a Vision Transformer

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

Badri N. Patro

Vinay P. Namboodiri

Vijay Srinivas Agneeswaran

ViT

205

13 Apr 2023

MC-MLP: Multiple Coordinate Frames in all-MLP Architecture for Vision

171

08 Apr 2023

Revisiting the Evaluation of Image Synthesis with GANs

Neural Information Processing Systems (NeurIPS), 2023

282

04 Apr 2023

ReBotNet: Fast Real-time Video Enhancement

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

Jeya Maria Jose Valanarasu

260

23 Mar 2023

Boosting Convolution with Efficient MLP-Permutation for Volumetric Medical Image Segmentation

IEEE Transactions on Medical Imaging (TMI), 2023

589

23 Mar 2023

Multiscale Attention via Wavelet Neural Operators for Vision Transformers

204

22 Mar 2023

ProphNet: Efficient Agent-Centric Motion Forecasting with Anchor-Informed Proposals

Computer Vision and Pattern Recognition (CVPR), 2023

357

21 Mar 2023

ALOFT: A Lightweight MLP-like Architecture with Dynamic Low-frequency Transform for Domain Generalization

Computer Vision and Pattern Recognition (CVPR), 2023

Jintao Guo

Na Wang

Lei Qi

Yinghuan Shi

292

21 Mar 2023

Good Neighbors Are All You Need for Chinese Grapheme-to-Phoneme Conversion

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

155

14 Mar 2023

AutoMLP: Automated MLP for Sequential Recommendations

The Web Conference (WWW), 2023

Xiangyu Zhao

180

11 Mar 2023

Prismer: A Vision-Language Model with Multi-Task Experts

Linxi Fan

315

04 Mar 2023

MixVPR: Feature Mixing for Visual Place Recognition

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

A. Ali-bey

B. Chaib-draa

Philippe Giguère

215

230

03 Mar 2023

Image as Set of Points

International Conference on Learning Representations (ICLR), 2023

Huan Wang

186

02 Mar 2023

A Survey on Long Text Modeling with Transformers

404

28 Feb 2023

Co-Driven Recognition of Semantic Consistency via the Fusion of Transformer and HowNet Sememes Knowledge

127

21 Feb 2023

150

20 Feb 2023

Efficiency 360: Efficient Vision Transformers

Badri N. Patro

Vijay Srinivas Agneeswaran

406

16 Feb 2023

Open Problems in Applied Deep Learning

M. Raissi

AI4CE

233

26 Jan 2023

WLD-Reg: A Data-dependent Within-layer Diversity Regularizer

AAAI Conference on Artificial Intelligence (AAAI), 2023

131

03 Jan 2023

Rethinking Mobile Block for Efficient Attention-based Models

IEEE International Conference on Computer Vision (ICCV), 2023

Jiangning Zhang

Xiangtai Li

Yabiao Wang

Chengjie Wang

344

197

03 Jan 2023

BiMLP: Compact Binary Architectures for Vision Multi-Layer Perceptrons

Neural Information Processing Systems (NeurIPS), 2022

Yixing Xu

Xinghao Chen

Yunhe Wang

167

29 Dec 2022

A Generalization of ViT/MLP-Mixer to Graphs

International Conference on Machine Learning (ICML), 2022

Bryan Hooi

249

122

27 Dec 2022

OAMixer: Object-aware Mixing Layer for Vision Transformers

260

13 Dec 2022

BALF: Simple and Efficient Blur Aware Local Feature Detector

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

288

27 Nov 2022

EurNet: Efficient Multi-Range Relational Modeling of Spatial Multi-Relational Data

179

23 Nov 2022

SimVP: Towards Simple yet Powerful Spatiotemporal Predictive Learning

IEEE Transactions on Multimedia (IEEE TMM), 2022

Cheng Tan

Zhangyang Gao

Siyuan Li

Stan Z. Li

VLM AI4TS

273

22 Nov 2022

Unsupervised Echocardiography Registration through Patch-based MLPs and Transformers

282

21 Nov 2022

Convexifying Transformers: Improving optimization and understanding of transformer networks

229

20 Nov 2022

Astronomia ex machina: a history, primer, and outlook on neural networks in astronomy

Royal Society Open Science (RSOS), 2022

Michael J. Smith

James E. Geach

203

07 Nov 2022

How Much Does Attention Actually Attend? Questioning the Importance of Attention in Pretrained Transformers

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Hao Peng

244

07 Nov 2022

Neural Fourier Shift for Binaural Speech Rendering

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Jinkyu Lee

Kyogu Lee

222

02 Nov 2022

Globally Gated Deep Linear Networks

Neural Information Processing Systems (NeurIPS), 2022

Qianyi Li

H. Sompolinsky

AI4CE

261

31 Oct 2022

QNet: A Quantum-native Sequence Encoder Architecture

International Conference on Quantum Computing and Engineering (ICQCE), 2022

Wei-Yen Day

Hao-Sheng Chen

Min Sun

248

31 Oct 2022

Same Pre-training Loss, Better Downstream: Implicit Bias Matters for Language Models

International Conference on Machine Learning (ICML), 2022

323

25 Oct 2022

MetaFormer Baselines for Vision

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

Weihao Yu

245

272

24 Oct 2022

Similarity of Neural Architectures using Adversarial Attack Transferability

European Conference on Computer Vision (ECCV), 2022

540

20 Oct 2022

Decoupling Features in Hierarchical Propagation for Video Object Segmentation

Neural Information Processing Systems (NeurIPS), 2022

Zongxin Yang

Yi Yang

VOS

320

195

18 Oct 2022

SA-MLP: Distilling Graph Knowledge from GNNs into Structure-Aware MLP

Jie Chen

Junbin Gao

219

18 Oct 2022

AutoMoE: Heterogeneous Mixture-of-Experts with Adaptive Computation for Efficient Neural Machine Translation

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

Ganesh Jawahar

Subhabrata Mukherjee

Xiaodong Liu

Young Jin Kim

Muhammad Abdul-Mageed

L. Lakshmanan

Ahmed Hassan Awadallah

Sébastien Bubeck

Jianfeng Gao

MoE

184

14 Oct 2022

Are All Vision Models Created Equal? A Study of the Open-Loop to Closed-Loop Causality Gap

Ramin Hasani

Daniela Rus

220

09 Oct 2022

The Lie Derivative for Measuring Learned Equivariance

International Conference on Learning Representations (ICLR), 2022

293

06 Oct 2022

Centralized Feature Pyramid for Object Detection

IEEE Transactions on Image Processing (IEEE TIP), 2022

225

239

05 Oct 2022

Rethinking Performance Gains in Image Dehazing Networks

176

23 Sep 2022

Mega: Moving Average Equipped Gated Attention

International Conference on Learning Representations (ICLR), 2022

Graham Neubig

Luke Zettlemoyer

334

219

21 Sep 2022

Analysis of Quantization on MLP-based Vision Models

Lingran Zhao

Zhen Dong

Kurt Keutzer

136

14 Sep 2022

Pre-Training a Graph Recurrent Network for Language Representation

Yue Zhang

240

08 Sep 2022

LKD-Net: Large Kernel Convolution Network for Single Image Dehazing

IEEE International Conference on Multimedia and Expo (ICME), 2022

Pinjun Luo

Guoqiang Xiao

Xinbo Gao

Song Wu

145

05 Sep 2022

gSwin: Gated MLP Vision Model with Hierarchical Structure of Shifted Window

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Mocho Go

Hideyuki Tachibana

ViT

163

24 Aug 2022

Efficient Attention-free Video Shift Transformers

Adrian Bulat

Brais Martínez

Georgios Tzimiropoulos

ViT

214

23 Aug 2022

CM-MLP: Cascade Multi-scale MLP with Axial Context Relation Encoder for Edge Segmentation of Medical Image

IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2022

170

23 Aug 2022