
Pay Attention to MLPs

Neural Information Processing Systems (NeurIPS), 2021

17 May 2021

Papers citing "Pay Attention to MLPs"

50 / 323 papers shown

Enhancing Targeted Attack Transferability via Diversified Weight Pruning

277

18 Aug 2022

giMLPs: Gate with Inhibition Mechanism in MLPs

163

01 Aug 2022

Doubly Deformable Aggregation of Covariance Matrices for Few-shot Segmentation

European Conference on Computer Vision (ECCV), 2022

Zhitong Xiong

Haopeng Li

Xiao Xiang Zhu

165

30 Jul 2022

HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions

Neural Information Processing Systems (NeurIPS), 2022

Yongming Rao

Wenliang Zhao

Yansong Tang

Jie Zhou

Ser-Nam Lim

Jiwen Lu

ViT

405

332

28 Jul 2022

TINYCD: A (Not So) Deep Learning Model For Change Detection

Andrea Codegoni

G. Lombardi

Alessandro Ferrari

231

110

26 Jul 2022

SplitMixer: Fat Trimmed From MLP-like Models

Ali Borji

Sikun Lin

188

21 Jul 2022

Assaying Out-Of-Distribution Generalization in Transfer Learning

Neural Information Processing Systems (NeurIPS), 2022

F. Wenzel

Andrea Dittadi

Peter V. Gehler

Carl-Johann Simon-Gabriel

Max Horn

...

Chris Russell

Thomas Brox

Bernt Schiele

Bernhard Schölkopf

Francesco Locatello

OOD OODD AAML

392

19 Jul 2022

Research Trends and Applications of Data Augmentation Algorithms

João Fonseca

F. Bação

206

18 Jul 2022

MLP-GAN for Brain Vessel Image Segmentation

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Yan Yan

267

17 Jul 2022

Parameterization of Cross-Token Relations with Relative Positional Encoding for Vision MLP

ACM Multimedia (ACM MM), 2022

196

15 Jul 2022

Trans4Map: Revisiting Holistic Bird's-Eye-View Mapping from Egocentric Images to Allocentric Semantics with Vision Transformers

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

Kailun Yang

160

13 Jul 2022

Image and Model Transformation with Secret Key for Vision Transformer

Hitoshi Kiya

Ryota Iijima

Maungmaung Aprilpyone

Yuma Kinoshita

ViT

157

12 Jul 2022

Branchformer: Parallel MLP-Attention Architectures to Capture Local and Global Context for Speech Recognition and Understanding

International Conference on Machine Learning (ICML), 2022

271

191

06 Jul 2022

CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse Transformers

Conference on Robot Learning (CoRL), 2022

Jiaqi Ma

422

310

05 Jul 2022

Less Is More: Fast Multivariate Time Series Forecasting with Light Sampling-oriented MLP Structures

Jiang Bian

343

247

04 Jul 2022

Golfer: Trajectory Prediction with Masked Goal Conditioning MnM Network

118

02 Jul 2022

QbyE-MLPMixer: Query-by-Example Open-Vocabulary Keyword Spotting using MLPMixer

Interspeech (Interspeech), 2022

114

23 Jun 2022

Time Gated Convolutional Neural Networks for Crop Classification

138

20 Jun 2022

Pre-training Vision Transformers with Formula-driven Supervised Learning

Computer Vision and Pattern Recognition (CVPR), 2022

Edgar Josafat Martinez-Noriega

Nakamasa Inoue

Rio Yokota

131

18 Jun 2022

GraphMLP: A Graph MLP-Like Architecture for 3D Human Pose Estimation

Pattern Recognition (Pattern Recogn.), 2022

433

13 Jun 2022

IL-MCAM: An interactive learning and multi-channel attention mechanism-based weakly supervised colorectal histopathology image classification approach

...

Xinyu Huang

157

117

07 Jun 2022

MDMLP: Image Classification from Scratch on Small Datasets with MLP

Tianxu Lv

Chongyang Bai

Chaojie Wang

171

28 May 2022

A Unified Weight Initialization Paradigm for Tensorial Convolutional Neural Networks

International Conference on Machine Learning (ICML), 2022

169

28 May 2022

Transformers from an Optimization Perspective

Neural Information Processing Systems (NeurIPS), 2022

Yongyi Yang

Zengfeng Huang

David Wipf

174

27 May 2022

AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition

Neural Information Processing Systems (NeurIPS), 2022

Ping Luo

616

932

26 May 2022

Augmentation-induced Consistency Regularization for Classification

IEEE International Joint Conference on Neural Network (IJCNN), 2022

223

25 May 2022

Sparse Mixers: Combining MoE and Mixing to build a more efficient BERT

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

James Lee-Thorp

Joshua Ainslie

MoE

220

24 May 2022

Are Message Passing Neural Networks Really Helpful for Knowledge Graph Completion?

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

247

21 May 2022

Unraveling Attention via Convex Duality: Analysis and Interpretations of Vision Transformers

International Conference on Machine Learning (ICML), 2022

Arda Sahiner

Tolga Ergen

Batu Mehmet Ozturkler

John M. Pauly

Morteza Mardani

Mert Pilanci

311

17 May 2022

Sequencer: Deep LSTM for Image Classification

Neural Information Processing Systems (NeurIPS), 2022

Yuki Tatsunami

Masato Taki

VLM ViT

341

109

04 May 2022

Improving Multimodal Speech Recognition by Data Augmentation and Speech Representations

Dan Oneaţă

H. Cucu

118

27 Apr 2022

GPUNet: Searching the Deployable Convolution Neural Networks for GPUs

Computer Vision and Pattern Recognition (CVPR), 2022

134

26 Apr 2022

Application of Transfer Learning and Ensemble Learning in Image-level Classification for Breast Histopathology

Intelligent Medicine (IM), 2022

...

Xinyu Huang

201

18 Apr 2022

Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Wenjing Zhu

Xiang Li

102

12 Apr 2022

The PartialSpoof Database and Countermeasures for the Detection of Short Fake Speech Segments Embedded in an Utterance

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

Xin Wang

317

11 Apr 2022

Are We Really Making Much Progress in Text Classification? A Comparative Review

369

08 Apr 2022

DaViT: Dual Attention Vision Transformers

European Conference on Computer Vision (ECCV), 2022

Mingyu Ding

Bin Xiao

Noel Codella

Ping Luo

Jingdong Wang

Lu Yuan

ViT

374

343

07 Apr 2022

MaxViT: Multi-Axis Vision Transformer

European Conference on Computer Vision (ECCV), 2022

Feng Yang

487

887

04 Apr 2022

InstaFormer: Instance-Aware Image-to-Image Translation with Transformer

Computer Vision and Pattern Recognition (CVPR), 2022

307

30 Mar 2022

Brain-inspired Multilayer Perceptron with Spiking Neurons

Computer Vision and Pattern Recognition (CVPR), 2022

165

28 Mar 2022

FAMLP: A Frequency-Aware MLP-Like Architecture For Domain Generalization

Yang Cao

181

24 Mar 2022

Focal Modulation Networks

Neural Information Processing Systems (NeurIPS), 2022

Jianwei Yang

Lu Yuan

343

383

22 Mar 2022

Three things everyone should know about Vision Transformers

European Conference on Computer Vision (ECCV), 2022

237

154

18 Mar 2022

On the Properties of Adversarially-Trained CNNs

Mattia Carletti

M. Terzi

Gian Antonio Susto

AAML

156

17 Mar 2022

Learning Audio Representations with MLPs

187

16 Mar 2022

Self-Promoted Supervision for Few-Shot Transformer

European Conference on Computer Vision (ECCV), 2022

Bowen Dong

179

14 Mar 2022

Efficient Language Modeling with Sparse all-MLP

Xian Li

184

14 Mar 2022

Contrastive Learning for Automotive mmWave Radar Detection Points Based Instance Segmentation

273

13 Mar 2022

HyperMixer: An MLP-based Low Cost Alternative to Transformers

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

358

07 Mar 2022

Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer

Xiaodong Liu

365

223

07 Mar 2022