Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2105.08050
Cited By
v1
v2 (latest)
Pay Attention to MLPs
Neural Information Processing Systems (NeurIPS), 2021
17 May 2021
Hanxiao Liu
Zihang Dai
David R. So
Quoc V. Le
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Pay Attention to MLPs"
50 / 323 papers shown
Transformers are Multi-State RNNs
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Matanel Oren
Michael Hassid
Nir Yarden
Yossi Adi
Roy Schwartz
OffRL
275
74
0
11 Jan 2024
Efficient Image Deblurring Networks based on Diffusion Models
Kang Chen
Yuanjie Liu
DiffM
318
4
0
11 Jan 2024
Learning Generalizable Models via Disentangling Spurious and Enhancing Potential Correlations
IEEE Transactions on Image Processing (TIP), 2024
Na Wang
Lei Qi
Jintao Guo
Yinghuan Shi
Yang Gao
OOD
244
7
0
11 Jan 2024
Setting the Record Straight on Transformer Oversmoothing
G. Dovonon
M. Bronstein
Matt J. Kusner
388
11
0
09 Jan 2024
Image Super-Resolution Reconstruction Network based on Enhanced Swin Transformer via Alternating Aggregation of Local-Global Features
Yuming Huang
Yingpin Chen
Changhui Wu
Hanrong Xie
Binhui Song
SupR
ViT
389
1
0
30 Dec 2023
SCHEME: Scalable Channel Mixer for Vision Transformers
Deepak Sridhar
Yunsheng Li
Nuno Vasconcelos
780
1
0
01 Dec 2023
Dimension Mixer: A Generalized Method for Structured Sparsity in Deep Neural Networks
Suman Sapkota
Binod Bhattarai
247
0
0
30 Nov 2023
Full-resolution MLPs Empower Medical Dense Prediction
Mingyuan Meng
Yuxin Xue
Da-wei Feng
Lei Bi
Jinman Kim
MedIm
213
5
0
28 Nov 2023
CMFDFormer: Transformer-based Copy-Move Forgery Detection with Continual Learning
Yaqi Liu
Chao Xia
Song Xiao
Qingxiao Guan
Wenqian Dong
Yifan Zhang
Neng H. Yu
172
3
0
22 Nov 2023
4K-Resolution Photo Exposure Correction at 125 FPS with ~8K Parameters
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Yijie Zhou
Chao Li
Jin Liang
Tianyi Xu
Xin Liu
Jun Xu
3DV
375
17
0
15 Nov 2023
Two-Stage Aggregation with Dynamic Local Attention for Irregular Time Series
Xingyu Chen
Xiaochen Zheng
Amina Mollaysa
Manuel Schürch
Ahmed Allam
Michael Krauthammer
AI4TS
218
1
0
13 Nov 2023
Hierarchically Gated Recurrent Neural Network for Sequence Modeling
Zhen Qin
Aaron Courville
Yiran Zhong
196
115
0
08 Nov 2023
Scattering Vision Transformer: Spectral Mixing Matters
Neural Information Processing Systems (NeurIPS), 2023
Badri N. Patro
Vijay Srinivas Agneeswaran
410
27
0
02 Nov 2023
Fantastic Gains and Where to Find Them: On the Existence and Prospect of General Knowledge Transfer between Any Pretrained Model
International Conference on Learning Representations (ICLR), 2023
Karsten Roth
Lukas Thede
Almut Sophia Koepke
Oriol Vinyals
Olivier J. Hénaff
Zeynep Akata
AAML
309
17
0
26 Oct 2023
Unraveling Feature Extraction Mechanisms in Neural Networks
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Xiaobing Sun
Jiaxi Li
Wei Lu
204
0
0
25 Oct 2023
Handling Data Heterogeneity via Architectural Design for Federated Visual Recognition
Neural Information Processing Systems (NeurIPS), 2023
Sara Pieri
Jose Renato Restom
Samuel Horvath
Hisham Cholakkal
FedML
161
9
0
23 Oct 2023
Exploring Driving Behavior for Autonomous Vehicles Based on Gramian Angular Field Vision Transformer
Junwei You
Ying Chen
Zhuoyu Jiang
Zhangchi Liu
Zilin Huang
Yifeng Ding
Bin Ran
196
6
0
21 Oct 2023
Attentive Multi-Layer Perceptron for Non-autoregressive Generation
Shuyang Jiang
Jinchao Zhang
Jiangtao Feng
Lin Zheng
Lingpeng Kong
243
0
0
14 Oct 2023
IFAST: Weakly Supervised Interpretable Face Anti-spoofing from Single-shot Binocular NIR Images
IEEE Transactions on Information Forensics and Security (IEEE TIFS), 2023
Jiancheng Huang
Donghao Zhou
Shifeng Chen
CVBM
200
4
0
29 Sep 2023
Auto-Regressive Next-Token Predictors are Universal Learners
International Conference on Machine Learning (ICML), 2023
Eran Malach
LRM
216
53
0
13 Sep 2023
Dynamic Spectrum Mixer for Visual Recognition
Zhiqiang Hu
Tao Yu
202
5
0
13 Sep 2023
Hindering Adversarial Attacks with Multiple Encrypted Patch Embeddings
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2023
AprilPyone Maungmaung
Isao Echizen
Hitoshi Kiya
AAML
184
2
0
04 Sep 2023
SiT-MLP: A Simple MLP with Point-wise Topology Feature Learning for Skeleton-based Action Recognition
Shaojie Zhang
Jianqin Yin
Yonghao Dang
Jiajun Fu
267
14
0
30 Aug 2023
CS-Mixer: A Cross-Scale Vision MLP Model with Spatial-Channel Mixing
Jianwei Cui
David A. Araujo
Suman Saha
Md Faisal Kabir
BDL
237
1
0
25 Aug 2023
SPANet: Frequency-balancing Token Mixer using Spectral Pooling Aggregation Modulation
IEEE International Conference on Computer Vision (ICCV), 2023
Guhnoo Yun
J. Yoo
Kijung Kim
Jeongho Lee
Dong Hwan Kim
MoE
214
29
0
22 Aug 2023
An Effective Transformer-based Contextual Model and Temporal Gate Pooling for Speaker Identification
Harunori Kawano
Sota Shimizu
146
1
0
22 Aug 2023
Attention Is Not All You Need Anymore
Zhe Chen
304
7
0
15 Aug 2023
Block-Wise Encryption for Reliable Vision Transformer models
Hitoshi Kiya
Ryota Iijima
Teru Nagamori
147
4
0
15 Aug 2023
Spatial Gated Multi-Layer Perceptron for Land Use and Land Cover Mapping
IEEE Geoscience and Remote Sensing Letters (GRSL), 2023
Ali Jamali
Swalpa Kumar Roy
Danfeng Hong
P. M. Atkinson
Pedram Ghamisi
95
15
0
09 Aug 2023
Dual Aggregation Transformer for Image Super-Resolution
IEEE International Conference on Computer Vision (ICCV), 2023
Zheng Chen
Yulun Zhang
Jinjin Gu
Lingyu Kong
Yunbo Wang
Feng Yu
ViT
289
288
0
07 Aug 2023
Guided Distillation for Semi-Supervised Instance Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Tariq Berrada
Camille Couprie
Alahari Karteek
Jakob Verbeek
202
19
0
03 Aug 2023
Strip-MLP: Efficient Token Interaction for Vision MLP
IEEE International Conference on Computer Vision (ICCV), 2023
Guiping Cao
Shengda Luo
Wen-Fong Huang
X. Lan
Shihong Deng
Yaowei Wang
Jianguo Zhang
197
16
0
21 Jul 2023
Improving Text Semantic Similarity Modeling through a 3D Siamese Network
European Conference on Artificial Intelligence (ECAI), 2023
Jianxiang Zang
Hui Liu
3DPC
183
6
0
18 Jul 2023
Scaling MLPs: A Tale of Inductive Bias
Neural Information Processing Systems (NeurIPS), 2023
Gregor Bachmann
Sotiris Anagnostidis
Thomas Hofmann
363
54
0
23 Jun 2023
TSMixer: Lightweight MLP-Mixer Model for Multivariate Time Series Forecasting
Knowledge Discovery and Data Mining (KDD), 2023
Vijayabharathi Ekambaram
Arindam Jati
Nam H. Nguyen
Phanwadee Sinthong
Jayant Kalagnanam
AI4TS
467
274
0
14 Jun 2023
Brainformers: Trading Simplicity for Efficiency
International Conference on Machine Learning (ICML), 2023
Yan-Quan Zhou
Nan Du
Yanping Huang
Daiyi Peng
Chang Lan
...
Zhifeng Chen
Quoc V. Le
Claire Cui
J.H.J. Laundon
J. Dean
MoE
227
36
0
29 May 2023
HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition
Interspeech (Interspeech), 2023
Florian Mai
Juan Pablo Zuluaga
Titouan Parcollet
P. Motlícek
156
12
0
29 May 2023
Range-Based Equal Error Rate for Spoof Localization
Interspeech (Interspeech), 2023
Lin Zhang
Xin Wang
Erica Cooper
Nicholas W. D. Evans
Junichi Yamagishi
148
17
0
28 May 2023
Caterpillar: A Pure-MLP Architecture with Shifted-Pillars-Concatenation
ACM Multimedia (ACM MM), 2023
J. Sun
Xiaoshuang Shi
Zhiyuan Weng
Kaidi Xu
Mengqi Li
Xiao-lan Zhu
MLLM
284
4
0
28 May 2023
Exploring Automatically Perturbed Natural Language Explanations in Relation Extraction
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Wanyun Cui
Xingran Chen
LRM
AAML
191
0
0
24 May 2023
MRN: Multiplexed Routing Network for Incremental Multilingual Text Recognition
IEEE International Conference on Computer Vision (ICCV), 2023
Tianlun Zheng
Zhineng Chen
Bin Huang
Wei Zhang
Yuran Jiang
354
15
0
24 May 2023
TriMLP: Revenge of a MLP-like Architecture in Sequential Recommendation
Yiheng Jiang
Yuanbo Xu
Yongjian Yang
Funing Yang
Pengyang Wang
Hui Xiong
226
2
0
24 May 2023
RWKV: Reinventing RNNs for the Transformer Era
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Bo Peng
Eric Alcaide
Quentin G. Anthony
Alon Albalak
Samuel Arcadinho
...
Qihang Zhao
P. Zhou
Qinghua Zhou
Jian Zhu
Rui-Jie Zhu
578
834
0
22 May 2023
Scalable Coupling of Deep Learning with Logical Reasoning
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Marianne Defresne
S. Barbe
T. Schiex
NAI
AI4CE
254
10
0
12 May 2023
Toeplitz Neural Network for Sequence Modeling
International Conference on Learning Representations (ICLR), 2023
Zhen Qin
Xiaodong Han
Weixuan Sun
Bowen He
Dong Li
Dongxu Li
Yuchao Dai
Lingpeng Kong
Yiran Zhong
AI4TS
ViT
159
49
0
08 May 2023
MH-DETR: Video Moment and Highlight Detection with Cross-modal Transformer
IEEE International Joint Conference on Neural Network (IJCNN), 2023
Yifang Xu
Yunzhuo Sun
Yang Li
Yilei Shi
Xiaoxia Zhu
S. Du
ViT
253
48
0
29 Apr 2023
Towards Effective and Interpretable Human-Agent Collaboration in MOBA Games: A Communication Perspective
International Conference on Learning Representations (ICLR), 2023
Yiming Gao
Feiyu Liu
Liang Wang
Zhenjie Lian
Weixuan Wang
...
Jiawei Wang
Qiang Fu
Wei Yang
Lanxiao Huang
Wei Liu
159
10
0
23 Apr 2023
Is Cross-modal Information Retrieval Possible without Training?
European Conference on Information Retrieval (ECIR), 2023
Hyunjin Choi
HyunJae Lee
Seongho Joe
Youngjune Gwon
129
1
0
20 Apr 2023
MLP-AIR: An Efficient MLP-Based Method for Actor Interaction Relation Learning in Group Activity Recognition
Guoliang Xu
Jianqin Yin
137
1
0
18 Apr 2023
Fusing Structure from Motion and Simulation-Augmented Pose Regression from Optical Flow for Challenging Indoor Environments
Journal of Visual Communication and Image Representation (JVCIR), 2023
Felix Ott
Lucas Heublein
David Rügamer
B. Bischl
Christopher Mutschler
433
6
0
14 Apr 2023
Previous
1
2
3
4
5
6
7
Next