Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2105.08050
Cited By
Pay Attention to MLPs
17 May 2021
Hanxiao Liu
Zihang Dai
David R. So
Quoc V. Le
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Pay Attention to MLPs"
50 / 303 papers shown
Title
Are Message Passing Neural Networks Really Helpful for Knowledge Graph Completion?
Juanhui Li
Harry Shomer
Jiayu Ding
Yiqi Wang
Yao Ma
Neil Shah
Jiliang Tang
Dawei Yin
37
11
0
21 May 2022
Unraveling Attention via Convex Duality: Analysis and Interpretations of Vision Transformers
Arda Sahiner
Tolga Ergen
Batu Mehmet Ozturkler
John M. Pauly
Morteza Mardani
Mert Pilanci
24
33
0
17 May 2022
Sequencer: Deep LSTM for Image Classification
Yuki Tatsunami
Masato Taki
VLM
ViT
16
78
0
04 May 2022
Improving Multimodal Speech Recognition by Data Augmentation and Speech Representations
Dan Oneaţă
H. Cucu
11
19
0
27 Apr 2022
GPUNet: Searching the Deployable Convolution Neural Networks for GPUs
Linnan Wang
Chenhan D. Yu
Satish Salian
Slawomir Kierat
Szymon Migacz
A. Fit-Florea
12
11
0
26 Apr 2022
Application of Transfer Learning and Ensemble Learning in Image-level Classification for Breast Histopathology
Yuchao Zheng
Chen Li
Xiaomin Zhou
Hao Chen
Hao Xu
...
Haiqing Zhang
Xirong Li
Hongzan Sun
Xinyu Huang
M. Grzegorzek
28
55
0
18 Apr 2022
Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation
Wenjing Zhu
Xiang Li
9
40
0
12 Apr 2022
The PartialSpoof Database and Countermeasures for the Detection of Short Fake Speech Segments Embedded in an Utterance
Lin Zhang
Xin Wang
Erica Cooper
Nicholas W. D. Evans
Junichi Yamagishi
19
56
0
11 Apr 2022
Are We Really Making Much Progress in Text Classification? A Comparative Review
Lukas Galke
Andor Diera
Bao Xin Lin
Bhakti Khera
Tim Meuser
Tushar Singhal
Fabian Karl
A. Scherp
VLM
24
3
0
08 Apr 2022
DaViT: Dual Attention Vision Transformers
Mingyu Ding
Bin Xiao
Noel Codella
Ping Luo
Jingdong Wang
Lu Yuan
ViT
30
240
0
07 Apr 2022
MaxViT: Multi-Axis Vision Transformer
Zhengzhong Tu
Hossein Talebi
Han Zhang
Feng Yang
P. Milanfar
A. Bovik
Yinxiao Li
ViT
43
636
0
04 Apr 2022
InstaFormer: Instance-Aware Image-to-Image Translation with Transformer
Soohyun Kim
Jongbeom Baek
Jihye Park
Gyeongnyeon Kim
Seung Wook Kim
ViT
28
47
0
30 Mar 2022
Brain-inspired Multilayer Perceptron with Spiking Neurons
Wenshuo Li
Hanting Chen
Jianyuan Guo
Ziyang Zhang
Yunhe Wang
28
34
0
28 Mar 2022
FAMLP: A Frequency-Aware MLP-Like Architecture For Domain Generalization
Kecheng Zheng
Yang Cao
Kai Zhu
Ruijing Zhao
Zhengjun Zha
12
5
0
24 Mar 2022
Focal Modulation Networks
Jianwei Yang
Chunyuan Li
Xiyang Dai
Lu Yuan
Jianfeng Gao
3DPC
22
263
0
22 Mar 2022
Three things everyone should know about Vision Transformers
Hugo Touvron
Matthieu Cord
Alaaeldin El-Nouby
Jakob Verbeek
Hervé Jégou
ViT
18
119
0
18 Mar 2022
On the Properties of Adversarially-Trained CNNs
Mattia Carletti
M. Terzi
Gian Antonio Susto
AAML
24
1
0
17 Mar 2022
Learning Audio Representations with MLPs
Mashrur M. Morshed
Ahmad Omar Ahsan
H. Mahmud
Md. Kamrul Hasan
19
4
0
16 Mar 2022
Self-Promoted Supervision for Few-Shot Transformer
Bowen Dong
Pan Zhou
Shuicheng Yan
W. Zuo
ViT
22
28
0
14 Mar 2022
Efficient Language Modeling with Sparse all-MLP
Ping Yu
Mikel Artetxe
Myle Ott
Sam Shleifer
Hongyu Gong
Ves Stoyanov
Xian Li
MoE
15
11
0
14 Mar 2022
Contrastive Learning for Automotive mmWave Radar Detection Points Based Instance Segmentation
Weiyi Xiong
Jianan Liu
Yu Xia
Tao Huang
Bing Zhu
Wei Xiang
14
21
0
13 Mar 2022
HyperMixer: An MLP-based Low Cost Alternative to Transformers
Florian Mai
Arnaud Pannatier
Fabio Fehr
Haolin Chen
François Marelli
F. Fleuret
James Henderson
20
11
0
07 Mar 2022
Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
Greg Yang
J. E. Hu
Igor Babuschkin
Szymon Sidor
Xiaodong Liu
David Farhi
Nick Ryder
J. Pachocki
Weizhu Chen
Jianfeng Gao
24
148
0
07 Mar 2022
HEAR: Holistic Evaluation of Audio Representations
Joseph P. Turian
Jordie Shier
H. Khan
Bhiksha Raj
Björn W. Schuller
...
P. Esling
Pranay Manocha
Shinji Watanabe
Zeyu Jin
Yonatan Bisk
31
99
0
06 Mar 2022
Bending Reality: Distortion-aware Transformers for Adapting to Panoramic Semantic Segmentation
Jiaming Zhang
Kailun Yang
Chaoxiang Ma
Simon Reiß
Kunyu Peng
Rainer Stiefelhagen
ViT
24
72
0
02 Mar 2022
Filter-enhanced MLP is All You Need for Sequential Recommendation
Kun Zhou
Hui Yu
Wayne Xin Zhao
Ji-Rong Wen
79
251
0
28 Feb 2022
Transformers in Medical Image Analysis: A Review
Kelei He
Chen Gan
Zhuoyuan Li
I. Rekik
Zihao Yin
Wen Ji
Yang Gao
Qian Wang
Junfeng Zhang
D. Shen
ViT
MedIm
20
254
0
24 Feb 2022
Transformer Quality in Linear Time
Weizhe Hua
Zihang Dai
Hanxiao Liu
Quoc V. Le
71
222
0
21 Feb 2022
Visual Attention Network
Meng-Hao Guo
Chengrou Lu
Zheng-Ning Liu
Ming-Ming Cheng
Shiyong Hu
ViT
VLM
19
636
0
20 Feb 2022
MLP-ASR: Sequence-length agnostic all-MLP architectures for speech recognition
Jin Sakuma
Tatsuya Komatsu
Robin Scheibler
13
6
0
17 Feb 2022
The Quarks of Attention
Pierre Baldi
Roman Vershynin
GNN
16
9
0
15 Feb 2022
Mixing and Shifting: Exploiting Global and Local Dependencies in Vision MLPs
Huangjie Zheng
Pengcheng He
Weizhu Chen
Mingyuan Zhou
22
14
0
14 Feb 2022
BViT: Broad Attention based Vision Transformer
Nannan Li
Yaran Chen
Weifan Li
Zixiang Ding
Dong Zhao
ViT
30
23
0
13 Feb 2022
pNLP-Mixer: an Efficient all-MLP Architecture for Language
Francesco Fusco
Damian Pascual
Peter W. J. Staar
Diego Antognini
26
29
0
09 Feb 2022
Image-to-Image MLP-mixer for Image Reconstruction
Youssef Mansour
Kang Lin
Reinhard Heckel
SupR
31
15
0
04 Feb 2022
Plug-In Inversion: Model-Agnostic Inversion for Vision with Data Augmentations
Amin Ghiasi
Hamid Kazemi
Steven Reich
Chen Zhu
Micah Goldblum
Tom Goldstein
34
15
0
31 Jan 2022
DynaMixer: A Vision MLP Architecture with Dynamic Mixing
Ziyu Wang
Wenhao Jiang
Yiming Zhu
Li Yuan
Yibing Song
Wei Liu
35
43
0
28 Jan 2022
When Shift Operation Meets Vision Transformer: An Extremely Simple Alternative to Attention Mechanism
Guangting Wang
Yucheng Zhao
Chuanxin Tang
Chong Luo
Wenjun Zeng
12
68
0
26 Jan 2022
Patches Are All You Need?
Asher Trockman
J. Zico Kolter
ViT
216
402
0
24 Jan 2022
MAXIM: Multi-Axis MLP for Image Processing
Zhengzhong Tu
Hossein Talebi
Han Zhang
Feng Yang
P. Milanfar
A. Bovik
Yinxiao Li
23
463
0
09 Jan 2022
Beyond modeling: NLP Pipeline for efficient environmental policy analysis
J. Planas
Daniel Firebanks-Quevedo
G. Naydenova
Ramansh Sharma
Cristina Taylor
Kathleen Buckingham
Rong Fang
20
4
0
08 Jan 2022
The GatedTabTransformer. An enhanced deep learning architecture for tabular modeling
Radostin Cholakov
T. Kolev
LMTD
27
12
0
01 Jan 2022
RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality
Xiaohan Ding
Honghao Chen
X. Zhang
Jungong Han
Guiguang Ding
17
71
0
21 Dec 2021
The King is Naked: on the Notion of Robustness for Natural Language Processing
Emanuele La Malfa
Marta Z. Kwiatkowska
20
28
0
13 Dec 2021
MLP Architectures for Vision-and-Language Modeling: An Empirical Study
Yi-Liang Nie
Linjie Li
Zhe Gan
Shuohang Wang
Chenguang Zhu
Michael Zeng
Zicheng Liu
Mohit Bansal
Lijuan Wang
12
6
0
08 Dec 2021
VIRT: Improving Representation-based Models for Text Matching through Virtual Interaction
Dan Li
Yang Yang
Hongyin Tang
Jingang Wang
Tong Bill Xu
Wei Yu Wu
Enhong Chen
20
7
0
08 Dec 2021
A Novel Deep Parallel Time-series Relation Network for Fault Diagnosis
Chun Yang
AI4TS
AI4CE
17
4
0
03 Dec 2021
SWAT: Spatial Structure Within and Among Tokens
Kumara Kahatapitiya
Michael S. Ryoo
23
6
0
26 Nov 2021
MorphMLP: An Efficient MLP-Like Backbone for Spatial-Temporal Representation Learning
David Junhao Zhang
Kunchang Li
Yali Wang
Yuxiang Chen
Shashwat Chandra
Yu Qiao
Luoqi Liu
Mike Zheng Shou
AI4TS
19
30
0
24 Nov 2021
An Image Patch is a Wave: Phase-Aware Vision MLP
Yehui Tang
Kai Han
Jianyuan Guo
Chang Xu
Yanxi Li
Chao Xu
Yunhe Wang
22
133
0
24 Nov 2021
Previous
1
2
3
4
5
6
7
Next