Pay Attention to MLPs

Neural Information Processing Systems (NeurIPS), 2021
17 May 2021
Hanxiao Liu
Zihang Dai
David R. So
Quoc V. Le
    AI4CE
arXiv: 2105.08050

Papers citing "Pay Attention to MLPs"

50 / 323 papers shown
SpectFormer: Frequency and Attention is what you need in a Vision Transformer
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Badri N. Patro
Vinay P. Namboodiri
Vijay Srinivas Agneeswaran
ViT
205
99
0
13 Apr 2023
MC-MLP: Multiple Coordinate Frames in all-MLP Architecture for Vision
Zhimin Zhu
Jianguo Zhao
Tong Mu
Yuliang Yang
Mengyu Zhu
171
0
0
08 Apr 2023
Revisiting the Evaluation of Image Synthesis with GANs
Neural Information Processing Systems (NeurIPS), 2023
Mengping Yang
Ceyuan Yang
Yichi Zhang
Qingyan Bai
Yujun Shen
Bo Dai
EGVM
278
10
0
04 Apr 2023
ReBotNet: Fast Real-time Video Enhancement
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Jeya Maria Jose Valanarasu
Rahul Garg
Andeep S. Toor
Xin Tong
Weijuan Xi
Andreas Lugmayr
Vishal M. Patel
A. Menini
260
2
0
23 Mar 2023
Boosting Convolution with Efficient MLP-Permutation for Volumetric Medical Image Segmentation
IEEE Transactions on Medical Imaging (TMI), 2023
Yi Lin
Xiao Fang
Dong Zhang
Kwang-Ting Cheng
Hao Chen
MedIm
589
4
0
23 Mar 2023
Multiscale Attention via Wavelet Neural Operators for Vision Transformers
Anahita Nekoozadeh
M. Ahmadzadeh
Zahra Mardani
ViT
201
2
0
22 Mar 2023
ProphNet: Efficient Agent-Centric Motion Forecasting with Anchor-Informed Proposals
Computer Vision and Pattern Recognition (CVPR), 2023
Xishun Wang
Tong Su
Fang Da
Xiaodong Yang
357
85
0
21 Mar 2023
ALOFT: A Lightweight MLP-like Architecture with Dynamic Low-frequency Transform for Domain Generalization
Computer Vision and Pattern Recognition (CVPR), 2023
Jintao Guo
Na Wang
Lei Qi
Yinghuan Shi
291
61
0
21 Mar 2023
Good Neighbors Are All You Need for Chinese Grapheme-to-Phoneme Conversion
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Jungjun Kim
C. Han
Gyuhyeon Nam
Gyeongsu Chae
155
5
0
14 Mar 2023
AutoMLP: Automated MLP for Sequential Recommendations
The Web Conference (WWW), 2023
Muyang Li
Zijian Zhang
Xiangyu Zhao
Wanyu Wang
Minghao Zhao
Runze Wu
Ruocheng Guo
AI4TS
180
63
0
11 Mar 2023
Prismer: A Vision-Language Model with Multi-Task Experts
Shikun Liu
Linxi Fan
Edward Johns
Zhiding Yu
Chaowei Xiao
Anima Anandkumar
VLM, MLLM
304
33
0
04 Mar 2023
MixVPR: Feature Mixing for Visual Place Recognition
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
A. Ali-bey
B. Chaib-draa
Philippe Giguère
214
228
0
03 Mar 2023
Image as Set of Points
International Conference on Learning Representations (ICLR), 2023
Xu Ma
Yuqian Zhou
Huan Wang
Can Qin
Bin Sun
Chang Liu
Yun Fu
VLM
186
65
0
02 Mar 2023
A Survey on Long Text Modeling with Transformers
Zican Dong
Tianyi Tang
Lunyi Li
Wayne Xin Zhao
VLM
401
69
0
28 Feb 2023
Co-Driven Recognition of Semantic Consistency via the Fusion of Transformer and HowNet Sememes Knowledge
Fan Chen
Yan Huang
Xinfang Zhang
KangYi Luo
Jinxuan Zhu
Ruixian He
123
1
0
21 Feb 2023
Optical Transformers
Maxwell G. Anderson
Shifan Ma
Tianyu Wang
Logan G. Wright
Peter L. McMahon
150
35
0
20 Feb 2023
Efficiency 360: Efficient Vision Transformers
Badri N. Patro
Vijay Srinivas Agneeswaran
405
7
0
16 Feb 2023
Open Problems in Applied Deep Learning
M. Raissi
AI4CE
232
3
0
26 Jan 2023
WLD-Reg: A Data-dependent Within-layer Diversity Regularizer
AAAI Conference on Artificial Intelligence (AAAI), 2023
Firas Laakom
Jenni Raitoharju
Alexandros Iosifidis
Moncef Gabbouj
AI4CE
131
8
0
03 Jan 2023
Rethinking Mobile Block for Efficient Attention-based Models
IEEE International Conference on Computer Vision (ICCV), 2023
Jiangning Zhang
Xiangtai Li
Jian Li
Liang Liu
Zhucun Xue
Boshen Zhang
Zhe Jiang
Tianxin Huang
Yabiao Wang
Chengjie Wang
MQ
344
195
0
03 Jan 2023
BiMLP: Compact Binary Architectures for Vision Multi-Layer Perceptrons
Neural Information Processing Systems (NeurIPS), 2022
Yixing Xu
Xinghao Chen
Yunhe Wang
MQ
167
8
0
29 Dec 2022
A Generalization of ViT/MLP-Mixer to Graphs
International Conference on Machine Learning (ICML), 2022
Xiaoxin He
Bryan Hooi
T. Laurent
Adam Perold
Yann LeCun
Xavier Bresson
243
122
0
27 Dec 2022
OAMixer: Object-aware Mixing Layer for Vision Transformers
H. Kang
Sangwoo Mo
Jinwoo Shin
VLM
260
5
0
13 Dec 2022
BALF: Simple and Efficient Blur Aware Local Feature Detector
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Zhenjun Zhao
Yuwei Zhai
Ben Chen
Peidong Liu
284
24
0
27 Nov 2022
EurNet: Efficient Multi-Range Relational Modeling of Spatial Multi-Relational Data
Minghao Xu
Yuanfan Guo
Yi Xu
Jiangtao Tang
Xinlei Chen
Yuandong Tian
GNN
179
6
0
23 Nov 2022
SimVP: Towards Simple yet Powerful Spatiotemporal Predictive Learning
IEEE Transactions on Multimedia (IEEE TMM), 2022
Cheng Tan
Zhangyang Gao
Siyuan Li
Stan Z. Li
VLM, AI4TS
273
40
0
22 Nov 2022
Unsupervised Echocardiography Registration through Patch-based MLPs and Transformers
Zihao Wang
Yingyu Yang
Maxime Sermesant
H. Delingette
ViT, MedIm
279
8
0
21 Nov 2022
Convexifying Transformers: Improving optimization and understanding of transformer networks
Tolga Ergen
Behnam Neyshabur
Harsh Mehta
MLT
229
15
0
20 Nov 2022
Astronomia ex machina: a history, primer, and outlook on neural networks in astronomy
Royal Society Open Science (RSOS), 2022
Michael J. Smith
James E. Geach
197
48
0
07 Nov 2022
How Much Does Attention Actually Attend? Questioning the Importance of Attention in Pretrained Transformers
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Michael Hassid
Hao Peng
Daniel Rotem
Jungo Kasai
Ivan Montero
Noah A. Smith
Roy Schwartz
232
31
0
07 Nov 2022
Neural Fourier Shift for Binaural Speech Rendering
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Jinkyu Lee
Kyogu Lee
222
14
0
02 Nov 2022
Globally Gated Deep Linear Networks
Neural Information Processing Systems (NeurIPS), 2022
Qianyi Li
H. Sompolinsky
AI4CE
251
15
0
31 Oct 2022
QNet: A Quantum-native Sequence Encoder Architecture
International Conference on Quantum Computing and Engineering (ICQCE), 2022
Wei-Yen Day
Hao-Sheng Chen
Min Sun
247
1
0
31 Oct 2022
Same Pre-training Loss, Better Downstream: Implicit Bias Matters for Language Models
International Conference on Machine Learning (ICML), 2022
Hong Liu
Sang Michael Xie
Zhiyuan Li
Tengyu Ma
AI4CE
320
68
0
25 Oct 2022
MetaFormer Baselines for Vision
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Weihao Yu
Chenyang Si
Pan Zhou
Mi Luo
Yichen Zhou
Jiashi Feng
Shuicheng Yan
Xinchao Wang
MoE
242
270
0
24 Oct 2022
Similarity of Neural Architectures using Adversarial Attack Transferability
European Conference on Computer Vision (ECCV), 2022
Ian Ryu
Dongyoon Han
Byeongho Heo
Song Park
Sanghyuk Chun
Jong-Seok Lee
AAML
538
3
0
20 Oct 2022
Decoupling Features in Hierarchical Propagation for Video Object Segmentation
Neural Information Processing Systems (NeurIPS), 2022
Zongxin Yang
Yi Yang
VOS
317
195
0
18 Oct 2022
SA-MLP: Distilling Graph Knowledge from GNNs into Structure-Aware MLP
Jie Chen
Shouzhen Chen
Mingyuan Bai
Junbin Gao
Junping Zhang
Jian Pu
214
16
0
18 Oct 2022
AutoMoE: Heterogeneous Mixture-of-Experts with Adaptive Computation for Efficient Neural Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Ganesh Jawahar
Subhabrata Mukherjee
Xiaodong Liu
Young Jin Kim
Muhammad Abdul-Mageed
L. Lakshmanan
Ahmed Hassan Awadallah
Sébastien Bubeck
Jianfeng Gao
MoE
181
11
0
14 Oct 2022
Are All Vision Models Created Equal? A Study of the Open-Loop to Closed-Loop Causality Gap
Mathias Lechner
Ramin Hasani
Alexander Amini
Tsun-Hsuan Wang
T. Henzinger
Daniela Rus
CML, OOD
220
7
0
09 Oct 2022
The Lie Derivative for Measuring Learned Equivariance
International Conference on Learning Representations (ICLR), 2022
Nate Gruver
Marc Finzi
Micah Goldblum
A. Wilson
283
52
0
06 Oct 2022
Centralized Feature Pyramid for Object Detection
IEEE Transactions on Image Processing (IEEE TIP), 2022
Yu Quan
Dong Zhang
Liyan Zhang
Jinhui Tang
ObjD
225
237
0
05 Oct 2022
Rethinking Performance Gains in Image Dehazing Networks
Yuda Song
Yang Zhou
Hui Qian
Xin Du
SSeg
169
70
0
23 Sep 2022
Mega: Moving Average Equipped Gated Attention
International Conference on Learning Representations (ICLR), 2022
Xuezhe Ma
Chunting Zhou
Xiang Kong
Junxian He
Liangke Gui
Graham Neubig
Jonathan May
Luke Zettlemoyer
331
217
0
21 Sep 2022
Analysis of Quantization on MLP-based Vision Models
Lingran Zhao
Zhen Dong
Kurt Keutzer
MQ
131
7
0
14 Sep 2022
Pre-Training a Graph Recurrent Network for Language Representation
Yile Wang
Linyi Yang
Zhiyang Teng
M. Zhou
Yue Zhang
GNN
239
1
0
08 Sep 2022
LKD-Net: Large Kernel Convolution Network for Single Image Dehazing
IEEE International Conference on Multimedia and Expo (ICME), 2022
Pinjun Luo
Guoqiang Xiao
Xinbo Gao
Song Wu
145
57
0
05 Sep 2022
gSwin: Gated MLP Vision Model with Hierarchical Structure of Shifted Window
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Mocho Go
Hideyuki Tachibana
ViT
163
11
0
24 Aug 2022
Efficient Attention-free Video Shift Transformers
Adrian Bulat
Brais Martínez
Georgios Tzimiropoulos
ViT
211
1
0
23 Aug 2022
CM-MLP: Cascade Multi-scale MLP with Axial Context Relation Encoder for Edge Segmentation of Medical Image
IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2022
Jinkai Lv
Yuyong Hu
Quanshui Fu
Zhiwang Zhang
Yuqiang Hu
Lin Lv
Guoqing Yang
Jinpeng Li
Yi Zhao
MedIm
170
12
0
23 Aug 2022