2105.08050
Pay Attention to MLPs
Neural Information Processing Systems (NeurIPS), 2021
17 May 2021
Hanxiao Liu
Zihang Dai
David R. So
Quoc V. Le
AI4CE
Papers citing "Pay Attention to MLPs"
50 / 323 papers shown
"Why the face?": Exploring Robot Error Detection Using Instrumented Bystander Reactions
Maria Teresa Parreira
Ruidong Zhang
Sukruth Gowdru Lingaraju
Alexandra Bremers
Xuanyu Fang
Adolfo G. Ramirez-Aristizabal
Manaswi Saha
Michael Kuniavsky
Cheng Zhang
Wendy Ju
28
0
0
29 Nov 2025
DiffPixelFormer: Differential Pixel-Aware Transformer for RGB-D Indoor Scene Segmentation
Yan Gong
J. Lu
Yongsheng Gao
Jie Zhao
X. Zhang
Susanto Rahardja
112
0
0
17 Nov 2025
Generating Sketches in a Hierarchical Auto-Regressive Process for Flexible Sketch Drawing Manipulation at Stroke-Level
Sicong Zang
Shuhui Gao
Zhijun Fang
151
0
0
11 Nov 2025
SAGS: Self-Adaptive Alias-Free Gaussian Splatting for Dynamic Surgical Endoscopic Reconstruction
Wenfeng Huang
X. Liao
Yinling Qian
Hao Liu
Yongming Yang
Wenjing Jia
Qiong Wang
3DGS
179
0
0
31 Oct 2025
Kelle: Co-design KV Caching and eDRAM for Efficient LLM Serving in Edge Computing
Tianhua Xia
Sai Qian Zhang
89
1
0
16 Oct 2025
Flow-Matching Guided Deep Unfolding for Hyperspectral Image Reconstruction
Yi Ai
Yuanhao Cai
Yulun Zhang
Xiaokang Yang
153
0
0
02 Oct 2025
Short window attention enables long-term memorization
Loic Cabannes
Maximilian Beck
Gergely Szilvasy
Matthijs Douze
Maria Lomeli
Jade Copet
Pierre-Emmanuel Mazaré
Gabriel Synnaeve
Hervé Jégou
132
1
0
29 Sep 2025
CURA: Size Isnt All You Need - A Compact Universal Architecture for On-Device Intelligence
Jae-Bum Seo
Muhammad Salman
Lismer Andres Caceres-Najarro
99
0
0
29 Sep 2025
JaneEye: A 12-nm 2K-FPS 18.9-μJ/Frame Event-based Eye Tracking Accelerator
Tao Han
Ang Li
Qinyu Chen
Chang Gao
111
0
0
18 Sep 2025
SAC-MIL: Spatial-Aware Correlated Multiple Instance Learning for Histopathology Whole Slide Image Classification
Yu Bai
Zitong Yu
Haowen Tian
X. Wang
Shuo Yan
...
Zheng Zhang
Wufan Wang
Hui Gao
Xiangyang Gong
Wendong Wang
112
0
0
04 Sep 2025
Multi-level SSL Feature Gating for Audio Deepfake Detection
Hoan My Tran
Damien Lolive
Aghilas Sini
Arnaud Delhay
Pierre-François Marteau
David Guennec
132
1
0
03 Sep 2025
Image Quality Assessment for Machines: Paradigm, Large-scale Database, and Models
Xiaoqi Wang
Yun Zhang
Weisi Lin
140
0
0
27 Aug 2025
Learning to See Through Flare
Xiaopeng Peng
Heath Gemar
Erin F. Fleet
Kyle Novak
A. Watnik
Grover A. Swartzlander
119
1
0
19 Aug 2025
Keyword Mamba: Spoken Keyword Spotting with State Space Models
Hanyu Ding
Wenlong Dong
Qirong Mao
Mamba
112
1
0
10 Aug 2025
ClinicalFMamba: Advancing Clinical Assessment using Mamba-based Multimodal Neuroimaging Fusion
Meng Zhou
Farzad Khalvati
Mamba
135
0
0
05 Aug 2025
MLP Memory: A Retriever-Pretrained Memory for Large Language Models
Rubin Wei
Jiaqi Cao
Jiarui Wang
Jushi Kai
Qipeng Guo
Bowen Zhou
Zhouhan Lin
RALM
259
0
0
03 Aug 2025
Implicit Counterfactual Learning for Audio-Visual Segmentation
Mingfeng Zha
Tianyu Li
G. Wang
Peng Wang
Yangyang Wu
Yang Yang
Heng Tao Shen
VOS
CML
163
1
0
28 Jul 2025
Multi-Task Dense Prediction Fine-Tuning with Mixture of Fine-Grained Experts
Yangyang Xu
Xi Ye
Duo Su
MoE
MoMe
224
0
0
25 Jul 2025
TTMBA: Towards Text To Multiple Sources Binaural Audio Generation
Yuxuan He
Xiaoran Yang
Ningning Pan
Gongping Huang
167
0
0
22 Jul 2025
evMLP: An Efficient Event-Driven MLP Architecture for Vision
Zhentan Zheng
VLM
237
0
0
02 Jul 2025
Decomposing MLP Activations into Interpretable Features via Semi-Nonnegative Matrix Factorization
Or Shafran
Atticus Geiger
Mor Geva
MILM
353
1
0
12 Jun 2025
Same Task, Different Circuits: Disentangling Modality-Specific Mechanisms in VLMs
Yaniv Nikankin
Dana Arad
Yossi Gandelsman
Yonatan Belinkov
312
6
0
10 Jun 2025
Optimal Weighted Convolution for Classification and Denosing
Simone Cammarasana
Giuseppe Patané
133
1
0
30 May 2025
Precise In-Parameter Concept Erasure in Large Language Models
Yoav Gur-Arieh
Clara Suslik
Yihuai Hong
Fazl Barez
Mor Geva
KELM
MU
408
3
0
28 May 2025
Out-of-distribution generalisation is hard: evidence from ARC-like tasks
George Dimitriadis
Spyridon Samothrakis
358
0
0
14 May 2025
Image Recognition with Online Lightweight Vision Transformer: A Survey
Zherui Zhang
Rongtao Xu
Jie Zhou
Changwei Wang
Xingtian Pei
...
Jiguang Zhang
Li Guo
Longxiang Gao
Wenyuan Xu
Shibiao Xu
ViT
1.1K
2
0
06 May 2025
Hadamard product in deep learning: Introduction, Advances and Challenges
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Grigorios G. Chrysos
Yongtao Wu
Razvan Pascanu
Philip Torr
Volkan Cevher
AAML
342
14
0
17 Apr 2025
Exploring Synergistic Ensemble Learning: Uniting CNNs, MLP-Mixers, and Vision Transformers to Enhance Image Classification
Mk Bashar
Ocean Monjur
Samia Islam
Mohammad Galib Shams
Niamul Quader
UQCV
254
1
0
12 Apr 2025
Evaluation of (Un-)Supervised Machine Learning Methods for GNSS Interference Classification with Real-World Data Discrepancies
Lucas Heublein
Nisha Lakshmana Raichur
Tobias Feigl
Tobias Brieger
Fin Heuer
Lennart Asbach
A. Rügamer
Felix Ott
448
10
0
31 Mar 2025
GmNet: Revisiting Gating Mechanisms From A Frequency View
Yifan Wang
Xu Ma
Yitian Zhang
Zhongruo Wang
Sung-Cheol Kim
Vahid Mirjalili
Vidya Renganathan
Y. Fu
340
0
0
28 Mar 2025
DeepRV: Accelerating spatiotemporal inference with pre-trained neural priors
Jhonathan Navott
Daniel Jenson
Seth Flaxman
Elizaveta Semenova
287
0
0
27 Mar 2025
Enabling Heterogeneous Adversarial Transferability via Feature Permutation Attacks
Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2025
Tao Wu
Tie Luo
AAML
375
0
0
26 Mar 2025
Enhanced Bloom's Educational Taxonomy for Fostering Information Literacy in the Era of Large Language Models
Yiming Luo
Ting Liu
Patrick Cheong-Iao Pang
Dana McKay
Zhongfu Chen
George Buchanan
Shanton Chang
AI4Ed
216
3
0
25 Mar 2025
Changing Base Without Losing Pace: A GPU-Efficient Alternative to MatMul in DNNs
Nir Ailon
Akhiad Bercovich
Yahel Uffenheimer
Omri Weinstein
457
3
0
15 Mar 2025
MP-HSIR: A Multi-Prompt Framework for Universal Hyperspectral Image Restoration
Zhehui Wu
Yong Chen
Xiangwei Zhu
Wei He
324
3
0
12 Mar 2025
Speculative Decoding and Beyond: An In-Depth Survey of Techniques
Y. Hu
Zining Liu
Zhenyuan Dong
Tianfan Peng
Bradley McDanel
Shanghang Zhang
708
0
0
27 Feb 2025
Neural Attention: A Novel Mechanism for Enhanced Expressive Power in Transformer Models
Andrew DiGiugno
Ausif Mahmood
330
0
0
24 Feb 2025
MSECG: Incorporating Mamba for Robust and Efficient ECG Super-Resolution
Jie Lin
I-Hsiang Chiu
Kuan-Chen Wang
Kai-Chun Liu
H. Wang
Ping-Cheng Yeh
Yu Tsao
Mamba
242
2
0
06 Dec 2024
Parameter Efficient Mamba Tuning via Projector-targeted Diagonal-centric Linear Transformation
Computer Vision and Pattern Recognition (CVPR), 2024
Seokil Ham
H. Kim
Sangmin Woo
Changick Kim
Mamba
1.1K
2
0
21 Nov 2024
Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics
International Conference on Learning Representations (ICLR), 2024
Yaniv Nikankin
Anja Reusch
Aaron Mueller
Yonatan Belinkov
AIFin
LRM
362
61
0
28 Oct 2024
Prototypical Extreme Multi-label Classification with a Dynamic Margin Loss
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Kunal Dahiya
Diego Ortego
David Jiménez
255
1
0
27 Oct 2024
FlickerFusion: Intra-trajectory Domain Generalizing Multi-Agent RL
Woosung Koh
Wonbeen Oh
S. Kim
Suhin Shin
Hyeongjin Kim
Jaein Jang
Junghyun Lee
Se-Young Yun
446
0
0
21 Oct 2024
Use of What-if Scenarios to Help Explain Artificial Intelligence Models for Neonatal Health
Abdullah Mamun
Lawrence D. Devoe
Mark I. Evans
David W. Britt
Judith Klein-Seetharaman
Hassan Ghasemzadeh
171
7
0
12 Oct 2024
On the Adversarial Transferability of Generalized "Skip Connections"
Yisen Wang
Yichuan Mo
Dongxian Wu
Mingjie Li
Jiabo He
Zhouchen Lin
AAML
277
3
0
11 Oct 2024
BiPC: Bidirectional Probability Calibration for Unsupervised Domain Adaption
Expert Systems with Applications (ESWA), 2024
Wenlve Zhou
Zhiheng Zhou
Junyuan Shang
Chang Niu
Mingyue Zhang
Xiyuan Tao
Tianlei Wang
264
1
0
29 Sep 2024
Efficient Motion Prediction: A Lightweight & Accurate Trajectory Prediction Model With Fast Training and Inference Speed
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024
Alexander Prutsch
Horst Bischof
Horst Possegger
223
7
0
24 Sep 2024
Nonlocal Attention Operator: Materializing Hidden Knowledge Towards Interpretable Physics Discovery
Neural Information Processing Systems (NeurIPS), 2024
Yue Yu
Ning Liu
Fei Lu
Tian Gao
S. Jafarzadeh
Stewart Silling
AI4CE
253
21
0
14 Aug 2024
GlitchProber: Advancing Effective Detection and Mitigation of Glitch Tokens in Large Language Models
International Conference on Automated Software Engineering (ASE), 2024
Zhibo Zhang
Wuxia Bai
Yuxi Li
Max Meng
Kaidi Wang
Ling Shi
Li Li
Jun Wang
Haoyu Wang
205
5
0
09 Aug 2024
Enhancing Exploratory Learning through Exploratory Search with the Emergence of Large Language Models
Hawaii International Conference on System Sciences (HICSS), 2024
Yiming Luo
Patrick Cheong-Iao
Shanton Chang
AI4Ed
302
5
0
09 Aug 2024
Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation
Hyunwoo Yu
Yubin Cho
Beoungwoo Kang
Seunghun Moon
Kyeongbo Kong
Suk-Ju Kang
240
11
0
24 Jul 2024