2105.08050
Pay Attention to MLPs
17 May 2021
Hanxiao Liu
Zihang Dai
David R. So
Quoc V. Le
AI4CE
Papers citing
"Pay Attention to MLPs"
50 / 303 papers shown
Title
Out-of-distribution generalisation is hard: evidence from ARC-like tasks
George Dimitriadis
Spyridon Samothrakis
12
0
0
14 May 2025
Image Recognition with Online Lightweight Vision Transformer: A Survey
Zherui Zhang
Rongtao Xu
Jie Zhou
Changwei Wang
Xingtian Pei
...
Jiguang Zhang
Li Guo
Longxiang Gao
W. Xu
Shibiao Xu
ViT
113
0
0
06 May 2025
Hadamard product in deep learning: Introduction, Advances and Challenges
Grigorios G. Chrysos
Yongtao Wu
Razvan Pascanu
Philip Torr
V. Cevher
AAML
96
0
0
17 Apr 2025
Exploring Synergistic Ensemble Learning: Uniting CNNs, MLP-Mixers, and Vision Transformers to Enhance Image Classification
Mk Bashar
Ocean Monjur
Samia Islam
Mohammad Galib Shams
Niamul Quader
UQCV
29
0
0
12 Apr 2025
Evaluation of (Un-)Supervised Machine Learning Methods for GNSS Interference Classification with Real-World Data Discrepancies
Lucas Heublein
N. Raichur
Tobias Feigl
Tobias Brieger
Fin Heuer
Lennart Asbach
A. Rügamer
Felix Ott
47
7
0
31 Mar 2025
GmNet: Revisiting Gating Mechanisms From A Frequency View
Yifan Wang
Xu Ma
Yitian Zhang
Zhongruo Wang
Sung-Cheol Kim
Vahid Mirjalili
Vidya Renganathan
Y. Fu
36
0
0
28 Mar 2025
DeepRV: pre-trained spatial priors for accelerated disease mapping
Jhonathan Navott
Daniel Jenson
Seth Flaxman
Elizaveta Semenova
47
0
0
27 Mar 2025
Enabling Heterogeneous Adversarial Transferability via Feature Permutation Attacks
Tao Wu
Tie Luo
AAML
84
0
0
26 Mar 2025
Enhanced Bloom's Educational Taxonomy for Fostering Information Literacy in the Era of Large Language Models
Yiming Luo
Ting Liu
Patrick Cheong-Iao Pang
Dana McKay
Z. Chen
George Buchanan
Shanton Chang
AI4Ed
46
0
0
25 Mar 2025
Changing Base Without Losing Pace: A GPU-Efficient Alternative to MatMul in DNNs
Nir Ailon
Akhiad Bercovich
Omri Weinstein
52
0
0
15 Mar 2025
MP-HSIR: A Multi-Prompt Framework for Universal Hyperspectral Image Restoration
Zhehui Wu
Yong Chen
Naoto Yokoya
Wei He
49
0
0
12 Mar 2025
Speculative Decoding and Beyond: An In-Depth Survey of Techniques
Y. Hu
Zining Liu
Zhenyuan Dong
Tianfan Peng
Bradley McDanel
S. Zhang
88
0
0
27 Feb 2025
Neural Attention: A Novel Mechanism for Enhanced Expressive Power in Transformer Models
Andrew DiGiugno
Ausif Mahmood
33
0
0
24 Feb 2025
Parameter Efficient Mamba Tuning via Projector-targeted Diagonal-centric Linear Transformation
Seokil Ham
H. Kim
Sangmin Woo
Changick Kim
Mamba
157
0
0
21 Nov 2024
Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics
Yaniv Nikankin
Anja Reusch
Aaron Mueller
Yonatan Belinkov
AIFin
LRM
33
21
0
28 Oct 2024
FlickerFusion: Intra-trajectory Domain Generalizing Multi-Agent RL
Woosung Koh
Wonbeen Oh
S. Kim
Suhin Shin
Hyeongjin Kim
Jaein Jang
Junghyun Lee
Se-Young Yun
24
0
0
21 Oct 2024
Use of What-if Scenarios to Help Explain Artificial Intelligence Models for Neonatal Health
Abdullah Mamun
Lawrence D. Devoe
Mark I. Evans
David W. Britt
Judith Klein-Seetharaman
Hassan Ghasemzadeh
11
4
0
12 Oct 2024
On the Adversarial Transferability of Generalized "Skip Connections"
Yisen Wang
Yichuan Mo
Dongxian Wu
Mingjie Li
Xingjun Ma
Zhouchen Lin
AAML
21
2
0
11 Oct 2024
BiPC: Bidirectional Probability Calibration for Unsupervised Domain Adaption
Wenlve Zhou
Zhiheng Zhou
Junyuan Shang
Chang Niu
Mingyue Zhang
Xiyuan Tao
Tianlei Wang
20
0
0
29 Sep 2024
Efficient Motion Prediction: A Lightweight & Accurate Trajectory Prediction Model With Fast Training and Inference Speed
Alexander Prutsch
Horst Bischof
Horst Possegger
26
2
0
24 Sep 2024
Nonlocal Attention Operator: Materializing Hidden Knowledge Towards Interpretable Physics Discovery
Yue Yu
Ning Liu
Fei Lu
Tian Gao
S. Jafarzadeh
Stewart Silling
AI4CE
43
7
0
14 Aug 2024
GlitchProber: Advancing Effective Detection and Mitigation of Glitch Tokens in Large Language Models
Zhibo Zhang
Wuxia Bai
Yuxi Li
M. Meng
K. Wang
Ling Shi
Li Li
Jun Wang
Haoyu Wang
24
4
0
09 Aug 2024
Enhancing Exploratory Learning through Exploratory Search with the Emergence of Large Language Models
Yiming Luo
Patrick Cheong-Iao Pang
Shanton Chang
AI4Ed
34
4
0
09 Aug 2024
Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation
Hyunwoo Yu
Yubin Cho
Beoungwoo Kang
Seunghun Moon
Kyeongbo Kong
Suk-Ju Kang
30
3
0
24 Jul 2024
Coarse-to-Fine Proposal Refinement Framework for Audio Temporal Forgery Detection and Localization
Junyan Wu
Wei Lu
Xiangyang Luo
Rui Yang
Qian Wang
Xiaochun Cao
32
3
0
23 Jul 2024
XTraffic: A Dataset Where Traffic Meets Incidents with Explainability and More
Xiaochuan Gou
Ziyue Li
Tian-Shing Lan
Junpeng Lin
Zhishuai Li
Bingyu Zhao
Chen Zhang
Di Wang
Xiangliang Zhang
AI4TS
44
1
0
16 Jul 2024
Team up GBDTs and DNNs: Advancing Efficient and Effective Tabular Prediction with Tree-hybrid MLPs
Jiahuan Yan
Jintai Chen
Qianxing Wang
D. Z. Chen
Jian Wu
29
3
0
13 Jul 2024
Lite-SAM Is Actually What You Need for Segment Everything
Jianhai Fu
Yuanjie Yu
Ningchuan Li
Yi Zhang
Qichao Chen
Jianping Xiong
Jun Yin
Zhiyu Xiang
VLM
34
4
0
12 Jul 2024
Latent Space Imaging
Matheus Souza
Yidan Zheng
Kaizhang Kang
Yogeshwar Nath Mishra
Qiang Fu
Wolfgang Heidrich
50
0
0
09 Jul 2024
Graph-Guided Test-Time Adaptation for Glaucoma Diagnosis using Fundus Photography
Qian Zeng
Le Zhang
Yipeng Liu
Ce Zhu
Fan Zhang
OOD
MedIm
43
1
0
05 Jul 2024
Multi-Convformer: Extending Conformer with Multiple Convolution Kernels
Darshan Prabhu
Yifan Peng
P. Jyothi
Shinji Watanabe
39
0
0
04 Jul 2024
PosMLP-Video: Spatial and Temporal Relative Position Encoding for Efficient Video Recognition
Y. Hao
Diansong Zhou
Zhicai Wang
Chong-Wah Ngo
Meng Wang
ViT
26
4
0
03 Jul 2024
Multimodal Multilabel Classification by CLIP
Yanming Guo
VLM
21
0
0
23 Jun 2024
Graph Edge Representation via Tensor Product Graph Convolutional Representation
Bo Jiang
Sheng Ge
Ziyan Zhang
Beibei Wang
Jin Tang
Bin Luo
GNN
33
0
0
21 Jun 2024
SMRU: Split-and-Merge Recurrent-based UNet for Acoustic Echo Cancellation and Noise Suppression
Zhihang Sun
Andong Li
Rilin Chen
Hao Zhang
Meng Yu
Yi Zhou
Dong Yu
66
0
0
17 Jun 2024
Spoof Diarization: "What Spoofed When" in Partially Spoofed Audio
Lin Zhang
Xin Wang
Erica Cooper
Mireia Díez
Federico Landini
Nicholas W. D. Evans
Junichi Yamagishi
36
0
0
12 Jun 2024
How Do Neural Spoofing Countermeasures Detect Partially Spoofed Audio?
Tianchi Liu
Lin Zhang
Rohan Kumar Das
Yi Ma
Ruijie Tao
Haizhou Li
36
7
0
04 Jun 2024
Correlation-aware Coarse-to-fine MLPs for Deformable Medical Image Registration
Mingyuan Meng
Da-wei Feng
Lei Bi
Jinman Kim
ViT
MedIm
30
16
0
31 May 2024
SFANet: Spatial-Frequency Attention Network for Weather Forecasting
Jiaze Wang
Hao Chen
Hongcan Xu
Jinpeng Li
Bo-Lan Wang
Kun Shao
Furui Liu
Huaxi Chen
Guangyong Chen
Pheng-Ann Heng
58
0
0
29 May 2024
Aligning in a Compact Space: Contrastive Knowledge Distillation between Heterogeneous Architectures
Hongjun Wu
Li Xiao
Xingkuo Zhang
Yining Miao
38
1
0
28 May 2024
ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention
Bencheng Liao
Xinggang Wang
Lianghui Zhu
Qian Zhang
Chang Huang
45
4
0
28 May 2024
Various Lengths, Constant Speed: Efficient Language Modeling with Lightning Attention
Zhen Qin
Weigao Sun
Dong Li
Xuyang Shen
Weixuan Sun
Yiran Zhong
46
9
0
27 May 2024
Lateralization MLP: A Simple Brain-inspired Architecture for Diffusion
Zizhao Hu
Mohammad Rostami
34
0
0
25 May 2024
Activator: GLU Activation Function as the Core Component of a Vision Transformer
Abdullah Nazhat Abdullah
Tarkan Aydin
ViT
38
0
0
24 May 2024
Scaling Law for Time Series Forecasting
Jingzhe Shi
Qinwei Ma
Huan Ma
Lei Li
AI4TS
31
8
0
24 May 2024
Dielectric Tensor Prediction for Inorganic Materials Using Latent Information from Preferred Potential
Zetian Mao
Wenwen Li
Jethro Tan
30
2
0
15 May 2024
Pruning as a Domain-specific LLM Extractor
Nan Zhang
Yanchi Liu
Xujiang Zhao
Wei Cheng
Runxue Bao
Rui Zhang
Prasenjit Mitra
Haifeng Chen
19
9
0
10 May 2024
An Advanced Features Extraction Module for Remote Sensing Image Super-Resolution
Naveed Sultan
Amir Hajian
S. Aramvith
SupR
21
6
0
07 May 2024
Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges
Badri N. Patro
Vijay Srinivas Agneeswaran
Mamba
35
38
0
24 Apr 2024
ODMixer: Fine-grained Spatial-temporal MLP for Metro Origin-Destination Prediction
Yang Liu
Binglin Chen
Yongsen Zheng
Lechao Cheng
Guanbin Li
Liang Lin
AI4TS
24
0
0
24 Apr 2024