Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2105.01601
Cited By
MLP-Mixer: An all-MLP Architecture for Vision
4 May 2021
Ilya O. Tolstikhin
N. Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
Thomas Unterthiner
Jessica Yung
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
Mario Lucic
Alexey Dosovitskiy
Re-assign community
ArXiv
PDF
HTML
Papers citing
"MLP-Mixer: An all-MLP Architecture for Vision"
17 / 17 papers shown
Title
SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer
Young-Hu Park
R.-H. Park
Hyung-Min Park
4
0
0
07 May 2025
Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook
Muyi Bao
Shuchang Lyu
Zhaoyang Xu
Huiyu Zhou
Jinchang Ren
Shiming Xiang
X. Li
Guangliang Cheng
Mamba
59
0
0
01 May 2025
Unsupervised 2D-3D lifting of non-rigid objects using local constraints
Shalini Maiti
Lourdes Agapito
Benjamin Graham
16
30
0
27 Apr 2025
A Spatially-Aware Multiple Instance Learning Framework for Digital Pathology
H. Keshvarikhojasteh
Mihail Tifrea
Sibylle Hess
J. Pluim
M. Veta
25
0
0
24 Apr 2025
QuantBench: Benchmarking AI Methods for Quantitative Investment
Saizhuo Wang
Hao Kong
Jiadong Guo
Fengrui Hua
Yiyan Qi
Wanyun Zhou
Jiahao Zheng
Xinyu Wang
Lionel M. Ni
Jian Guo
19
61
0
24 Apr 2025
TabKAN: Advancing Tabular Data Analysis using Kolmogorov-Arnold Network
Ali Eslamian
Alireza Afzal Aghaei
Qiang Cheng
LMTD
39
0
0
09 Apr 2025
Joint-Embedding Masked Autoencoder for Self-supervised Learning of Dynamic Functional Connectivity from the Human Brain
Jungwon Choi
Hyungi Lee
Byung-Hoon Kim
Juho Lee
19
0
0
11 Mar 2024
An Empirical Study of Pre-trained Model Selection for Out-of-Distribution Generalization and Calibration
Hiroki Naganuma
Ryuichiro Hataya
Kotaro Yoshida
Ioannis Mitliagkas
OODD
56
1
0
17 Jul 2023
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
247
2,898
0
24 Feb 2021
LambdaNetworks: Modeling Long-Range Interactions Without Attention
Irwan Bello
238
165
0
17 Feb 2021
High-Performance Large-Scale Image Recognition Without Normalization
Andrew Brock
Soham De
Samuel L. Smith
Karen Simonyan
VLM
199
450
0
11 Feb 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
268
2,875
0
11 Feb 2021
Bottleneck Transformers for Visual Recognition
A. Srinivas
Tsung-Yi Lin
Niki Parmar
Jonathon Shlens
Pieter Abbeel
Ashish Vaswani
SLR
232
840
0
27 Jan 2021
Towards Learning Convolutions from Scratch
Behnam Neyshabur
SSL
181
63
0
27 Jul 2020
Meta Pseudo Labels
Hieu H. Pham
Zihang Dai
Qizhe Xie
Minh-Thang Luong
Quoc V. Le
VLM
230
583
0
23 Mar 2020
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
Andrew G. Howard
Menglong Zhu
Bo Chen
Dmitry Kalenichenko
Weijun Wang
Tobias Weyand
M. Andreetto
Hartwig Adam
3DH
924
18,450
0
17 Apr 2017
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie
Ross B. Girshick
Piotr Dollár
Z. Tu
Kaiming He
239
6,278
0
16 Nov 2016
1