Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2108.13002
Cited By
A Battle of Network Structures: An Empirical Study of CNN, Transformer, and MLP
30 August 2021
Yucheng Zhao
Guangting Wang
Chuanxin Tang
Chong Luo
Wenjun Zeng
Zhengjun Zha
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Battle of Network Structures: An Empirical Study of CNN, Transformer, and MLP"
30 / 30 papers shown
Title
Joint Resource Management for Energy-efficient UAV-assisted SWIPT-MEC: A Deep Reinforcement Learning Approach
Yue Chen
Hui Kang
Jiahui Li
Geng Su
Boxiong Wang
Jiacheng Wang
Cong Liang
Shuang Liang
Dusit Niyato
49
0
0
06 May 2025
Exploring Synergistic Ensemble Learning: Uniting CNNs, MLP-Mixers, and Vision Transformers to Enhance Image Classification
Mk Bashar
Ocean Monjur
Samia Islam
Mohammad Galib Shams
Niamul Quader
UQCV
29
0
0
12 Apr 2025
Spectral-Adaptive Modulation Networks for Visual Perception
Guhnoo Yun
J. Yoo
Kijung Kim
Jeongho Lee
Paul Hongsuck Seo
Dong Hwan Kim
39
0
0
31 Mar 2025
A Transformer-in-Transformer Network Utilizing Knowledge Distillation for Image Recognition
Dewan Tauhid Rahman
Yeahia Sarker
Antar Mazumder
Md. Shamim Anower
ViT
46
0
0
24 Feb 2025
Exploring Real&Synthetic Dataset and Linear Attention in Image Restoration
Yuzhen Du
Teng Hu
J. Zhang
Ran Yi Chengming Xu
Xiaobin Hu
Kai WU
Donghao Luo
Y. Wang
Lizhuang Ma
83
1
0
05 Dec 2024
DBF-Net: A Dual-Branch Network with Feature Fusion for Ultrasound Image Segmentation
Guoping Xu
Ximing Wu
Wentao Liao
Xinglong Wu
Qing Huang
Chang Li
20
0
0
17 Nov 2024
Ophthalmic Biomarker Detection with Parallel Prediction of Transformer and Convolutional Architecture
Md. Touhidul Islam
Md. Abtahi Majeed Chowdhury
Mahmudul Hasan
Asif Quadir
Lutfa Aktar
MedIm
14
1
0
26 Sep 2024
Evaluating the Efficacy of Prompt-Engineered Large Multimodal Models Versus Fine-Tuned Vision Transformers in Image-Based Security Applications
Fouad Trad
Ali Chehab
MLLM
32
2
0
26 Mar 2024
Activating Wider Areas in Image Super-Resolution
Cheng Cheng
Hang Wang
Hongbin Sun
34
10
0
13 Mar 2024
Knowledge Translation: A New Pathway for Model Compression
Wujie Sun
Defang Chen
Jiawei Chen
Yan Feng
Chun-Yen Chen
Can Wang
23
0
0
11 Jan 2024
MLPST: MLP is All You Need for Spatio-Temporal Prediction
Zijian Zhang
Ze Huang
Zhiwei Hu
Xiangyu Zhao
Wanyu Wang
Zitao Liu
Junbo Zhang
S. Qin
Hongwei Zhao
AI4TS
15
27
0
23 Sep 2023
HAT: Hybrid Attention Transformer for Image Restoration
Xiangyu Chen
Xintao Wang
Wenlong Zhang
Xiangtao Kong
Yu Qiao
Jiantao Zhou
Chao Dong
24
44
0
11 Sep 2023
SCSC: Spatial Cross-scale Convolution Module to Strengthen both CNNs and Transformers
Xijun Wang
Xiaojie Chu
Chunrui Han
Xiangyu Zhang
ViT
18
1
0
14 Aug 2023
Efficient Deep Spiking Multi-Layer Perceptrons with Multiplication-Free Inference
Boyan Li
Luziwei Leng
Shuaijie Shen
Kaixuan Zhang
Jianguo Zhang
Jianxing Liao
Ran Cheng
26
7
0
21 Jun 2023
SkyGPT: Probabilistic Short-term Solar Forecasting Using Synthetic Sky Videos from Physics-constrained VideoGPT
Yuhao Nie
E. Zelikman
Andea Scott
Quentin Paletta
A. Brandt
26
3
0
20 Jun 2023
MLP-AIR: An Efficient MLP-Based Method for Actor Interaction Relation Learning in Group Activity Recognition
Guoliang Xu
Jianqin Yin
19
1
0
18 Apr 2023
MetaFormer Baselines for Vision
Weihao Yu
Chenyang Si
Pan Zhou
Mi Luo
Yichen Zhou
Jiashi Feng
Shuicheng Yan
Xinchao Wang
MoE
23
156
0
24 Oct 2022
Image Semantic Relation Generation
Mingzhe Du
16
0
0
19 Oct 2022
Bridging the Gap Between Vision Transformers and Convolutional Neural Networks on Small Datasets
Zhiying Lu
Hongtao Xie
Chuanbin Liu
Yongdong Zhang
ViT
10
57
0
12 Oct 2022
MAPLE: Masked Pseudo-Labeling autoEncoder for Semi-supervised Point Cloud Action Recognition
Xiaodong Chen
Wu Liu
Xinchen Liu
Yongdong Zhang
Jungong Han
Tao Mei
3DPC
41
12
0
01 Sep 2022
Peripheral Vision Transformer
Juhong Min
Yucheng Zhao
Chong Luo
Minsu Cho
ViT
MDE
24
30
0
14 Jun 2022
Inception Transformer
Chenyang Si
Weihao Yu
Pan Zhou
Yichen Zhou
Xinchao Wang
Shuicheng Yan
ViT
26
187
0
25 May 2022
Accelerating the Training of Video Super-Resolution Models
Lijian Lin
Xintao Wang
Zhongang Qi
Ying Shan
30
3
0
10 May 2022
Activating More Pixels in Image Super-Resolution Transformer
Xiangyu Chen
Xintao Wang
Jiantao Zhou
Yu Qiao
Chao Dong
ViT
59
600
0
09 May 2022
CodedVTR: Codebook-based Sparse Voxel Transformer with Geometric Guidance
Tianchen Zhao
Niansong Zhang
Xuefei Ning
He-Nan Wang
Li Yi
Yu Wang
3DPC
ViT
22
8
0
18 Mar 2022
Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs
Xiaohan Ding
X. Zhang
Yi Zhou
Jungong Han
Guiguang Ding
Jian-jun Sun
VLM
47
528
0
13 Mar 2022
LighTN: Light-weight Transformer Network for Performance-overhead Tradeoff in Point Cloud Downsampling
Xu Wang
Yi Jin
Yigang Cen
Tao Wang
Bowen Tang
Yidong Li
ViT
12
28
0
13 Feb 2022
Hformer: Hybrid CNN-Transformer for Fringe Order Prediction in Phase Unwrapping of Fringe Projection
Xinjun Zhu
Zhiqiang Han
Mengkai Yuan
Qinghua Guo
Hongyi Wang
14
4
0
13 Dec 2021
MLP-Mixer: An all-MLP Architecture for Vision
Ilya O. Tolstikhin
N. Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
...
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
Mario Lucic
Alexey Dosovitskiy
259
2,603
0
04 May 2021
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
Andrew G. Howard
Menglong Zhu
Bo Chen
Dmitry Kalenichenko
Weijun Wang
Tobias Weyand
M. Andreetto
Hartwig Adam
3DH
950
20,561
0
17 Apr 2017
1