Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2210.13452
Cited By
MetaFormer Baselines for Vision
24 October 2022
Weihao Yu
Chenyang Si
Pan Zhou
Mi Luo
Yichen Zhou
Jiashi Feng
Shuicheng Yan
Xinchao Wang
MoE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"MetaFormer Baselines for Vision"
50 / 88 papers shown
Title
False Promises in Medical Imaging AI? Assessing Validity of Outperformance Claims
Evangelia Christodoulou
Annika Reinke
Pascaline Andrè
Patrick Godau
P. Kalinowski
...
Amber L. Simpson
A. Kopp-Schneider
Gaël Varoquaux
O. Colliot
Lena Maier-Hein
33
0
0
07 May 2025
Vision-Centric Representation-Efficient Fine-Tuning for Robust Universal Foreground Segmentation
Guoyi Zhang
Siyang Chen
Guangsheng Xu
Han Wang
Xiaohu Zhang
29
0
0
20 Apr 2025
HDBFormer: Efficient RGB-D Semantic Segmentation with A Heterogeneous Dual-Branch Framework
Shuobin Wei
Zhuang Zhou
Zhengan Lu
Zizhao Yuan
Binghua Su
MDE
42
0
0
18 Apr 2025
LightFormer: A lightweight and efficient decoder for remote sensing image segmentation
Sihang Chen
Lijun Yun
Z. Liu
JianFeng Zhu
J. Chen
Hui Wang
Yueping Nie
24
0
0
15 Apr 2025
Decision SpikeFormer: Spike-Driven Transformer for Decision Making
Wei Huang
Qinying Gu
Nanyang Ye
OffRL
29
1
0
04 Apr 2025
RBT4DNN: Requirements-based Testing of Neural Networks
Nusrat Jahan Mozumder
Felipe Toledo
Swaroopa Dola
Matthew B. Dwyer
AAML
46
1
0
03 Apr 2025
Learned Image Compression with Dictionary-based Entropy Model
Jingbo Lu
Leheng Zhang
Xingyu Zhou
Mu-Wei Li
Wen Li
Shuhang Gu
37
0
0
01 Apr 2025
Spectral-Adaptive Modulation Networks for Visual Perception
Guhnoo Yun
J. Yoo
Kijung Kim
Jeongho Lee
Paul Hongsuck Seo
Dong Hwan Kim
32
0
0
31 Mar 2025
UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines
Chen Tang
Xinzhu Ma
Encheng Su
Xiufeng Song
Xiaohong Liu
Wei-Hong Li
Lei Bai
Wanli Ouyang
Xiangyu Yue
3DGS
AI4TS
67
0
0
26 Mar 2025
Shape Bias and Robustness Evaluation via Cue Decomposition for Image Classification and Segmentation
Edgar Heinert
Thomas Gottwald
Annika Mütze
Matthias Rottmann
55
0
0
16 Mar 2025
SDTrack: A Baseline for Event-based Tracking via Spiking Neural Networks
Yimeng Shan
Zhenbang Ren
Haodi Wu
Wenjie Wei
Rui-jie Zhu
...
Jason Eshraghian
Haicheng Qu
J. Zhang
Malu Zhang
Y. Yang
39
0
0
09 Mar 2025
Semi-Supervised Learning for Dose Prediction in Targeted Radionuclide: A Synthetic Data Study
Jing Zhang
Alexandre Bousse
Laetitia Imbert
Song Xue
Kuangyu Shi
Julien Bert
73
0
0
07 Mar 2025
Disentangling Visual Transformers: Patch-level Interpretability for Image Classification
Guillaume Jeanneret
Loïc Simon
F. Jurie
ViT
44
0
0
24 Feb 2025
Predicting Satisfied User and Machine Ratio for Compressed Images: A Unified Approach
Qi Zhang
Shanshe Wang
Xinfeng Zhang
Siwei Ma
Jingshan Pan
Wen Gao
21
0
0
23 Dec 2024
Learning Dynamic Local Context Representations for Infrared Small Target Detection
Guoyi Zhang
Guangsheng Xu
Han Wang
Siyang Chen
Yunxiao Shan
Xiaohu Zhang
29
1
0
23 Dec 2024
Token Cropr: Faster ViTs for Quite a Few Tasks
Benjamin Bergner
C. Lippert
Aravindh Mahendran
ViT
VLM
64
0
0
01 Dec 2024
Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training
Man Yao
Xuerui Qiu
Tianxiang Hu
J. Hu
Yuhong Chou
Keyu Tian
Jianxing Liao
Luziwei Leng
Bo Xu
Guoqi Li
74
4
0
25 Nov 2024
Three Cars Approaching within 100m! Enhancing Distant Geometry by Tri-Axis Voxel Scanning for Camera-based Semantic Scene Completion
Jongseong Bae
Junwoo Ha
Ha Young Kim
79
0
0
25 Nov 2024
MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map
Yuhong Chou
Man Yao
Kexin Wang
Yuqi Pan
Ruijie Zhu
Yiran Zhong
Yu Qiao
J. Wu
Bo Xu
Guoqi Li
46
4
0
16 Nov 2024
Local Lesion Generation is Effective for Capsule Endoscopy Image Data Augmentation in a Limited Data Setting
Adrian B. Chłopowiec
Adam R. Chłopowiec
Krzysztof Galus
Wojciech Cebula
Martin Tabakov
MedIm
28
0
0
05 Nov 2024
ByteNet: Rethinking Multimedia File Fragment Classification through Visual Perspectives
Wenyang Liu
Kejun Wu
Tianyi Liu
Yi Wang
Kim-Hui Yap
Lap-Pui Chau
16
3
0
28 Oct 2024
MoH: Multi-Head Attention as Mixture-of-Head Attention
Peng Jin
Bo Zhu
Li Yuan
Shuicheng Yan
MoE
29
13
0
15 Oct 2024
Unveiling the Backbone-Optimizer Coupling Bias in Visual Representation Learning
Siyuan Li
Juanxi Tian
Zedong Wang
Luyuan Zhang
Zicheng Liu
Weiyang Jin
Yang Liu
Baigui Sun
Stan Z. Li
16
0
0
08 Oct 2024
On Efficient Variants of Segment Anything Model: A Survey
Xiaorui Sun
J. Liu
H. Shen
Xiaofeng Zhu
Ping Hu
VLM
43
4
0
07 Oct 2024
Polyp-SES: Automatic Polyp Segmentation with Self-Enriched Semantic Model
Quang Vinh Nguyen
Thanh Hoang Son Vo
Sae-Ryung Kang
Soo-Hyung Kim
19
0
0
02 Oct 2024
A new baseline for edge detection: Make Encoder-Decoder great again
Yachuan Li
Xavier Soria Pomab
Yongke Xi
Guanlin Li
Chaozhi Yang
Qian Xiao
Yun Bai
Zongmin Li
18
0
0
23 Sep 2024
Kolmogorov-Arnold Transformer
Xingyi Yang
Xinchao Wang
34
15
0
16 Sep 2024
Investigation of Hierarchical Spectral Vision Transformer Architecture for Classification of Hyperspectral Imagery
Wei Liu
Saurabh Prasad
Melba M. Crawford
30
3
0
14 Sep 2024
VFA: Vision Frequency Analysis of Foundation Models and Human
Mohammad Javad Darvishi Bayazi
Md Rifat Arefin
Jocelyn Faubert
Irina Rish
VLM
29
1
0
09 Sep 2024
SDformerFlow: Spatiotemporal swin spikeformer for event-based optical flow estimation
Yi Tian
Juan Andrade-Cetto
27
0
0
06 Sep 2024
Accuracy Improvement of Cell Image Segmentation Using Feedback Former
Hinako Mitsuoka
Kazuhiro Hotta
ViT
MedIm
20
0
0
23 Aug 2024
HcNet: Image Modeling with Heat Conduction Equation
Zhemin Zhang
Xun Gong
DiffM
3DV
33
0
0
12 Aug 2024
U-DECN: End-to-End Underwater Object Detection ConvNet with Improved DeNoising Training
Zhuoyan Liu
Bo Wang
Ye Li
ViT
25
0
0
11 Aug 2024
CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications
Tianfang Zhang
Lei Li
Yang Zhou
Wentao Liu
Chen Qian
Xiangyang Ji
ViT
28
9
0
07 Aug 2024
Dilated Convolution with Learnable Spacings makes visual models more aligned with humans: a Grad-CAM study
Rabih Chamas
Ismail Khalfaoui-Hassani
T. Masquelier
21
0
0
06 Aug 2024
Unsupervised Representation Learning by Balanced Self Attention Matching
Daniel Shalam
Simon Korman
SSL
31
0
0
04 Aug 2024
AVESFormer: Efficient Transformer Design for Real-Time Audio-Visual Segmentation
Zili Wang
Qi Yang
Linsu Shi
Jiazhong Yu
M. Tanveer
Fei Li
Shiming Xiang
VOS
14
1
0
03 Aug 2024
Multiple Contexts and Frequencies Aggregation Network forDeepfake Detection
Zifeng Li
Wen-Jun Tang
Shijun Gao
Shuai Wang
Yanxiang Wang
CVBM
26
2
0
03 Aug 2024
Exploring the Effect of Dataset Diversity in Self-Supervised Learning for Surgical Computer Vision
Tim J. M. Jaspers
Ronald L.P.D. de Jong
Yasmina Alkhalil
Tijn Zeelenberg
C. H. Kusters
...
Franciscus Hendericus Aäron Bakker
J P Ruurda
Willem M. Brinkman
Peter H. N. de With
Fons van der Sommen
27
1
0
25 Jul 2024
FC3DNet: A Fully Connected Encoder-Decoder for Efficient Demoiréing
Zhibo Du
Long Peng
Yang Wang
Yang Cao
Zheng-Jun Zha
3DH
34
1
0
21 Jun 2024
Hierarchical Associative Memory, Parallelized MLP-Mixer, and Symmetry Breaking
Ryo Karakida
Toshihiro Ota
Masato Taki
27
2
0
18 Jun 2024
The 3D-PC: a benchmark for visual perspective taking in humans and machines
Drew Linsley
Peisen Zhou
A. Ashok
Akash Nagaraj
Gaurav Gaonkar
Francis E Lewis
Zygmunt Pizlo
Thomas Serre
41
6
0
06 Jun 2024
Dinomaly: The Less Is More Philosophy in Multi-Class Unsupervised Anomaly Detection
Jia Guo
Shuai Lu
Weihang Zhang
Huiqi Li
Huiqi Li
Hongen Liao
ViT
56
7
0
23 May 2024
LookHere: Vision Transformers with Directed Attention Generalize and Extrapolate
A. Fuller
Daniel G. Kyrollos
Yousef Yassin
James R. Green
34
2
0
22 May 2024
Hierarchical Selective Classification
Shani Goren
Ido Galil
Ran El-Yaniv
BDL
44
1
0
19 May 2024
EfficientTrain++: Generalized Curriculum Learning for Efficient Visual Backbone Training
Yulin Wang
Yang Yue
Rui Lu
Yizeng Han
Shiji Song
Gao Huang
VLM
40
12
0
14 May 2024
MambaOut: Do We Really Need Mamba for Vision?
Weihao Yu
Xinchao Wang
Mamba
39
46
0
13 May 2024
SOPHON: Non-Fine-Tunable Learning to Restrain Task Transferability For Pre-trained Models
Jiangyi Deng
Shengyuan Pang
Yanjiao Chen
Liangming Xia
Yijie Bai
Haiqin Weng
Wenyuan Xu
AAML
23
6
0
19 Apr 2024
Unsegment Anything by Simulating Deformation
Jiahao Lu
Xingyi Yang
Xinchao Wang
29
4
0
03 Apr 2024
Efficient Modulation for Vision Networks
Xu Ma
Xiyang Dai
Jianwei Yang
Bin Xiao
Yinpeng Chen
Yun Fu
Lu Yuan
33
17
0
29 Mar 2024
1
2
Next