Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2205.03436
Cited By
v1
v2 (latest)
EdgeViTs: Competing Light-weight CNNs on Mobile Devices with Vision Transformers
European Conference on Computer Vision (ECCV), 2022
6 May 2022
Junting Pan
Adrian Bulat
Fuwen Tan
Xiatian Zhu
Łukasz Dudziak
Jiaming Song
Georgios Tzimiropoulos
Brais Martínez
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Github (107★)
Papers citing
"EdgeViTs: Competing Light-weight CNNs on Mobile Devices with Vision Transformers"
50 / 99 papers shown
Rethinking Vision Transformer Depth via Structural Reparameterization
Chengwei Zhou
Vipin Chaudhary
Gourav Datta
ViT
157
0
0
24 Nov 2025
Hybrid Convolution and Vision Transformer NAS Search Space for TinyML Image Classification
Mikhael Djajapermana
Moritz Reiber
Daniel Mueller-Gritschneder
Ulf Schlichtmann
ViT
146
0
0
04 Nov 2025
WaveSeg: Enhancing Segmentation Precision via High-Frequency Prior and Mamba-Driven Spectrum Decomposition
Guoan Xu
Yang Xiao
Wenjing Jia
Guangwei Gao
Guo-Jun Qi
Chia-Wen Lin
Mamba
265
0
0
24 Oct 2025
I-Segmenter: Integer-Only Vision Transformer for Efficient Semantic Segmentation
Jordan Sassoon
Michal Szczepanski
Martyna Poreba
MQ
VLM
271
0
0
12 Sep 2025
A Lightweight Convolution and Vision Transformer integrated model with Multi-scale Self-attention Mechanism
Yi Zhang
Lingxiao Wei
Bowei Zhang
Z. Liu
Kai Yi
Shu Hu
ViT
197
3
0
23 Aug 2025
UniConvNet: Expanding Effective Receptive Field while Maintaining Asymptotically Gaussian Distribution for ConvNets of Any Scale
Yuhao Wang
Wei Xi
275
4
0
12 Aug 2025
Lightweight Backbone Networks Only Require Adaptive Lightweight Self-Attention Mechanisms
Fengyun Li
Chao Zheng
Yangyang Fang
Jialiang Lan
Jianhua Liang
Luhao Zhang
Fa Si
267
1
0
02 Aug 2025
Mobile U-ViT: Revisiting large kernel and U-shaped ViT for efficient medical image segmentation
Fenghe Tang
Bingkun Nian
Jianrui Ding
Wenxin Ma
Quan Quan
Chengqi Dong
J. Yang
Wei Liu
S.Kevin Zhou
MedIm
211
5
0
01 Aug 2025
DeepTraverse: A Depth-First Search Inspired Network for Algorithmic Visual Understanding
Bin Guo
John H.L. Hansen
307
1
0
11 Jun 2025
MambaNeXt-YOLO: A Hybrid State Space Model for Real-time Object Detection
Xiaochun Lei
Siqi Wu
Weilin Wu
Zetao Jiang
Mamba
357
0
0
04 Jun 2025
Expert-Like Reparameterization of Heterogeneous Pyramid Receptive Fields in Efficient CNNs for Fair Medical Image Classification
Xiao Wu
Xiaoqing Zhang
Zunjie Xiao
Lingxi Hu
Risa Higashita
Jiang Liu
457
1
0
19 May 2025
Spec2VolCAMU-Net: A Spectrogram-to-Volume Model for EEG-to-fMRI Reconstruction based on Multi-directional Time-Frequency Convolutional Attention Encoder and Vision-Mamba U-Net
Journal of Neural Engineering (J. Neural Eng.), 2025
Dongyi He
Shiyang Li
Bin Jiang
He Yan
MedIm
295
2
0
14 May 2025
LSNet: See Large, Focus Small
Computer Vision and Pattern Recognition (CVPR), 2025
Ao Wang
Hui Chen
Zijia Lin
Jiawei Han
Guiguang Ding
341
35
0
29 Mar 2025
GmNet: Revisiting Gating Mechanisms From A Frequency View
Yifan Wang
Xu Ma
Yitian Zhang
Zhongruo Wang
Sung-Cheol Kim
Vahid Mirjalili
Vidya Renganathan
Y. Fu
391
0
0
28 Mar 2025
Atlas: Multi-Scale Attention Improves Long Context Image Modeling
Kumar Krishna Agrawal
Long Lian
Lu Liu
Natalia Harguindeguy
Boyi Li
Alexander Bick
Maggie Chung
Trevor Darrell
Adam Yala
ViT
229
3
0
16 Mar 2025
Fraesormer: Learning Adaptive Sparse Transformer for Efficient Food Recognition
Shun Zou
Yi Zou
Mingya Zhang
Shipeng Luo
Zhihao Chen
Guangwei Gao
ViT
341
2
0
15 Mar 2025
HOTFormerLoc: Hierarchical Octree Transformer for Versatile Lidar Place Recognition Across Ground and Aerial Views
Computer Vision and Pattern Recognition (CVPR), 2025
Ethan Griffiths
Maryam Haghighat
Akila Pemasiri
Clinton Fookes
Milad Ramezani
3DPC
642
5
0
11 Mar 2025
Partial Convolution Meets Visual Attention
Haiduo Huang
Fuwei Yang
D. Li
Ji Liu
Lu Tian
Jinzhang Peng
Pengju Ren
E. Barsoum
3DH
937
2
0
05 Mar 2025
Thicker and Quicker: A Jumbo Token for Fast Plain Vision Transformers
A. Fuller
Yousef Yassin
Daniel G. Kyrollos
Evan Shelhamer
James R. Green
637
1
0
20 Feb 2025
iFormer: Integrating ConvNet and Transformer for Mobile Application
International Conference on Learning Representations (ICLR), 2025
Chuanyang Zheng
ViT
461
11
0
26 Jan 2025
RecConv: Efficient Recursive Convolutions for Multi-Frequency Representations
Mingshu Zhao
Yi Luo
Yong Ouyang
387
0
0
27 Dec 2024
Distilled Pooling Transformer Encoder for Efficient Realistic Image Dehazing
Le-Anh Tran
Dong-Chul Park
ViT
282
11
0
18 Dec 2024
RapidNet: Multi-Level Dilated Convolution Based Mobile Backbone
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Mustafa Munir
Md Mostafijur Rahman
R. Marculescu
MedIm
ViT
428
7
0
14 Dec 2024
MultiTASC++: A Continuously Adaptive Scheduler for Edge-Based Multi-Device Cascade Inference
Sokratis Nikolaidis
Stylianos I. Venieris
I. Venieris
298
0
0
05 Dec 2024
CARE Transformer: Mobile-Friendly Linear Visual Transformer via Decoupled Dual Interaction
Computer Vision and Pattern Recognition (CVPR), 2024
Yuan Zhou
Qingshan Xu
Jiequan Cui
Junbao Zhou
Jing Zhang
Richang Hong
Han Zhang
ViT
315
6
0
25 Nov 2024
MobileMamba: Lightweight Multi-Receptive Visual Mamba Network
Computer Vision and Pattern Recognition (CVPR), 2024
Haoyang He
Jing Zhang
Yuxuan Cai
Hongxu Chen
Xiaobin Hu
Zhenye Gan
Yun Wang
Chengjie Wang
Yunsheng Wu
Lei Xie
Mamba
528
55
0
24 Nov 2024
EfficientViM: Efficient Vision Mamba with Hidden State Mixer based State Space Duality
Computer Vision and Pattern Recognition (CVPR), 2024
Sanghyeok Lee
Joonmyung Choi
Hyunwoo J. Kim
527
34
0
22 Nov 2024
AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation
Neural Information Processing Systems (NeurIPS), 2024
Vidit Goel
Huseyin Coskun
Jierun Chen
Junli Cao
Willi Menapace
Aliaksandr Siarohin
Sergey Tulyakov
Jian Ren
278
7
0
07 Nov 2024
Improving Vision Transformers by Overlapping Heads in Multi-Head Self-Attention
Tianxiao Zhang
Bo Luo
G. Wang
ViT
318
5
0
18 Oct 2024
SCAN-Edge: Finding MobileNet-speed Hybrid Networks for Diverse Edge Devices via Hardware-Aware Evolutionary Search
Hung-Yueh Chiang
Diana Marculescu
245
0
0
27 Aug 2024
Towards Real-time Video Compressive Sensing on Mobile Devices
ACM Multimedia (MM), 2024
Miao Cao
Lishun Wang
Huan Wang
Guoqing Wang
Xin Yuan
3DGS
340
3
0
14 Aug 2024
CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications
Tianfang Zhang
Lei Li
Yang Zhou
Wentao Liu
Chen Qian
Xiangyang Ji
ViT
245
81
0
07 Aug 2024
How Lightweight Can A Vision Transformer Be
Jen Hong Tan
ViT
MoE
266
1
0
25 Jul 2024
GroupMamba: Efficient Group-Based Visual State Space Model
Abdelrahman M. Shaker
Syed Talal Wasim
Salman Khan
Juergen Gall
Fahad Shahbaz Khan
Mamba
267
4
0
18 Jul 2024
AFIDAF: Alternating Fourier and Image Domain Adaptive Filters as an Efficient Alternative to Attention in ViTs
Yunling Zheng
Zeyi Xu
Fanghui Xue
Biao Yang
Jiancheng Lyu
Shuai Zhang
Y. Qi
Jack Xin
246
0
0
16 Jul 2024
Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model
Haobo Yuan
Xiangtai Li
Lu Qi
Tao Zhang
Ming-Hsuan Yang
Shuicheng Yan
Chen Change Loy
VLM
348
21
0
27 Jun 2024
RepNeXt: A Fast Multi-Scale CNN using Structural Reparameterization
Mingshu Zhao
Yi Luo
Yong Ouyang
477
9
0
23 Jun 2024
Scaling Graph Convolutions for Mobile Vision
William Avery
Mustafa Munir
R. Marculescu
GNN
352
17
0
09 Jun 2024
Navigating Efficiency in MobileViT through Gaussian Process on Global Architecture Factors
Ke Meng
Kai Chen
275
2
0
07 Jun 2024
Semantic Equitable Clustering: A Simple and Effective Strategy for Clustering Vision Tokens
Qihang Fan
Huaibo Huang
Mingrui Chen
Ran He
431
3
0
22 May 2024
Vision Transformer with Sparse Scan Prior
Qihang Fan
Huaibo Huang
Mingrui Chen
ViT
465
9
0
22 May 2024
An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training
Jin Gao
Shubo Lin
Shaoru Wang
Yutong Kou
Zeming Li
Liang Li
Congxuan Zhang
Xiaoqin Zhang
Yizheng Wang
Weiming Hu
357
10
0
18 Apr 2024
LUCF-Net: Lightweight U-shaped Cascade Fusion Network for Medical Image Segmentation
Songkai Sun
Qingshan She
Yuliang Ma
Rihui Li
Yingchun Zhang
MedIm
240
8
0
11 Apr 2024
Rewrite the Stars
Xu Ma
Xiyang Dai
Yue Bai
Yizhou Wang
Yun Fu
285
408
0
29 Mar 2024
Efficient Modulation for Vision Networks
Xu Ma
Xiyang Dai
Jianwei Yang
Bin Xiao
Yinpeng Chen
Yun Fu
Lu Yuan
349
29
0
29 Mar 2024
Scenario Engineering for Autonomous Transportation: A New Stage in Open-Pit Mines
IEEE Transactions on Intelligent Vehicles (TIV), 2024
Siyu Teng
Xuan Li
Yuchen Li
Zhe Xuanyuan
Yunfeng Ai
Long Chen
287
22
0
15 Mar 2024
Attention-aware Semantic Communications for Collaborative Inference
Jiwoong Im
Nayoung Kwon
Taewoo Park
Jiheon Woo
Jaeho Lee
Yongjune Kim
274
13
0
23 Feb 2024
YOLO-Ant: A Lightweight Detector via Depthwise Separable Convolutional and Large Kernel Design for Antenna Interference Source Detection
Xiaoyu Tang
Xingming Chen
Jintao Cheng
Jin Wu
Rui Fan
Chengxi Zhang
Zebo Zhou
286
15
0
20 Feb 2024
SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design
Computer Vision and Pattern Recognition (CVPR), 2024
Seokju Yun
Youngmin Ro
ViT
464
137
0
29 Jan 2024
RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything
Shilin Xu
Haobo Yuan
Qingyu Shi
Lu Qi
Jingbo Wang
...
Kai Chen
Yunhai Tong
Guohao Li
Xiangtai Li
Ming-Hsuan Yang
VLM
163
8
0
18 Jan 2024
1
2
Next
Page 1 of 2