v1v2 (latest)

EdgeViTs: Competing Light-weight CNNs on Mobile Devices with Vision Transformers

European Conference on Computer Vision (ECCV), 2022

6 May 2022

Georgios Tzimiropoulos

Brais Martínez

ViT

ArXiv (abs)PDF HTML Github (107★)

Papers citing "EdgeViTs: Competing Light-weight CNNs on Mobile Devices with Vision Transformers"

50 / 99 papers shown

Rethinking Vision Transformer Depth via Structural Reparameterization

157

24 Nov 2025

Hybrid Convolution and Vision Transformer NAS Search Space for TinyML Image Classification

Mikhael Djajapermana

Moritz Reiber

Daniel Mueller-Gritschneder

Ulf Schlichtmann

ViT

146

04 Nov 2025

WaveSeg: Enhancing Segmentation Precision via High-Frequency Prior and Mamba-Driven Spectrum Decomposition

265

24 Oct 2025

I-Segmenter: Integer-Only Vision Transformer for Efficient Semantic Segmentation

271

12 Sep 2025

A Lightweight Convolution and Vision Transformer integrated model with Multi-scale Self-attention Mechanism

197

23 Aug 2025

UniConvNet: Expanding Effective Receptive Field while Maintaining Asymptotically Gaussian Distribution for ConvNets of Any Scale

Yuhao Wang

Wei Xi

275

12 Aug 2025

Lightweight Backbone Networks Only Require Adaptive Lightweight Self-Attention Mechanisms

267

02 Aug 2025

Mobile U-ViT: Revisiting large kernel and U-shaped ViT for efficient medical image segmentation

211

01 Aug 2025

DeepTraverse: A Depth-First Search Inspired Network for Algorithmic Visual Understanding

Bin Guo

John H.L. Hansen

307

11 Jun 2025

MambaNeXt-YOLO: A Hybrid State Space Model for Real-time Object Detection

357

04 Jun 2025

Expert-Like Reparameterization of Heterogeneous Pyramid Receptive Fields in Efficient CNNs for Fair Medical Image Classification

457

19 May 2025

Spec2VolCAMU-Net: A Spectrogram-to-Volume Model for EEG-to-fMRI Reconstruction based on Multi-directional Time-Frequency Convolutional Attention Encoder and Vision-Mamba U-NetJournal of Neural Engineering (J. Neural Eng.), 2025

295

14 May 2025

LSNet: See Large, Focus SmallComputer Vision and Pattern Recognition (CVPR), 2025

341

29 Mar 2025

GmNet: Revisiting Gating Mechanisms From A Frequency View

391

28 Mar 2025

Atlas: Multi-Scale Attention Improves Long Context Image Modeling

Kumar Krishna Agrawal

229

16 Mar 2025

Fraesormer: Learning Adaptive Sparse Transformer for Efficient Food Recognition

341

15 Mar 2025

HOTFormerLoc: Hierarchical Octree Transformer for Versatile Lidar Place Recognition Across Ground and Aerial ViewsComputer Vision and Pattern Recognition (CVPR), 2025

642

11 Mar 2025

Partial Convolution Meets Visual Attention

937

05 Mar 2025

Thicker and Quicker: A Jumbo Token for Fast Plain Vision Transformers

637

20 Feb 2025

iFormer: Integrating ConvNet and Transformer for Mobile ApplicationInternational Conference on Learning Representations (ICLR), 2025

Chuanyang Zheng

ViT

461

26 Jan 2025

RecConv: Efficient Recursive Convolutions for Multi-Frequency Representations

Mingshu Zhao

Yi Luo

Yong Ouyang

387

27 Dec 2024

Distilled Pooling Transformer Encoder for Efficient Realistic Image Dehazing

Le-Anh Tran

Dong-Chul Park

ViT

282

18 Dec 2024

RapidNet: Multi-Level Dilated Convolution Based Mobile BackboneIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

428

14 Dec 2024

MultiTASC++: A Continuously Adaptive Scheduler for Edge-Based Multi-Device Cascade Inference

Sokratis Nikolaidis

Stylianos I. Venieris

I. Venieris

298

05 Dec 2024

CARE Transformer: Mobile-Friendly Linear Visual Transformer via Decoupled Dual InteractionComputer Vision and Pattern Recognition (CVPR), 2024

315

25 Nov 2024

MobileMamba: Lightweight Multi-Receptive Visual Mamba NetworkComputer Vision and Pattern Recognition (CVPR), 2024

528

24 Nov 2024

EfficientViM: Efficient Vision Mamba with Hidden State Mixer based State Space DualityComputer Vision and Pattern Recognition (CVPR), 2024

Sanghyeok Lee

Joonmyung Choi

Hyunwoo J. Kim

527

22 Nov 2024

AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and GenerationNeural Information Processing Systems (NeurIPS), 2024

278

07 Nov 2024

Improving Vision Transformers by Overlapping Heads in Multi-Head Self-Attention

318

18 Oct 2024

SCAN-Edge: Finding MobileNet-speed Hybrid Networks for Diverse Edge Devices via Hardware-Aware Evolutionary Search

Hung-Yueh Chiang

Diana Marculescu

245

27 Aug 2024

Towards Real-time Video Compressive Sensing on Mobile DevicesACM Multimedia (MM), 2024

Miao Cao

Xin Yuan

340

14 Aug 2024

CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications

Tianfang Zhang

Lei Li

Chen Qian

245

07 Aug 2024

How Lightweight Can A Vision Transformer Be

Jen Hong Tan

ViT MoE

266

25 Jul 2024

GroupMamba: Efficient Group-Based Visual State Space Model

Abdelrahman M. Shaker

Syed Talal Wasim

Salman Khan

Juergen Gall

Fahad Shahbaz Khan

Mamba

267

18 Jul 2024

AFIDAF: Alternating Fourier and Image Domain Adaptive Filters as an Efficient Alternative to Attention in ViTs

246

16 Jul 2024

Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model

Xiangtai Li

Tao Zhang

Chen Change Loy

348

27 Jun 2024

RepNeXt: A Fast Multi-Scale CNN using Structural Reparameterization

Mingshu Zhao

Yi Luo

Yong Ouyang

477

23 Jun 2024

Scaling Graph Convolutions for Mobile Vision

352

09 Jun 2024

Navigating Efficiency in MobileViT through Gaussian Process on Global Architecture Factors

Ke Meng

Kai Chen

275

07 Jun 2024

Semantic Equitable Clustering: A Simple and Effective Strategy for Clustering Vision Tokens

431

22 May 2024

Vision Transformer with Sparse Scan Prior

465

22 May 2024

An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training

357

18 Apr 2024

LUCF-Net: Lightweight U-shaped Cascade Fusion Network for Medical Image Segmentation

240

11 Apr 2024

285

408

29 Mar 2024

Efficient Modulation for Vision Networks

Jianwei Yang

Lu Yuan

349

29 Mar 2024

Scenario Engineering for Autonomous Transportation: A New Stage in Open-Pit MinesIEEE Transactions on Intelligent Vehicles (TIV), 2024

287

15 Mar 2024

Attention-aware Semantic Communications for Collaborative Inference

274

23 Feb 2024

YOLO-Ant: A Lightweight Detector via Depthwise Separable Convolutional and Large Kernel Design for Antenna Interference Source Detection

286

20 Feb 2024

SHViT: Single-Head Vision Transformer with Memory Efficient Macro DesignComputer Vision and Pattern Recognition (CVPR), 2024

Seokju Yun

Youngmin Ro

ViT

464

137

29 Jan 2024

RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything

...

Xiangtai Li

Ming-Hsuan Yang

VLM

163

18 Jan 2024