ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2104.01136
  4. Cited By
LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference
v1v2 (latest)

LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference

IEEE International Conference on Computer Vision (ICCV), 2021
2 April 2021
Ben Graham
Alaaeldin El-Nouby
Hugo Touvron
Pierre Stock
Armand Joulin
Edouard Grave
Matthijs Douze
    ViT
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)Github (612★)

Papers citing "LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference"

50 / 327 papers shown
Performance Evaluation of Deep Learning for Tree Branch Segmentation in Autonomous Forestry Systems
Performance Evaluation of Deep Learning for Tree Branch Segmentation in Autonomous Forestry SystemsImage and Vision Computing New Zealand (IVCNZ), 2025
Yida Lin
Bing Xue
Mengjie Zhang
Sam Schofield
Richard Green
197
6
0
05 Dec 2025
DisentangleFormer: Spatial-Channel Decoupling for Multi-Channel Vision
DisentangleFormer: Spatial-Channel Decoupling for Multi-Channel Vision
Jiashu Liao
Pietro Liò
Marc de Kamps
Duygu Sarikaya
167
0
0
03 Dec 2025
nnMobileNet++: Towards Efficient Hybrid Networks for Retinal Image Analysis
nnMobileNet++: Towards Efficient Hybrid Networks for Retinal Image Analysis
Xin Li
Wenhui Zhu
Xuanzhao Dong
Hao Wang
Yujian Xiong
Oana Dumitrascu
Yalin Wang
MedIm
195
0
0
01 Dec 2025
Energy-Efficient Vision Transformer Inference for Edge-AI Deployment
Energy-Efficient Vision Transformer Inference for Edge-AI Deployment
Nursultan Amanzhol
Jurn-Gyu Park
207
0
0
28 Nov 2025
AutoTailor: Automatic and Efficient Adaptive Model Deployment for Diverse Edge Devices
AutoTailor: Automatic and Efficient Adaptive Model Deployment for Diverse Edge Devices
M. Liu
Chenyu Lu
H. Tian
Fang Dong
Ruiting Zhou
Wei Wang
Dian Shen
Guangtong Li
Ye Wan
Li Li
116
0
0
27 Nov 2025
DMSORT: An efficient parallel maritime multi-object tracking architecture for unmanned vessel platforms
DMSORT: An efficient parallel maritime multi-object tracking architecture for unmanned vessel platformsOcean Engineering (Ocean Eng.), 2025
Shengyu Tang
Zeyuan Lu
Jiazhi Dong
Changdong Yu
Xiaoyu Wang
Yaohui Lyu
Weihao Xia
VOT
690
0
0
06 Nov 2025
Integrating ConvNeXt and Vision Transformers for Enhancing Facial Age Estimation
Integrating ConvNeXt and Vision Transformers for Enhancing Facial Age EstimationComputer Vision and Image Understanding (CVIU), 2025
Gaby Maroun
Salah Eddine Bekhouche
Fadi Dornaika
ViT
154
1
0
31 Oct 2025
Alias-Free ViT: Fractional Shift Invariance via Linear Attention
Alias-Free ViT: Fractional Shift Invariance via Linear Attention
H. Michaeli
Daniel Soudry
227
1
0
26 Oct 2025
WaveSeg: Enhancing Segmentation Precision via High-Frequency Prior and Mamba-Driven Spectrum Decomposition
WaveSeg: Enhancing Segmentation Precision via High-Frequency Prior and Mamba-Driven Spectrum Decomposition
Guoan Xu
Yang Xiao
Wenjing Jia
Guangwei Gao
Guo-Jun Qi
Chia-Wen Lin
Mamba
265
0
0
24 Oct 2025
Multi-Scale High-Resolution Logarithmic Grapher Module for Efficient Vision GNNs
Multi-Scale High-Resolution Logarithmic Grapher Module for Efficient Vision GNNs
Mustafa Munir
Alex Zhang
R. Marculescu
210
2
0
15 Oct 2025
BioAutoML-NAS: An End-to-End AutoML Framework for Multimodal Insect Classification via Neural Architecture Search on Large-Scale Biodiversity Data
BioAutoML-NAS: An End-to-End AutoML Framework for Multimodal Insect Classification via Neural Architecture Search on Large-Scale Biodiversity Data
Arefin Ittesafun Abian
Debopom Sutradhar
Md Rafi Ur Rashid
Reem E. Mohamed
M. Islam
Asif Karim
Kheng Cher Yeo
Sami Azam
193
0
0
07 Oct 2025
MER-Inspector: Assessing model extraction risks from an attack-agnostic perspective
MER-Inspector: Assessing model extraction risks from an attack-agnostic perspective
Xinwei Zhang
Haibo Hu
Qingqing Ye
Li Bai
Huadi Zheng
MIACV
426
5
0
23 Sep 2025
Optimizing Product Deduplication in E-Commerce with Multimodal Embeddings
Optimizing Product Deduplication in E-Commerce with Multimodal Embeddings
Aysenur Kulunk
Berk Taskin
M. Furkan Eseoglu
H. Bahadir Sahin
224
0
0
19 Sep 2025
Leveraging Geometric Visual Illusions as Perceptual Inductive Biases for Vision Models
Leveraging Geometric Visual Illusions as Perceptual Inductive Biases for Vision Models
Haobo Yang
Minghao Guo
Dequan Yang
Wenyu Wang
167
0
0
18 Sep 2025
Image Quality Assessment for Machines: Paradigm, Large-scale Database, and Models
Image Quality Assessment for Machines: Paradigm, Large-scale Database, and Models
Xiaoqi Wang
Yun Zhang
Weisi Lin
234
0
0
27 Aug 2025
NAT: Learning to Attack Neurons for Enhanced Adversarial Transferability
NAT: Learning to Attack Neurons for Enhanced Adversarial TransferabilityIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
Krishna Kanth Nakka
Alexandre Alahi
AAML
190
2
0
23 Aug 2025
Vision encoders should be image size agnostic and task driven
Vision encoders should be image size agnostic and task driven
Nedyalko Prisadnikov
Danda Pani Paudel
Yuqian Fu
Luc Van Gool
180
1
0
22 Aug 2025
The Maximum Coverage Model and Recommendation System for UAV Vertiports Location Planning
The Maximum Coverage Model and Recommendation System for UAV Vertiports Location Planning
Chunliang Hua
Xiao Hu
Jiayang Sun
Zeyuan Yang
263
1
0
18 Aug 2025
ViT-EnsembleAttack: Augmenting Ensemble Models for Stronger Adversarial Transferability in Vision Transformers
ViT-EnsembleAttack: Augmenting Ensemble Models for Stronger Adversarial Transferability in Vision Transformers
Hanwen Cao
Haobo Lu
Xiaosen Wang
Kun He
ViTAAML
239
3
0
17 Aug 2025
CoCAViT: Compact Vision Transformer with Robust Global Coordination
CoCAViT: Compact Vision Transformer with Robust Global Coordination
Xuyang Wang
Lingjuan Miao
Zhiqiang Zhou
ViTVLM
188
1
0
07 Aug 2025
Representation Shift: Unifying Token Compression with FlashAttention
Representation Shift: Unifying Token Compression with FlashAttention
Joonmyung Choi
S. Lee
Byungoh Ko
Eunseo Kim
Jihyung Kil
Hyunwoo J. Kim
248
2
0
01 Aug 2025
A Survey of Token Compression for Efficient Multimodal Large Language Models
A Survey of Token Compression for Efficient Multimodal Large Language Models
Kele Shao
Keda Tao
Kejia Zhang
Sicheng Feng
Mu Cai
Yuzhang Shang
Haoxuan You
Can Qin
Yang Sui
Huan Wang
721
12
0
27 Jul 2025
Modality Agnostic Efficient Long Range Encoder
Modality Agnostic Efficient Long Range Encoder
T. Parag
Ahmed Elgammal
203
0
0
25 Jul 2025
RAM-W600: A Multi-Task Wrist Dataset and Benchmark for Rheumatoid Arthritis
RAM-W600: A Multi-Task Wrist Dataset and Benchmark for Rheumatoid Arthritis
Songxiao Yang
Haolin Wang
Yao Fu
Ye Tian
Tamotsu Kamishima
Masayuki Ikebe
Yafei Ou
Masatoshi Okutomi
294
3
0
07 Jul 2025
Boosting Generative Adversarial Transferability with Self-supervised Vision Transformer Features
Boosting Generative Adversarial Transferability with Self-supervised Vision Transformer Features
Shangbo Wu
Yu-an Tan
Ruinan Ma
Wencong Ma
Dehua Zhu
Yuanzhang Li
ViT
268
3
0
26 Jun 2025
Attention-based Adversarial Robust Distillation in Radio Signal Classifications for Low-Power IoT Devices
Attention-based Adversarial Robust Distillation in Radio Signal Classifications for Low-Power IoT DevicesIEEE Internet of Things Journal (IEEE IoT J.), 2023
Lu Zhang
S. Lambotharan
G. Zheng
G. Liao
Basil AsSadhan
Fabio Roli
AAML
236
16
0
13 Jun 2025
DeepTraverse: A Depth-First Search Inspired Network for Algorithmic Visual Understanding
DeepTraverse: A Depth-First Search Inspired Network for Algorithmic Visual Understanding
Bin Guo
John H.L. Hansen
307
1
0
11 Jun 2025
Efficient Egocentric Action Recognition with Multimodal Data
Efficient Egocentric Action Recognition with Multimodal Data
Marco Calzavara
Ard Kastrati
Matteo Macchini
Dushan Vasilevski
Roger Wattenhofer
EgoV
384
0
0
02 Jun 2025
TESSER: Transfer-Enhancing Adversarial Attacks from Vision Transformers via Spectral and Semantic Regularization
TESSER: Transfer-Enhancing Adversarial Attacks from Vision Transformers via Spectral and Semantic Regularization
Amira Guesmi
B. Ouni
Muhammad Shafique
AAML
541
1
0
26 May 2025
AnchorFormer: Differentiable Anchor Attention for Efficient Vision Transformer
AnchorFormer: Differentiable Anchor Attention for Efficient Vision TransformerPattern Recognition Letters (Pattern Recogn. Lett.), 2025
Jiquan Shan
Junxiao Wang
Lifeng Zhao
Liang Cai
Hongyuan Zhang
Ioannis Liritzis
ViT
860
8
0
22 May 2025
MSVIT: Improving Spiking Vision Transformer Using Multi-scale Attention Fusion
MSVIT: Improving Spiking Vision Transformer Using Multi-scale Attention FusionInternational Joint Conference on Artificial Intelligence (IJCAI), 2025
Wei Hua
Chenlin Zhou
Jibin Wu
Yansong Chua
Yangyang Shu
452
2
0
19 May 2025
CGTrack: Cascade Gating Network with Hierarchical Feature Aggregation for UAV Tracking
CGTrack: Cascade Gating Network with Hierarchical Feature Aggregation for UAV TrackingIEEE International Conference on Robotics and Automation (ICRA), 2025
Weihong Li
Xiaoqiong Liu
Heng Fan
L. Zhang
280
3
0
09 May 2025
Adaptive Data-Resilient Multi-Modal Hierarchical Multi-Label Book Genre Identification
Adaptive Data-Resilient Multi-Modal Hierarchical Multi-Label Book Genre Identification
Utsav Nareti
S. Chattopadhyay
Prolay Mallick
Suraj Kumar
Ayush Vikas Daga
Chandranath Adak
Adarsh Wase
Arjab Roy
400
1
0
05 May 2025
Optimal Hyperspectral Undersampling Strategy for Satellite Imaging
Optimal Hyperspectral Undersampling Strategy for Satellite Imaging
Vita V. Vlasova
Vladimir G. Kuzmin
Maria S. Varetsa
Natalia A. Ibragimova
Oleg Y. Rogov
Elena V. Lyapuntsova
314
0
0
27 Apr 2025
RaPA: Enhancing Transferable Targeted Attacks via Random Parameter Pruning
RaPA: Enhancing Transferable Targeted Attacks via Random Parameter Pruning
Tongrui Su
Qingbin Li
Shengyu Zhu
Wei Chen
Xueqi Cheng
AAMLSILM
477
1
0
24 Apr 2025
ECViT: Efficient Convolutional Vision Transformer with Local-Attention and Multi-scale Stages
ECViT: Efficient Convolutional Vision Transformer with Local-Attention and Multi-scale Stages
Zhoujie Qian
ViT
318
1
0
21 Apr 2025
BeetleVerse: A Study on Taxonomic Classification of Ground Beetles
BeetleVerse: A Study on Taxonomic Classification of Ground Beetles
S M Rayeed
Alyson East
Samuel Stevens
Sydne Record
Charles V. Stewart
273
3
0
18 Apr 2025
EDIT: Enhancing Vision Transformers by Mitigating Attention Sink through an Encoder-Decoder Architecture
EDIT: Enhancing Vision Transformers by Mitigating Attention Sink through an Encoder-Decoder Architecture
Wenfeng Feng
Guoying Sun
Jianlong Wang
Xin Zhang
Jingjing Zhao
Yueyue Liang
Xiang Chen
Duokui Han
395
2
0
09 Apr 2025
Efficient Token Compression for Vision Transformer with Spatial Information Preserved
Efficient Token Compression for Vision Transformer with Spatial Information Preserved
Junzhu Mao
Yang Shen
Jinyang Guo
Yazhou Yao
Xiansheng Hua
ViT
474
2
0
30 Mar 2025
GmNet: Revisiting Gating Mechanisms From A Frequency View
GmNet: Revisiting Gating Mechanisms From A Frequency View
Yifan Wang
Xu Ma
Yitian Zhang
Zhongruo Wang
Sung-Cheol Kim
Vahid Mirjalili
Vidya Renganathan
Y. Fu
391
0
0
28 Mar 2025
Deepfake Detection via Knowledge Injection
Deepfake Detection via Knowledge Injection
Tonghui Li
Yuanfang Guo
Ziqiang Liu
Heqi Peng
Yunhong Wang
382
2
0
04 Mar 2025
Two-stream Beats One-stream: Asymmetric Siamese Network for Efficient Visual Tracking
Two-stream Beats One-stream: Asymmetric Siamese Network for Efficient Visual TrackingAAAI Conference on Artificial Intelligence (AAAI), 2025
Jiawen Zhu
Huayi Tang
Xin Chen
Xinying Wang
Dong Wang
Huchuan Lu
317
26
0
01 Mar 2025
Escaping The Big Data Paradigm in Self-Supervised Representation Learning
Escaping The Big Data Paradigm in Self-Supervised Representation Learning
Carlos Vélez García
Miguel Cazorla
Jorge Pomares
288
0
0
25 Feb 2025
MaxGlaViT: A novel lightweight vision transformer-based approach for early diagnosis of glaucoma stages from fundus images
MaxGlaViT: A novel lightweight vision transformer-based approach for early diagnosis of glaucoma stages from fundus images
Mustafa Yurdakul
Kubra Uyar
Şakir Tasdemir
334
10
0
24 Feb 2025
Dual-Flow: Transferable Multi-Target, Instance-Agnostic Attacks via In-the-wild Cascading Flow Optimization
Dual-Flow: Transferable Multi-Target, Instance-Agnostic Attacks via In-the-wild Cascading Flow Optimization
Yixiao Chen
Shikun Sun
Jianshu Li
Ruoyu Li
Zhe Li
Junliang Xing
AAML
788
1
0
04 Feb 2025
Parallel Sequence Modeling via Generalized Spatial Propagation Network
Parallel Sequence Modeling via Generalized Spatial Propagation NetworkComputer Vision and Pattern Recognition (CVPR), 2025
Hongjun Wang
Wonmin Byeon
Jiarui Xu
Liang Feng
Ka Chun Cheung
Xiaolong Wang
Kai Han
Jan Kautz
Sifei Liu
877
3
0
21 Jan 2025
RecConv: Efficient Recursive Convolutions for Multi-Frequency Representations
RecConv: Efficient Recursive Convolutions for Multi-Frequency Representations
Mingshu Zhao
Yi Luo
Yong Ouyang
387
0
0
27 Dec 2024
Cascaded Multi-Scale Attention for Enhanced Multi-Scale Feature Extraction and Interaction with Low-Resolution Images
Cascaded Multi-Scale Attention for Enhanced Multi-Scale Feature Extraction and Interaction with Low-Resolution Images
Xiangyong Lu
Masanori Suganuma
Takayuki Okatani
523
2
0
03 Dec 2024
Improving Transferable Targeted Attacks with Feature Tuning Mixup
Improving Transferable Targeted Attacks with Feature Tuning MixupComputer Vision and Pattern Recognition (CVPR), 2024
K. Liang
Xuelong Dai
Yanjie Li
Dong Wang
Bin Xiao
AAML
1.2K
8
0
23 Nov 2024
SAG-ViT: A Scale-Aware, High-Fidelity Patching Approach with Graph Attention for Vision Transformers
SAG-ViT: A Scale-Aware, High-Fidelity Patching Approach with Graph Attention for Vision Transformers
Shravan Venkatraman
Jaskaran Singh Walia
J. Raheja
ViT
572
8
0
14 Nov 2024
1234567
Next
Page 1 of 7