arXiv: 2107.01378
Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation
3 July 2021
Zhiwei Hao
Jianyuan Guo
Ding Jia
Kai Han
Yehui Tang
Chao Zhang
Dacheng Tao
Yunhe Wang
ViT
Papers citing
"Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation"
50 / 50 papers shown
Rethinking Vision Transformer Depth via Structural Reparameterization
Chengwei Zhou
Vipin Chaudhary
Gourav Datta
ViT
24 Nov 2025
From Low-Rank Features to Encoding Mismatch: Rethinking Feature Distillation in Vision Transformers
Huiyuan Tian
Bonan Xu
Shijian Li
Xin Jin
19 Nov 2025
Distillation Dynamics: Towards Understanding Feature-Based Distillation in Vision Transformers
Huiyuan Tian
Bonan Xu
Shijian Li
10 Nov 2025
No Alignment Needed for Generation: Learning Linearly Separable Representations in Diffusion Models
Junno Yun
Yasar Utku Alçalar
Mehmet Akçakaya
25 Sep 2025
ResidualViT for Efficient Temporally Dense Video Encoding
Mattia Soldan
Fabian Caba Heilbron
Bernard Ghanem
Josef Sivic
Bryan C. Russell
16 Sep 2025
UNIFORM: Unifying Knowledge from Large-scale and Diverse Pre-trained Models
Yimu Wang
Weiming Zhuang
Chen Chen
Jiabo Huang
Jingtao Li
Lingjuan Lyu
FedML
27 Aug 2025
Cross-Architecture Distillation Made Simple with Redundancy Suppression
Weijia Zhang
Yuehao Liu
Wu Ran
Chao Ma
29 Jul 2025
A Layered Self-Supervised Knowledge Distillation Framework for Efficient Multimodal Learning on the Edge
Tarique Dahri
Zulfiqar Ali Memon
Zhenyu Yu
Mohd Yamani Idna Idris
Sheheryar Khan
Sadiq Ahmad
Maged Shoman
Saddam Aziz
Rizwan Qureshi
08 Jun 2025
MoKD: Multi-Task Optimization for Knowledge Distillation
Zeeshan Hayder
A. Cheraghian
Lars Petersson
Mehrtash Harandi
VLM
13 May 2025
Position: Beyond Euclidean -- Foundation Models Should Embrace Non-Euclidean Geometries
Neil He
Jiahong Liu
Buze Zhang
N. Bui
Ali Maatouk
Menglin Yang
Irwin King
Melanie Weber
Rex Ying
11 Apr 2025
Distilling Knowledge from Heterogeneous Architectures for Semantic Segmentation
AAAI Conference on Artificial Intelligence (AAAI), 2025
Yuanmin Huang
Kai Hu
Yuhui Zhang
Z. Chen
Xieping Gao
10 Apr 2025
Random Conditioning with Distillation for Data-Efficient Diffusion Model Compression
Computer Vision and Pattern Recognition (CVPR), 2025
Dohyun Kim
S. Park
Geonhee Han
Seung Wook Kim
Paul Hongsuck Seo
DiffM
02 Apr 2025
U-REPA: Aligning Diffusion U-Nets to ViTs
Yuchuan Tian
Hanting Chen
Mengyu Zheng
Yuchen Liang
Chao Xu
Yunhe Wang
24 Mar 2025
Jointly Understand Your Command and Intention: Reciprocal Co-Evolution between Scene-Aware 3D Human Motion Synthesis and Analysis
Xuehao Gao
Yang Yang
Shaoyi Du
Guo-Jun Qi
Junwei Han
01 Mar 2025
Janus: Collaborative Vision Transformer Under Dynamic Network Environment
IEEE Conference on Computer Communications (IEEE INFOCOM), 2025
Linyi Jiang
Silvery Fu
Yifei Zhu
Bo Li
ViT
14 Feb 2025
iFormer: Integrating ConvNet and Transformer for Mobile Application
International Conference on Learning Representations (ICLR), 2025
Chuanyang Zheng
ViT
26 Jan 2025
Cognitive Edge Computing: A Comprehensive Survey on Optimizing Large Models and AI Agents for Pervasive Deployment
International Conference on Artificial Neural Networks (ICANN), 2025
Xubin Wang
Weijia Jia
04 Jan 2025
Semantic Alignment and Reinforcement for Data-Free Quantization of Vision Transformers
Mingliang Xu
Yuyao Zhou
Yuxin Zhang
Shen Li
Jiayi Ji
Zhanpeng Zeng
Rongrong Ji
MQ
21 Dec 2024
On the Surprising Effectiveness of Attention Transfer for Vision Transformers
Neural Information Processing Systems (NeurIPS), 2024
Alexander C. Li
Yuandong Tian
Bin Chen
Deepak Pathak
Xinlei Chen
14 Nov 2024
DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture
Computer Vision and Pattern Recognition (CVPR), 2024
Qianlong Xiang
Miao Zhang
Yuzhang Shang
Yue Yu
Yan Yan
Liqiang Nie
DiffM
05 Sep 2024
UNIC: Universal Classification Models via Multi-teacher Distillation
European Conference on Computer Vision (ECCV), 2024
Mert Bulent Sariyildiz
Philippe Weinzaepfel
Thomas Lucas
Diane Larlus
Yannis Kalantidis
09 Aug 2024
Neural-based Video Compression on Solar Dynamics Observatory Images
Atefeh Khoshkhahtinat
Ali Zafari
P. Mehta
Nasser M. Nasrabadi
Barbara J. Thompson
M. Kirk
D. D. Silva
12 Jul 2024
ViT-1.58b: Mobile Vision Transformers in the 1-bit Era
Zhengqing Yuan
Rong Zhou
Hongyi Wang
Lifang He
Yanfang Ye
Lichao Sun
MQ
26 Jun 2024
Efficient Multimodal Large Language Models: A Survey
Yizhang Jin
Jian Li
Yexin Liu
Tianjun Gu
Kai Wu
...
Xin Tan
Zhenye Gan
Yabiao Wang
Chengjie Wang
Lizhuang Ma
LRM
17 May 2024
Weight Copy and Low-Rank Adaptation for Few-Shot Distillation of Vision Transformers
Diana-Nicoleta Grigore
Mariana-Iuliana Georgescu
J. A. Justo
T. Johansen
Andreea-Iuliana Ionescu
Radu Tudor Ionescu
14 Apr 2024
The Need for Speed: Pruning Transformers with One Recipe
Samir Khaki
Konstantinos N. Plataniotis
26 Mar 2024
V_kD: Improving Knowledge Distillation using Orthogonal Projections
Computer Vision and Pattern Recognition (CVPR), 2024
Roy Miles
Ismail Elezi
Jiankang Deng
10 Mar 2024
Tiny Reinforcement Learning for Quadruped Locomotion using Decision Transformers
Orhan Eren Akgün
Néstor Cuevas
Matheus Farias
Daniel Garces
20 Feb 2024
A Survey on Transformer Compression
Yehui Tang
Yunhe Wang
Jianyuan Guo
Zhijun Tu
Kai Han
Hailin Hu
Dacheng Tao
05 Feb 2024
A Manifold Representation of the Key in Vision Transformers
Li Meng
Morten Goodwin
Anis Yazidi
P. Engelstad
01 Feb 2024
One-for-All: Bridge the Gap Between Heterogeneous Architectures in Knowledge Distillation
Neural Information Processing Systems (NeurIPS), 2023
Zhiwei Hao
Jianyuan Guo
Kai Han
Yehui Tang
Han Hu
Yunhe Wang
Chang Xu
30 Oct 2023
Understanding the Effects of Projectors in Knowledge Distillation
Yudong Chen
Sen Wang
Jiajun Liu
Xuwei Xu
Frank de Hoog
Brano Kusy
Zi Huang
26 Oct 2023
Talking Models: Distill Pre-trained Knowledge to Downstream Models via Interactive Communication
Zhe Zhao
Qingyun Liu
Huan Gui
Bang An
Lichan Hong
Ed H. Chi
04 Oct 2023
Gold-YOLO: Efficient Object Detector via Gather-and-Distribute Mechanism
Neural Information Processing Systems (NeurIPS), 2023
Chengcheng Wang
Wei He
Ying Nie
Jianyuan Guo
Chuanjian Liu
Kai Han
Yunhe Wang
ObjD
20 Sep 2023
DeViT: Decomposing Vision Transformers for Collaborative Inference in Edge Devices
IEEE Transactions on Mobile Computing (IEEE TMC), 2023
Guanyu Xu
Zhiwei Hao
Yong Luo
Han Hu
J. An
Shiwen Mao
ViT
10 Sep 2023
A survey on efficient vision transformers: algorithms, techniques, and performance benchmarking
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Lorenzo Papa
Paolo Russo
Irene Amerini
Luping Zhou
05 Sep 2023
LGViT: Dynamic Early Exiting for Accelerating Vision Transformer
ACM Multimedia (ACM MM), 2023
Guanyu Xu
Jiawei Hao
Li Shen
Han Hu
Yong Luo
Hui Lin
J. Shen
01 Aug 2023
VanillaKD: Revisit the Power of Vanilla Knowledge Distillation from Small Scale to Large Scale
Zhiwei Hao
Jianyuan Guo
Kai Han
Han Hu
Chang Xu
Yunhe Wang
25 May 2023
Bi-ViT: Pushing the Limit of Vision Transformer Quantization
AAAI Conference on Artificial Intelligence (AAAI), 2023
Yanjing Li
Sheng Xu
Mingbao Lin
Xianbin Cao
Chuanjian Liu
Xiao Sun
Baochang Zhang
ViT
MQ
21 May 2023
Visual Tuning
ACM Computing Surveys (ACM Comput. Surv.), 2023
Bruce X. B. Yu
Jianlong Chang
Haixin Wang
Lin Liu
Shijie Wang
...
Lingxi Xie
Haojie Li
Zhouchen Lin
Qi Tian
Chang Wen Chen
VLM
10 May 2023
RIFormer: Keep Your Vision Backbone Effective While Removing Token Mixer
Computer Vision and Pattern Recognition (CVPR), 2023
Jiahao Wang
Songyang Zhang
Yong Liu
Taiqiang Wu
Yujiu Yang
Xihui Liu
Kai-xiang Chen
Ping Luo
Dahua Lin
12 Apr 2023
Knowledge Distillation in Vision Transformers: A Critical Review
Gousia Habib
Tausifa Jan Saleem
Brejesh Lall
04 Feb 2023
PSAQ-ViT V2: Towards Accurate and General Data-Free Quantization for Vision Transformers
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022
Zhikai Li
Mengjuan Chen
Junrui Xiao
Qingyi Gu
ViT
MQ
13 Sep 2022
I-ViT: Integer-only Quantization for Efficient Vision Transformer Inference
IEEE International Conference on Computer Vision (ICCV), 2022
Zhikai Li
Qingyi Gu
MQ
04 Jul 2022
Chemical transformer compression for accelerating both training and inference of molecular modeling
Yi Yu
K. Börjesson
16 May 2022
Depth Estimation with Simplified Transformer
John Yang
Le An
Anurag Dixit
Jinkyu Koo
Su Inn Park
MDE
28 Apr 2022
Patch Similarity Aware Data-Free Quantization for Vision Transformers
European Conference on Computer Vision (ECCV), 2022
Zhikai Li
Liping Ma
Mengjuan Chen
Junrui Xiao
Qingyi Gu
MQ
ViT
04 Mar 2022
Meta Knowledge Distillation
Jihao Liu
Boxiao Liu
Jiaming Song
Yu Liu
16 Feb 2022
Multi-Dimensional Model Compression of Vision Transformer
IEEE International Conference on Multimedia and Expo (ICME), 2021
Zejiang Hou
S. Kung
ViT
31 Dec 2021
A Survey on Visual Transformer
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Kai Han
Yunhe Wang
Hanting Chen
Xinghao Chen
Jianyuan Guo
...
Chunjing Xu
Yixing Xu
Zhaohui Yang
Yiman Zhang
Dacheng Tao
ViT
23 Dec 2020