Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2201.00814
Cited By
v1
v2 (latest)
Vision Transformer Slimming: Multi-Dimension Searching in Continuous Optimization Space
Computer Vision and Pattern Recognition (CVPR), 2022
3 January 2022
Arnav Chavan
Zhiqiang Shen
Zhuang Liu
Zechun Liu
Kwang-Ting Cheng
Eric P. Xing
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Github (249★)
Papers citing
"Vision Transformer Slimming: Multi-Dimension Searching in Continuous Optimization Space"
50 / 53 papers shown
One-Shot Knowledge Transfer for Scalable Person Re-Identification
Longhua Li
Lei Qi
Xin Geng
195
3
0
08 Nov 2025
General Compression Framework for Efficient Transformer Object Tracking
Lingyi Hong
Jinglun Li
Xinyu Zhou
Shilin Yan
Pinxue Guo
...
Runze Li
Xingdong Sheng
Wei Zhang
Hong Lu
Wenqiang Zhang
ViT
371
4
0
01 Jul 2025
How to Train Your Metamorphic Deep Neural Network
Thomas Sommariva
Simone Calderara
Angelo Porrello
275
0
0
07 May 2025
AdaVid: Adaptive Video-Language Pretraining
Chaitanya Patel
Juan Carlos Niebles
Ehsan Adeli
VLM
251
0
0
16 Apr 2025
Discovering Influential Neuron Path in Vision Transformers
International Conference on Learning Representations (ICLR), 2025
Yifan Wang
Yifei Liu
Yingdong Shi
Chong Li
Anqi Pang
Sibei Yang
Jingyi Yu
Kan Ren
ViT
663
6
0
12 Mar 2025
Neural Metamorphosis
European Conference on Computer Vision (ECCV), 2024
Xingyi Yang
Xinchao Wang
389
5
0
10 Oct 2024
HydraViT: Stacking Heads for a Scalable ViT
Neural Information Processing Systems (NeurIPS), 2024
Janek Haberer
A. Hojjat
Olaf Landsiedel
260
7
0
26 Sep 2024
OATS: Outlier-Aware Pruning Through Sparse and Low Rank Decomposition
International Conference on Learning Representations (ICLR), 2024
Stephen Zhang
Vardan Papyan
VLM
719
18
0
20 Sep 2024
Agglomerative Token Clustering
European Conference on Computer Vision (ECCV), 2024
Joakim Bruslund Haurum
Sergio Escalera
Graham W. Taylor
T. Moeslund
352
11
0
18 Sep 2024
Vote&Mix: Plug-and-Play Token Reduction for Efficient Vision Transformer
Shuai Peng
Di Fu
Baole Wei
Yong Cao
Liangcai Gao
Zhi Tang
ViT
245
4
0
30 Aug 2024
Token Compensator: Altering Inference Cost of Vision Transformer without Re-Tuning
European Conference on Computer Vision (ECCV), 2024
Shibo Jie
Yehui Tang
Jianyuan Guo
Zhi-Hong Deng
Kai Han
Yunhe Wang
VLM
254
7
0
13 Aug 2024
Efficient Visual Transformer by Learnable Token Merging
Yancheng Wang
Yingzhen Yang
ViT
385
13
0
21 Jul 2024
Straightforward Layer-wise Pruning for More Efficient Visual Adaptation
Ruizi Han
Jinglei Tang
361
3
0
19 Jul 2024
PRANCE: Joint Token-Optimization and Structural Channel-Pruning for Adaptive ViT Inference
Ye Li
Chen Tang
Yuan Meng
Jiajun Fan
Runnan Li
Cheng Wang
Zhi Wang
Wenwu Zhu
290
7
0
06 Jul 2024
Isomorphic Pruning for Vision Models
Gongfan Fang
Xinyin Ma
Michael Bi Mi
Xinchao Wang
VLM
ViT
362
33
0
05 Jul 2024
Surgical Feature-Space Decomposition of LLMs: Why, When and How?
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Arnav Chavan
Nahush Lele
Deepak Gupta
317
6
0
17 May 2024
Efficient Multimodal Large Language Models: A Survey
Yizhang Jin
Jian Li
Yexin Liu
Tianjun Gu
Kai Wu
...
Xin Tan
Zhenye Gan
Yabiao Wang
Chengjie Wang
Lizhuang Ma
LRM
366
104
0
17 May 2024
Data-independent Module-aware Pruning for Hierarchical Vision Transformers
Yang He
Qiufeng Wang
ViT
319
11
0
21 Apr 2024
MULTIFLOW: Shifting Towards Task-Agnostic Vision-Language Pruning
Matteo Farina
Goran Frehse
Elia Cunegatti
Gaowen Liu
Giovanni Iacca
Elisa Ricci
VLM
324
9
0
08 Apr 2024
Dense Vision Transformer Compression with Few Samples
Hanxiao Zhang
Yifan Zhou
Guo-Hua Wang
Jianxin Wu
ViT
VLM
344
11
0
27 Mar 2024
Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression
Computer Vision and Pattern Recognition (CVPR), 2024
Hancheng Ye
Chong Yu
Peng Ye
Renqiu Xia
Yansong Tang
Jiwen Lu
Tao Chen
Bo Zhang
260
11
0
23 Mar 2024
PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation
European Conference on Computer Vision (ECCV), 2024
Yizhe Xiong
Hui Chen
Tianxiang Hao
Zijia Lin
Jungong Han
Yuesong Zhang
Guoxin Wang
Yongjun Bao
Guiguang Ding
451
26
0
14 Mar 2024
MoPE-CLIP: Structured Pruning for Efficient Vision-Language Models with Module-wise Pruning Error Metric
Computer Vision and Pattern Recognition (CVPR), 2024
Haokun Lin
Haoli Bai
Zhili Liu
Lu Hou
Muyi Sun
Linqi Song
Ying Wei
Zhenan Sun
CLIP
VLM
198
40
0
12 Mar 2024
MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language Transformer
Jianjian Cao
Peng Ye
Shengze Li
Chong Yu
Yansong Tang
Jiwen Lu
Tao Chen
255
57
0
05 Mar 2024
A Survey on Transformer Compression
Yehui Tang
Yunhe Wang
Jianyuan Guo
Zhijun Tu
Kai Han
Hailin Hu
Dacheng Tao
584
73
0
05 Feb 2024
Faster and Lighter LLMs: A Survey on Current Challenges and Way Forward
Arnav Chavan
Raghav Magazine
Shubham Kushwaha
M. Debbah
Deepak Gupta
384
43
0
02 Feb 2024
Bridging The Gaps Between Token Pruning and Full Pre-training via Masked Fine-tuning
Fengyuan Shi
Limin Wang
ViT
251
0
0
26 Oct 2023
MatFormer: Nested Transformer for Elastic Inference
Neural Information Processing Systems (NeurIPS), 2023
Devvrit
Sneha Kudugunta
Aditya Kusupati
Tim Dettmers
Kaifeng Chen
...
Yulia Tsvetkov
Hannaneh Hajishirzi
Sham Kakade
Ali Farhadi
Prateek Jain
303
73
0
11 Oct 2023
TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance
IEEE International Conference on Computer Vision (ICCV), 2023
Kan Wu
Houwen Peng
Zhenghong Zhou
Bin Xiao
Xiyang Dai
...
Xi
Xi Chen
Xinggang Wang
Hongyang Chao
Han Hu
VLM
OODD
300
117
0
21 Sep 2023
Which Tokens to Use? Investigating Token Reduction in Vision Transformers
Joakim Bruslund Haurum
Sergio Escalera
Graham W. Taylor
T. Moeslund
ViT
313
73
0
09 Aug 2023
A Survey of Techniques for Optimizing Transformer Inference
Journal of systems architecture (JSA), 2023
Krishna Teja Chitty-Venkata
Sparsh Mittal
M. Emani
V. Vishwanath
Arun Somani
402
141
0
16 Jul 2023
DiffRate : Differentiable Compression Rate for Efficient Vision Transformers
IEEE International Conference on Computer Vision (ICCV), 2023
Mengzhao Chen
Wenqi Shao
Peng Xu
Mingbao Lin
Kaipeng Zhang
Jiayi Ji
Rongrong Ji
Yu Qiao
Ping Luo
ViT
252
85
0
29 May 2023
CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers
International Conference on Machine Learning (ICML), 2023
Dachuan Shi
Chaofan Tao
Anyi Rao
Zhendong Yang
Chun Yuan
Yuan Liu
VLM
546
46
0
27 May 2023
MixFormerV2: Efficient Fully Transformer Tracking
Neural Information Processing Systems (NeurIPS), 2023
Yutao Cui
Tian-Shu Song
Gangshan Wu
Liming Wang
291
150
0
25 May 2023
Spatiotemporal Attention-based Semantic Compression for Real-time Video Recognition
Nana Li
M. Bennis
Alexandros Iosifidis
Tao Gui
176
7
0
22 May 2023
Boost Vision Transformer with GPU-Friendly Sparsity and Quantization
Computer Vision and Pattern Recognition (CVPR), 2023
Chong Yu
Tao Chen
Zhongxue Gan
Jiayuan Fan
MQ
ViT
262
44
0
18 May 2023
Three Guidelines You Should Know for Universally Slimmable Self-Supervised Learning
Computer Vision and Pattern Recognition (CVPR), 2023
Yunhao Cao
Peiqin Sun
Shuchang Zhou
178
6
0
13 Mar 2023
Efficient Transformer-based 3D Object Detection with Dynamic Token Halting
IEEE International Conference on Computer Vision (ICCV), 2023
Mao Ye
Gregory P. Meyer
Yuning Chai
Qiang Liu
304
10
0
09 Mar 2023
X-Pruner: eXplainable Pruning for Vision Transformers
Computer Vision and Pattern Recognition (CVPR), 2023
Lu Yu
Wei Xiang
ViT
344
89
0
08 Mar 2023
Structured Pruning for Deep Convolutional Neural Networks: A survey
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Yang He
Lingao Xiao
3DPC
475
325
0
01 Mar 2023
Progressive Ensemble Distillation: Building Ensembles for Efficient Inference
Neural Information Processing Systems (NeurIPS), 2023
D. Dennis
Abhishek Shetty
A. Sevekari
K. Koishida
Virginia Smith
FedML
315
0
0
20 Feb 2023
UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers
International Conference on Machine Learning (ICML), 2023
Dachuan Shi
Chaofan Tao
Ying Jin
Zhendong Yang
Chun Yuan
Yuan Liu
VLM
ViT
463
64
0
31 Jan 2023
Rethinking Vision Transformers for MobileNet Size and Speed
IEEE International Conference on Computer Vision (ICCV), 2022
Yanyu Li
Ju Hu
Yang Wen
Georgios Evangelidis
Kamyar Salahi
Yanzhi Wang
Sergey Tulyakov
Jian Ren
ViT
460
291
0
15 Dec 2022
On Designing Light-Weight Object Trackers through Network Pruning: Use CNNs or Transformers?
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Saksham Aggarwal
Taneesh Gupta
Pawan Kumar Sahu
Arnav Chavan
Rishabh Tiwari
Dilip K. Prasad
D. K. Gupta
ViT
219
1
0
24 Nov 2022
Data Level Lottery Ticket Hypothesis for Vision Transformers
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Xuan Shen
Zhenglun Kong
Minghai Qin
Zhaoyang Han
Geng Yuan
Xin Meng
Hao Tang
Xiaolong Ma
Yanzhi Wang
310
8
0
02 Nov 2022
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Weicong Liang
Yuhui Yuan
Henghui Ding
Xiao Luo
Weihong Lin
Ding Jia
Zheng Zhang
Chao Zhang
Hanhua Hu
350
44
0
03 Oct 2022
Slimmable Networks for Contrastive Self-supervised Learning
International Journal of Computer Vision (IJCV), 2022
Shuai Zhao
Xiaohan Wang
Linchao Zhu
Yi Yang
264
7
0
30 Sep 2022
Greybox XAI: a Neural-Symbolic learning framework to produce interpretable predictions for image classification
Knowledge-Based Systems (KBS), 2022
Adrien Bennetot
Gianni Franchi
Javier Del Ser
Raja Chatila
Natalia Díaz Rodríguez
AAML
258
32
0
26 Sep 2022
EfficientFormer: Vision Transformers at MobileNet Speed
Neural Information Processing Systems (NeurIPS), 2022
Yanyu Li
Geng Yuan
Yang Wen
Eric Hu
Georgios Evangelidis
Sergey Tulyakov
Yanzhi Wang
Jian Ren
ViT
868
576
0
02 Jun 2022
Spartan: Differentiable Sparsity via Regularized Transportation
Neural Information Processing Systems (NeurIPS), 2022
Kai Sheng Tai
Taipeng Tian
Ser-Nam Lim
328
13
0
27 May 2022
1
2
Next
Page 1 of 2