Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.17921
Cited By
The Need for Speed: Pruning Transformers with One Recipe
26 March 2024
Samir Khaki
Konstantinos N. Plataniotis
Re-assign community
ArXiv
PDF
HTML
Papers citing
"The Need for Speed: Pruning Transformers with One Recipe"
8 / 8 papers shown
Title
Learning to Inference Adaptively for Multimodal Large Language Models
Zhuoyan Xu
Khoi Duc Nguyen
Preeti Mukherjee
Saurabh Bagchi
Somali Chaterji
Yingyu Liang
Yin Li
LRM
42
1
0
13 Mar 2025
Accelerate 3D Object Detection Models via Zero-Shot Attention Key Pruning
Lizhen Xu
Xiuxiu Bai
Xiaojun Jia
Jianwu Fang
Shanmin Pang
61
0
0
13 Mar 2025
ATOM: Attention Mixer for Efficient Dataset Distillation
Samir Khaki
A. Sajedi
Kai Wang
Lucy Z. Liu
Y. Lawryshyn
Konstantinos N. Plataniotis
38
3
0
02 May 2024
DepGraph: Towards Any Structural Pruning
Gongfan Fang
Xinyin Ma
Mingli Song
Michael Bi Mi
Xinchao Wang
GNN
79
256
0
30 Jan 2023
CONetV2: Efficient Auto-Channel Size Optimization for CNNs
Yi Ru Wang
Samir Khaki
Weihang Zheng
Mahdi S. Hosseini
Konstantinos N. Plataniotis
26
6
0
13 Oct 2021
I-BERT: Integer-only BERT Quantization
Sehoon Kim
A. Gholami
Z. Yao
Michael W. Mahoney
Kurt Keutzer
MQ
86
336
0
05 Jan 2021
SCOP: Scientific Control for Reliable Neural Network Pruning
Yehui Tang
Yunhe Wang
Yixing Xu
Dacheng Tao
Chunjing Xu
Chao Xu
Chang Xu
AAML
37
165
0
21 Oct 2020
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
294
6,943
0
20 Apr 2018
1