Pruning Redundant Mappings in Transformer Models via Spectral-Normalized Identity Prior

Findings (Findings), 2020

5 October 2020

Papers citing "Pruning Redundant Mappings in Transformer Models via Spectral-Normalized Identity Prior"

22 / 22 papers shown

SparsyFed: Sparse Adaptive Federated Training

403

07 Apr 2025

STAT: Shrinking Transformers After Training

296

29 May 2024

The Need for Speed: Pruning Transformers with One Recipe

Samir Khaki

Konstantinos N. Plataniotis

351

26 Mar 2024

Model Compression and Efficient Inference for Large Language Models: A Survey

284

15 Feb 2024

A Comprehensive Survey of Compression Algorithms for Language Models

329

27 Jan 2024

Breaking through Deterministic Barriers: Randomized Pruning Mask Generation and Selection

Jianwei Li

Weizhi Gao

Qi Lei

Dongkuan Xu

344

19 Oct 2023

Sub-network Discovery and Soft-masking for Continual Learning of Mixed Tasks

240

13 Oct 2023

Pit One Against Many: Leveraging Attention-head Embeddings for Parameter-efficient Multi-head AttentionConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Huiyin Xue

Nikolaos Aletras

317

11 Oct 2023

$$\rm SP^3$: Enhancing Structured Pruning via PCA Projection$

\rm SP^3

: Enhancing Structured Pruning via PCA ProjectionAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

245

31 Aug 2023

Accurate Retraining-free Pruning for Pretrained Encoder-based Language ModelsInternational Conference on Learning Representations (ICLR), 2023

230

07 Aug 2023

Towards Learning Discrete Representations via Self-Supervision for Wearables-Based Human Activity RecognitionItalian National Conference on Sensors (INS), 2023

H. Haresamudram

Irfan Essa

Thomas Ploetz

233

01 Jun 2023

HighLight: Efficient and Flexible DNN Acceleration with Hierarchical Structured Sparsity

185

22 May 2023

Gradient-Free Structured Pruning with Unlabeled DataInternational Conference on Machine Learning (ICML), 2023

262

07 Mar 2023

Learning a Consensus Sub-Network with Polarization Regularization and One Pass TrainingData mining and knowledge discovery (DMKD), 2023

Sean Moran

443

17 Feb 2023

An Empirical Study on the Transferability of Transformer Modules in Parameter-Efficient Fine-TuningConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Mohammad AkbarTajari

S. Rajaee

Mohammad Taher Pilehvar

228

01 Feb 2023

Adapting a Language Model While Preserving its General KnowledgeConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

177

21 Jan 2023

A Fast Post-Training Pruning Framework for TransformersNeural Information Processing Systems (NeurIPS), 2022

Sehoon Kim

214

201

29 Mar 2022

A Survey on Model Compression and Acceleration for Pretrained Language ModelsAAAI Conference on Artificial Intelligence (AAAI), 2022

Canwen Xu

Julian McAuley

354

15 Feb 2022

Vision Transformer Slimming: Multi-Dimension Searching in Continuous Optimization SpaceComputer Vision and Pattern Recognition (CVPR), 2022

269

03 Jan 2022

From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression

Runxin Xu

Chengyu Wang

Fei Huang

14 Dec 2021

Learned Token Pruning for Transformers

Sehoon Kim

339

192

02 Jul 2021

Compressing Large-Scale Transformer-Based Models: A Case Study on BERTTransactions of the Association for Computational Linguistics (TACL), 2020

425

213

27 Feb 2020