Layer-wise Model Pruning based on Mutual Information

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

28 August 2021

Jiwei Li

Papers citing "Layer-wise Model Pruning based on Mutual Information"

14 / 14 papers shown

MISA: Memory-Efficient LLMs Optimization with Module-wise Importance Sampling

145

28 Oct 2025

Mix-QSAM: Mixed-Precision Quantization of the Segment Anything Model

Navin Ranjan

Andreas E. Savakis

MQ VLM

357

08 May 2025

Attention Pruning: Automated Fairness Repair of Language Models via Surrogate Simulated Annealing

433

20 Mar 2025

Dynamic Low-Rank Sparse Adaptation for Large Language ModelsInternational Conference on Learning Representations (ICLR), 2025

451

21 Feb 2025

How Redundant Is the Transformer Stack in Speech Representation Models?IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

Teresa Dorszewski

Albert Kjøller Jacobsen

Lenka Tětková

Lars Kai Hansen

347

20 Jan 2025

Layer-wise Importance Matters: Less Memory for Better Performance in Parameter-efficient Fine-tuning of Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Kai Yao

149

15 Oct 2024

Persistent Topological Features in Large Language Models

547

14 Oct 2024

MPruner: Optimizing Neural Network Size with CKA-Based Mutual Information Pruning

346

24 Aug 2024

The Remarkable Robustness of LLMs: Stages of Inference?

Vedang Lad

Wes Gurnee

Max Tegmark

519

27 Jun 2024

Large Language Model Pruning

Hanjuan Huang

Hao-Jia Song

H. Pao

415

24 May 2024

The Unreasonable Ineffectiveness of the Deeper Layers

427

157

26 Mar 2024

Fairness-Aware Structured Pruning in Transformers

235

24 Dec 2023

f-Divergence Minimization for Sequence-Level Knowledge DistillationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

273

27 Jul 2023

Pruning Pretrained Encoders with a Multitask Objective

Patrick Xia

Richard Shin

129

10 Dec 2021