The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks

9 March 2018

Jonathan Frankle

Michael Carbin

Papers citing "The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks"

50 / 2,187 papers shown

Unveiling Linguistic Regions in Large Language Models

Zhihao Zhang

Jun Zhao

Tao Gui

Xuanjing Huang

306

22 Feb 2024

NeuroFlux: Memory-Efficient CNN Training Using Adaptive Local Learning

Dhananjay Saikumar

Blesson Varghese

234

21 Feb 2024

In value-based deep reinforcement learning, a pruned network is a good network

480

19 Feb 2024

Model Tailor: Mitigating Catastrophic Forgetting in Multi-modal Large Language Models

Didi Zhu

222

19 Feb 2024

Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark

...

413

100

18 Feb 2024

Training Bayesian Neural Networks with Sparse Subspace Variational Inference

203

16 Feb 2024

Transformers Can Achieve Length Generalization But Not Robustly

286

14 Feb 2024

Graph Inference Acceleration by Learning MLPs on Graphs without Supervision

177

14 Feb 2024

Towards Meta-Pruning via Optimal Transport

393

12 Feb 2024

Continual Learning on Graphs: A Survey

Zonggui Tian

Duanhao Zhang

Hong-Ning Dai

328

09 Feb 2024

How Uniform Random Weights Induce Non-uniform Bias: Typical Interpolating Neural Networks Generalize with Narrow Teachers

335

09 Feb 2024

The SkipSponge Attack: Sponge Weight Poisoning of Deep Neural Networks

336

09 Feb 2024

Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes

Graham Neubig

283

08 Feb 2024

Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications

Kaixuan Huang

Mengdi Wang

312

174

07 Feb 2024

Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers

314

07 Feb 2024

Enhance DNN Adversarial Robustness and Efficiency via Injecting Noise to Non-Essential Neurons

Zhenyu Liu

Garrett Gagnon

Swagath Venkataramani

Liu Liu

AAML

242

06 Feb 2024

Analysis of Linear Mode Connectivity via Permutation-Based Weight Matching: With Insights into Other Permutation Search Methods

International Conference on Learning Representations (ICLR), 2024

717

06 Feb 2024

Single-GPU GNN Systems: Traps and Pitfalls

236

05 Feb 2024

Less is KEN: a Universal and Simple Non-Parametric Pruning Algorithm for Large Language Models

Annual Meeting of the Association for Computational Linguistics (ACL), 2024

Michele Mastromattei

Fabio Massimo Zanzotto

VLM

236

05 Feb 2024

Discovering interpretable models of scientific image data with deep learning

Christopher J. Soelistyo

Alan R. Lowe

195

05 Feb 2024

Dynamic Sparse Learning: A Novel Paradigm for Efficient Recommendation

143

05 Feb 2024

KS-Lottery: Finding Certified Lottery Tickets for Multilingual Language Models

Lei Li

285

05 Feb 2024

EXGC: Bridging Efficiency and Explainability in Graph Condensation

285

05 Feb 2024

On the Role of Initialization on the Implicit Bias in Deep Linear Networks

Oria Gruber

H. Avron

AI4CE

158

04 Feb 2024

Defining Neural Network Architecture through Polytope Structures of Dataset

Sangmin Lee

Abbas Mammadov

Jong Chul Ye

403

04 Feb 2024

Optimal Parameter and Neuron Pruning for Out-of-Distribution Detection

243

04 Feb 2024

From PEFT to DEFT: Parameter Efficient Finetuning for Reducing Activation Density in Transformers

412

02 Feb 2024

Ultrafast jet classification on FPGAs for the HL-LHC

Patrick Odagiu

Zhiqiang Que

Javier Mauricio Duarte

...

224

02 Feb 2024

TEDDY: Trimming Edges with Degree-based Discrimination strategY

Hyunjin Seo

Jihun Yun

Eunho Yang

259

02 Feb 2024

Faster and Lighter LLMs: A Survey on Current Challenges and Way Forward

298

02 Feb 2024

No Free Prune: Information-Theoretic Barriers to Pruning at Initialization

Tanishq Kumar

Kevin Luo

Mark Sellke

261

02 Feb 2024

A practical existence theorem for reduced order models based on convolutional autoencoders

N. R. Franco

Simone Brugiapaglia

AI4CE

311

01 Feb 2024

EPSD: Early Pruning with Self-Distillation for Efficient Model Compression

237

31 Jan 2024

X-PEFT: eXtremely Parameter-Efficient Fine-Tuning for Extreme Multi-Profile Scenarios

Namju Kwak

Taesup Kim

MoE

109

29 Jan 2024

A Comprehensive Survey of Compression Algorithms for Language Models

329

27 Jan 2024

NACHOS: Neural Architecture Search for Hardware Constrained Early Exit Neural Networks

IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024

357

24 Jan 2024

Dynamic Layer Tying for Parameter-Efficient Transformers

International Conference on Learning Representations (ICLR), 2024

Tamir David Hay

Lior Wolf

171

23 Jan 2024

APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference

International Conference on Machine Learning (ICML), 2024

Bowen Zhao

Hannaneh Hajishirzi

Qingqing Cao

356

22 Jan 2024

Rethinking Centered Kernel Alignment in Knowledge Distillation

International Joint Conference on Artificial Intelligence (IJCAI), 2024

472

22 Jan 2024

PRILoRA: Pruned and Rank-Increasing Low-Rank Adaptation

Findings (Findings), 2024

Nadav Benedek

Lior Wolf

163

20 Jan 2024

Manipulating Sparse Double Descent

Ya Shi Zhang

169

19 Jan 2024

Enhancing Scalability in Recommender Systems through Lottery Ticket Hypothesis and Knowledge Distillation-based Neural Network Pruning

118

19 Jan 2024

Model Compression Techniques in Biometrics Applications: A Survey

Naser Damer

294

18 Jan 2024

FedLoGe: Joint Local and Generic Federated Learning under Long-tailed Data

International Conference on Learning Representations (ICLR), 2024

292

17 Jan 2024

Stochastic Subnetwork Annealing: A Regularization Technique for Fine Tuning Pruned Subnetworks

Tim Whitaker

Darrell Whitley

295

16 Jan 2024

Convolutional Neural Network Compression via Dynamic Parameter Rank Pruning

IEEE Access (IEEE Access), 2024

240

15 Jan 2024

Harnessing the Power of Beta Scoring in Deep Active Learning for Multi-Label Text Classification

AAAI Conference on Artificial Intelligence (AAAI), 2024

189

15 Jan 2024

Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language Models

Annual Meeting of the Association for Computational Linguistics (ACL), 2024

198

13 Jan 2024

Always-Sparse Training by Growing Connections with Guided Stochastic Exploration

524

12 Jan 2024

A Survey on Efficient Federated Learning Methods for Foundation Model Training

International Joint Conference on Artificial Intelligence (IJCAI), 2024

Herbert Woisetschläger

293

09 Jan 2024