ResearchTrend.AI

The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks
Jonathan Frankle, Michael Carbin
arXiv:1803.03635 (v5, latest) · 9 March 2018

Papers citing "The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks"

50 / 2,186 papers shown
Finding Stable Subnetworks at Initialization with Dataset Distillation. Luke McDermott, Rahul Parhi. 23 Mar 2025.
Staying Alive: Online Neural Network Maintenance and Systemic Drift. Joshua Edward Hammond, Tyler Soderstrom, Brian A. Korgel, Michael Baldea. 22 Mar 2025.
Temporal Action Detection Model Compression by Progressive Block Drop. Xiaoyong Chen, Yong Guo, Jiaming Liang, Sitong Zhuang, Runhao Zeng, Xiping Hu. Computer Vision and Pattern Recognition (CVPR), 2025. 21 Mar 2025.
Structure Is Not Enough: Leveraging Behavior for Neural Network Weight Reconstruction. Léo Meynent, Ivan Melev, Konstantin Schurholt, Göran Kauermann, Damian Borth. 21 Mar 2025.
LoRASculpt: Sculpting LoRA for Harmonizing General and Specialized Knowledge in Multimodal Large Language Models. Jian Liang, Wenke Huang, Guancheng Wan, Qu Yang, Mang Ye. Computer Vision and Pattern Recognition (CVPR), 2025. 21 Mar 2025.
FeNeC: Enhancing Continual Learning via Feature Clustering with Neighbor- or Logit-Based Classification. Kamil Książek, Hubert Jastrzębski, Bartosz Trojan, Krzysztof Pniaczek, Michał Karp, Jacek Tabor. 18 Mar 2025.
Trading-off Accuracy and Communication Cost in Federated Learning. Mattia Jacopo Villani, Emanuele Natale, Frederik Mallmann-Trenn. Adaptive Agents and Multi-Agent Systems (AAMAS), 2025. 18 Mar 2025.
Enhanced Soups for Graph Neural Networks. Joseph Zuber, Aishwarya Sarkar, Joseph Jennings, Ali Jannesari. IEEE International Symposium on Parallel & Distributed Processing, Workshops and PhD Forum (IPDPS), 2025. 14 Mar 2025.
Are formal and functional linguistic mechanisms dissociated in language models? Michael Hanna, Sandro Pezzelle, Yonatan Belinkov. 14 Mar 2025.
Explainable Bayesian deep learning through input-skip Latent Binary Bayesian Neural Networks. Eirik Høyheim, Lars Skaaret-Lund, Solve Sæbø, A. Hubin. 13 Mar 2025.
PRISM: Privacy-Preserving Improved Stochastic Masking for Federated Generative Models. Kyeongkook Seo, Dong-Jun Han, Jaejun Yoo. International Conference on Learning Representations (ICLR), 2025. 11 Mar 2025.
ResMoE: Space-efficient Compression of Mixture of Experts LLMs via Residual Restoration. Mengting Ai, Tianxin Wei, Yifan Chen, Zhichen Zeng, Ritchie Zhao, G. Varatkar, B. Rouhani, Xianfeng Tang, Hanghang Tong, Jingrui He. Knowledge Discovery and Data Mining (KDD), 2025. 10 Mar 2025.
PRO-VPT: Distribution-Adaptive Visual Prompt Tuning via Prompt Relocation. Chikai Shang, Mengke Li, Yiqun Zhang, Daming Gao, Jinlin Wu, Fangqing Gu, Yang Lu, Yiu-ming Cheung. 10 Mar 2025.
How can representation dimension dominate structurally pruned LLMs? Mingxue Xu, Lisa Alazraki, Danilo Mandic. 06 Mar 2025.
Wanda++: Pruning Large Language Models via Regional Gradients. Yifan Yang, Kai Zhen, Bhavana Ganesh, Aram Galstyan, Goeric Huybrechts, ..., S. Bodapati, Nathan Susanj, Zheng Zhang, Jack FitzGerald, Abhishek Kumar. Annual Meeting of the Association for Computational Linguistics (ACL), 2025. 06 Mar 2025.
GaussianVideo: Efficient Video Representation and Compression by Gaussian Splatting. Inseo Lee, Youngyoon Choi, Joonseok Lee. 06 Mar 2025.
Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model. Wenke Huang, Jian Liang, Xianda Guo, Yiyang Fang, Guancheng Wan, ..., Bin Yang, He Li, Jiawei Shao, Mang Ye, Di Lin. 06 Mar 2025.
A Theory of Initialisation's Impact on Specialisation. Devon Jarvis, Sebastian Lee, Clémentine Dominé, Andrew M. Saxe, Stefano Sarao Mannelli. International Conference on Learning Representations (ICLR), 2025. 04 Mar 2025.
Eau De $Q$-Network: Adaptive Distillation of Neural Networks in Deep Reinforcement Learning. Théo Vincent, Tim Lukas Faust, Yogesh Tripathi, Jan Peters, Carlo D'Eramo. 03 Mar 2025.
Everything, Everywhere, All at Once: Is Mechanistic Interpretability Identifiable? Maxime Méloux, Silviu Maniu, François Portet, Maxime Peyrard. International Conference on Learning Representations (ICLR), 2025. 28 Feb 2025.
Position: Solve Layerwise Linear Models First to Understand Neural Dynamical Phenomena (Neural Collapse, Emergence, Lazy/Rich Regime, and Grokking). Yoonsoo Nam, Seok Hyeong Lee, Clementine Domine, Yea Chan Park, Charles London, Wonyl Choi, Niclas Goring, Seungjai Lee. 28 Feb 2025.
Sparse Brains are Also Adaptive Brains: Cognitive-Load-Aware Dynamic Activation for LLMs. Yiheng Yang, Yujie Wang, Chi Ma, Lei Yu, Emmanuele Chersoni, Chu-Ren Huang. 26 Feb 2025.
On Pruning State-Space LLMs. Tamer Ghattas, Michael Hassid, Roy Schwartz. 26 Feb 2025.
CABS: Conflict-Aware and Balanced Sparsification for Enhancing Model Merging. Zongzhen Yang, Binhang Qi, Hailong Sun, Wenrui Long, Ruobing Zhao, Yantao Du. 26 Feb 2025.
Constraining Sequential Model Editing with Editing Anchor Compression. Hao-Xiang Xu, Jun-Yu Ma, Zhen-Hua Ling, Ningyu Zhang, Jia-Chen Gu. North American Chapter of the Association for Computational Linguistics (NAACL), 2025. 25 Feb 2025.
Personalized Federated Learning for Egocentric Video Gaze Estimation with Comprehensive Parameter Frezzing. Yuhu Feng, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama. 25 Feb 2025.
Geometric Properties and Graph-Based Optimization of Neural Networks: Addressing Non-Linearity, Dimensionality, and Scalability. Michael Wienczkowski, Addisu Desta, Paschal Ugochukwu. 24 Feb 2025.
Systematic Weight Evaluation for Pruning Large Language Models: Enhancing Performance and Sustainability. Ashhadul Islam, S. Belhaouari, Amine Bermak. 24 Feb 2025.
Spectral Theory for Edge Pruning in Asynchronous Recurrent Graph Neural Networks. Nicolas Bessone. 23 Feb 2025.
Automatic Joint Structured Pruning and Quantization for Efficient Neural Network Training and Compression. Xiaoyi Qu, David Aponte, Colby R. Banbury, Daniel P. Robinson, Tianyu Ding, K. Koishida, Ilya Zharkov, Tianyi Chen. Computer Vision and Pattern Recognition (CVPR), 2025. 23 Feb 2025.
Keep what you need: extracting efficient subnetworks from large audio representation models. David Genova, P. Esling, Tom Hurlin. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025. 18 Feb 2025.
Signal Collapse in One-Shot Pruning: When Sparse Models Fail to Distinguish Neural Representations. Dhananjay Saikumar, Blesson Varghese. 18 Feb 2025.
FitLight: Federated Imitation Learning for Plug-and-Play Autonomous Traffic Signal Control. Yutong Ye, Yingbo Zhou, Zhusen Liu, Xiao Du, Hao Zhou, Xiang Lian, Xiao He. 17 Feb 2025.
Fishing For Cheap And Efficient Pruners At Initialization. Ivo Gollini Navarrete, Nicolas Mauricio Cuadrado, Jose Renato Restom, Martin Takáč, Samuel Horvath. 17 Feb 2025.
Forget the Data and Fine-Tuning! Just Fold the Network to Compress. Dong Wang, Haris Šikić, Lothar Thiele, O. Saukh. International Conference on Learning Representations (ICLR), 2025. 14 Feb 2025.
Graph Neural Networks at a Fraction. Rucha Bhalchandra Joshi, Sagar Prakash Barad, Nidhi Tiwari, Subhankar Mishra. Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2025. 10 Feb 2025.
The impact of allocation strategies in subset learning on the expressive power of neural networks. Ofir Schlisselberg, Ran Darshan. International Conference on Learning Representations (ICLR), 2025. 10 Feb 2025.
Contrastive Representation Distillation via Multi-Scale Feature Decoupling. Cuipeng Wang, Tieyuan Chen. 09 Feb 2025.
Training-Free Restoration of Pruned Neural Networks. Keonho Lee, Minsoo Kim, Dong-Wan Choi. 06 Feb 2025.
Studying Cross-cluster Modularity in Neural Networks. Satvik Golechha, Maheep Chaudhary, Joan Velja, Alessandro Abate, Nandi Schoots. 04 Feb 2025.
Deep Weight Factorization: Sparse Learning Through the Lens of Artificial Symmetries. Chris Kolb, T. Weber, B. Bischl, David Rügamer. International Conference on Learning Representations (ICLR), 2025. 04 Feb 2025.
Accelerating Linear Recurrent Neural Networks for the Edge with Unstructured Sparsity. Alessandro Pierro, Steven Abreu, Jonathan Timcheck, Philipp Stratmann, Andreas Wild, S. Shrestha. 03 Feb 2025.
Language Bias in Self-Supervised Learning For Automatic Speech Recognition. Edward Storey, Naomi Harte, Peter Bell. 31 Jan 2025.
Symmetric Pruning of Large Language Models. Kai Yi, Peter Richtárik. 31 Jan 2025.
Algebra Unveils Deep Learning -- An Invitation to Neuroalgebraic Geometry. Giovanni Luca Marchetti, Vahid Shahverdi, Stefano Mereta, Matthew Trager, Kathlén Kohn. 31 Jan 2025.
Information Consistent Pruning: How to Efficiently Search for Sparse Networks? Soheil Gharatappeh, Salimeh Yasaei Sekeh. 28 Jan 2025.
Meta-Sparsity: Learning Optimal Sparse Structures in Multi-task Networks through Meta-learning. Richa Upadhyay, Ronald Phlypo, Rajkumar Saini, Marcus Liwicki. 21 Jan 2025.
Playing the Lottery With Concave Regularizers for Sparse Trainable Neural Networks. Giulia Fracastoro, Sophie M. Fosson, Andrea Migliorati, G. Calafiore. IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024. 19 Jan 2025.
DynST: Dynamic Sparse Training for Resource-Constrained Spatio-Temporal Forecasting. Hao Wu, Haomin Wen, Guibin Zhang, Yutong Xia, Kai Wang, Yuxuan Liang, Yu Zheng, Kun Wang. Knowledge Discovery and Data Mining (KDD), 2024. 17 Jan 2025.
Pruning for Sparse Diffusion Models based on Gradient Flow. Ben Wan, Tianyi Zheng, Zhaoyu Chen, Yuxiao Wang, Jia Wang. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025. 17 Jan 2025.