The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks

9 March 2018

Jonathan Frankle

Michael Carbin

Papers citing "The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks"

50 / 2,187 papers shown

Random Sparse Lifts: Construction, Analysis and Convergence of finite sparse networks

International Conference on Learning Representations (ICLR), 2025

David A. R. Robin

Kevin Scaman

Marc Lelarge

128

10 Jan 2025

Tailored-LLaMA: Optimizing Few-Shot Learning in Pruned LLaMA Models with Task-Specific Prompts

European Conference on Artificial Intelligence (ECAI), 2024

Danyal Aftab

Steven Davy

ALM

272

10 Jan 2025

Vision Transformer Neural Architecture Search for Out-of-Distribution Generalization: Benchmark and Insights

Neural Information Processing Systems (NeurIPS), 2025

300

08 Jan 2025

Hierarchical Light Transformer Ensembles for Multimodal Trajectory Forecasting

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

423

08 Jan 2025

Training-free Heterogeneous Model Merging

492

03 Jan 2025

Lillama: Large Language Models Compression via Low-Rank Feature Distillation

Yaya Sy

Christophe Cerisara

Irina Illina

310

31 Dec 2024

Improving Quantization-aware Training of Low-Precision Network via Block Replacement on Full-Precision Counterpart

271

20 Dec 2024

A Comparative Study of Pruning Methods in Transformer-based Time Series Forecasting

270

17 Dec 2024

No More Adam: Learning Rate Scaling at Initialization is All You Need

341

16 Dec 2024

RWKV-edge: Deeply Compressed RWKV for Resource-Constrained Devices

Wonkyo Choe

Yangfeng Ji

F. Lin

512

14 Dec 2024

MOFHEI: Model Optimizing Framework for Fast and Efficient Homomorphically Encrypted Neural Network Inference

International Conference on Trust, Privacy and Security in Intelligent Systems and Applications (ICPSISA), 2024

264

10 Dec 2024

Fast Track to Winning Tickets: Repowering One-Shot Pruning for Graph Neural Networks

AAAI Conference on Artificial Intelligence (AAAI), 2024

308

10 Dec 2024

Training MLPs on Graphs without Supervision

Web Search and Data Mining (WSDM), 2024

265

05 Dec 2024

Is Oracle Pruning the True Oracle?

351

28 Nov 2024

On the Effectiveness of Incremental Training of Large Language Models

168

27 Nov 2024

Preserving Deep Representations In One-Shot Pruning: A Hessian-Free Second-Order Optimization Framework

International Conference on Learning Representations (ICLR), 2024

Ryan Lucas

Rahul Mazumder

313

27 Nov 2024

Multi-Label Bayesian Active Learning with Inter-Label Relationships

Conference on Uncertainty in Artificial Intelligence (UAI), 2024

487

26 Nov 2024

DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization

Annual Meeting of the Association for Computational Linguistics (ACL), 2024

573

21 Nov 2024

Pushing the Limits of Sparsity: A Bag of Tricks for Extreme Pruning

452

20 Nov 2024

Probe-Me-Not: Protecting Pre-trained Encoders from Malicious Probing

Network and Distributed System Security Symposium (NDSS), 2024

351

19 Nov 2024

FGP: Feature-Gradient-Prune for Efficient Convolutional Layer Pruning

246

19 Nov 2024

Electrostatic Force Regularization for Neural Structured Pruning

335

17 Nov 2024

RedTest: Towards Measuring Redundancy in Deep Neural Networks Effectively

240

15 Nov 2024

On the Surprising Effectiveness of Attention Transfer for Vision Transformers

Neural Information Processing Systems (NeurIPS), 2024

208

14 Nov 2024

Complexity-Aware Training of Deep Neural Networks for Optimal Structure Discovery

Valentin Frank Ingmar Guenter

Athanasios Sideris

CVBM

292

14 Nov 2024

FRUGAL: Memory-Efficient Optimization by Reducing State Overhead for Scalable Training

304

12 Nov 2024

Zeroth-Order Adaptive Neuron Alignment Based Pruning without Re-Training

Elia Cunegatti

Leonardo Lucio Custode

Giovanni Iacca

641

11 Nov 2024

Towards Establishing Guaranteed Error for Learned Database Operations

International Conference on Learning Representations (ICLR), 2024

Sepanta Zeighami

Cyrus Shahabi

136

09 Nov 2024

Poor Man's Training on MCUs: A Memory-Efficient Quantized Back-Propagation-Free Approach

381

07 Nov 2024

Finding Strong Lottery Ticket Networks with Genetic Algorithms

International Joint Conference on Computational Intelligence (IJCCI), 2024

204

07 Nov 2024

Neural Fingerprints for Adversarial Attack Detection

152

07 Nov 2024

Learning Morphisms with Gauss-Newton Approximation for Growing Networks

Neal Lawton

Aram Galstyan

Greg Ver Steeg

205

07 Nov 2024

Sparse Orthogonal Parameters Tuning for Continual Learning

306

05 Nov 2024

Navigating Extremes: Dynamic Sparsity in Large Output Spaces

Neural Information Processing Systems (NeurIPS), 2024

413

05 Nov 2024

Expanding Sparse Tuning for Low Memory Usage

Neural Information Processing Systems (NeurIPS), 2024

329

04 Nov 2024

Double Descent Meets Out-of-Distribution Detection: Theoretical Insights and Empirical Analysis on the role of model complexity

395

04 Nov 2024

Decoupling Dark Knowledge via Block-wise Logit Distillation for Feature-level Alignment

IEEE Transactions on Artificial Intelligence (IEEE TAI), 2024

348

03 Nov 2024

Magnitude Pruning of Large Pretrained Transformer Models with a Mixture Gaussian Prior

Journal of Data Science (JDS), 2024

Mingxuan Zhang

Y. Sun

F. Liang

298

01 Nov 2024

Chasing Better Deep Image Priors between Over- and Under-parameterization

333

31 Oct 2024

Neural Network Matrix Product Operator: A Multi-Dimensionally Integrable Machine Learning Potential

Physical Review Research (PRR), 2024

Kentaro Hino

Yuki Kurashige

341

31 Oct 2024

Mutual Information Preserving Neural Network Pruning

Charles Westphal

Stephen Hailes

Mirco Musolesi

469

31 Oct 2024

BLAST: Block-Level Adaptive Structured Matrices for Efficient Deep Neural Network Inference

Neural Information Processing Systems (NeurIPS), 2024

285

28 Oct 2024

Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA

International Conference on Learning Representations (ICLR), 2024

396

28 Oct 2024

FuseFL: One-Shot Federated Learning through the Lens of Causality with Progressive Model Fusion

Neural Information Processing Systems (NeurIPS), 2024

329

27 Oct 2024

Uncovering Capabilities of Model Pruning in Graph Contrastive Learning

ACM Multimedia (MM), 2024

Wu Junran

Chen Xueyuan

Li Shangzhe

254

27 Oct 2024

Model merging with SVD to tie the Knots

International Conference on Learning Representations (ICLR), 2024

333

25 Oct 2024

Expose Before You Defend: Unifying and Enhancing Backdoor Defenses via Exposed Models

Xingjun Ma

190

25 Oct 2024

CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Activation

Jianyi Zhang

Yiran Chen

108

23 Oct 2024

Local Contrastive Editing of Gender Stereotypes

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

295

23 Oct 2024

Beware of Calibration Data for Pruning Large Language Models

International Conference on Learning Representations (ICLR), 2024

322

23 Oct 2024