v1v2v3v4v5 (latest)

The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks

9 March 2018

Jonathan Frankle

Michael Carbin

ArXiv (abs)PDF HTML

Papers citing "The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks"

50 / 2,186 papers shown

Efficient Training of Large Vision Models via Advanced Automated Progressive Learning

Changlin Li

265

06 Sep 2024

WaterMAS: Sharpness-Aware Maximization for Neural Network WatermarkingInternational Conference on Pattern Recognition (ICPR), 2024

230

05 Sep 2024

Modularity in Transformers: Investigating Neuron Separability & Specialization

Nicholas Pochinkov

Thomas Jones

Mohammed Rashidur Rahman

175

30 Aug 2024

Learning effective pruning at initialization from iterative pruning

Fusheng Zha

240

27 Aug 2024

3D Point Cloud Network Pruning: When Some Weights Do not MatterBritish Machine Vision Conference (BMVC), 2024

293

26 Aug 2024

Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers

Sayed Mohammad Vakilzadeh Hatefi

229

22 Aug 2024

Weight Scope Alignment: A Frustratingly Easy Method for Model MergingEuropean Conference on Artificial Intelligence (ECAI), 2024

293

22 Aug 2024

A Tighter Complexity Analysis of SparseGPT

306

22 Aug 2024

A Greedy Hierarchical Approach to Whole-Network Filter-Pruning in CNNs

Kiran Purohit

Anurag Parvathgari

Sourangshu Bhattacharya

VLM

238

22 Aug 2024

Transformers As Approximations of Solomonoff InductionInternational Conference on Neural Information Processing (ICONIP), 2024

Nathan Young

Michael Witbrock

108

22 Aug 2024

Approaching Deep Learning through the Spectral Dynamics of Weights

327

21 Aug 2024

On Learnable Parameters of Optimal and Suboptimal Deep Learning ModelsInternational Conference on Neural Information Processing (ICONIP), 2024

Huizhi Liang

Giuseppe Nicosia

126

21 Aug 2024

First Activations Matter: Training-Free Methods for Dynamic Activation in Large Language Models

Yujie Wang

250

21 Aug 2024

Mask in the Mirror: Implicit SparsificationInternational Conference on Learning Representations (ICLR), 2024

Tom Jacobs

R. Burkholz

488

19 Aug 2024

Activated Parameter Locating via Causal Intervention for Model Merging

162

18 Aug 2024

Antidote: Post-fine-tuning Safety Alignment for Large Language Models against Harmful Fine-tuning

592

18 Aug 2024

Compress and Compare: Interactively Evaluating Efficiency and Behavior Across ML Model Compression ExperimentsIEEE Transactions on Visualization and Computer Graphics (TVCG), 2024

224

06 Aug 2024

Masked Random Noise for Communication Efficient Federaetd LearningACM Multimedia (MM), 2024

Haozhao Wang

208

06 Aug 2024

Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative SemanticsEuropean Conference on Computer Vision (ECCV), 2024

195

05 Aug 2024

Pruning Large Language Models with Semi-Structural Adaptive Sparse Training

Weiyu Huang

Yuezhou Hu

Guohao Jian

Jun Zhu

Jianfei Chen

323

30 Jul 2024

Machine Unlearning in Generative AI: A Survey

327

30 Jul 2024

ThinK: Thinner Key Cache by Query-Driven PruningInternational Conference on Learning Representations (ICLR), 2024

525

30 Jul 2024

FIARSE: Model-Heterogeneous Federated Learning via Importance-Aware Submodel ExtractionNeural Information Processing Systems (NeurIPS), 2024

Tianci Liu

Jing Gao

309

28 Jul 2024

Towards the Dynamics of a DNN Learning Symbolic InteractionsNeural Information Processing Systems (NeurIPS), 2024

Yang Xu

234

27 Jul 2024

Greedy Output Approximation: Towards Efficient Structured Pruning for LLMs Without Retraining

Jianwei Li

Yijun Dong

Qi Lei

374

26 Jul 2024

Finite Neural Networks as Mixtures of Gaussian Processes: From Provable Error Bounds to Prior Selection

374

26 Jul 2024

Nerva: a Truly Sparse Implementation of Neural Networks

Wieger Wesselink

Bram Grooten

Qiao Xiao

Cássio Machado de Campos

Mykola Pechenizkiy

172

24 Jul 2024

Self-driving lab discovers principles for steering spontaneous emission

150

22 Jul 2024

Knowledge Mechanisms in Large Language Models: A Survey and Perspective

Shumin Deng

...

Yong Jiang

Pengjun Xie

Fei Huang

Huajun Chen

Ningyu Zhang

332

22 Jul 2024

Out of spuriousity: Improving robustness to spurious correlations without group annotations

187

20 Jul 2024

LORTSAR: Low-Rank Transformer for Skeleton-based Action Recognition

Soroush Oraki

Harry Zhuang

Jie Liang

156

19 Jul 2024

Training-Free Model Merging for Multi-target Domain Adaptation

Hao Zhao

238

18 Jul 2024

Accurate Mapping of RNNs on Neuromorphic Hardware with Adaptive Spiking Neurons

163

18 Jul 2024

Tiled Bit Networks: Sub-Bit Neural Network Compression Through Reuse of Learnable Binary Vectors

Matt Gorbett

Hossein Shirazi

Indrakshi Ray

307

16 Jul 2024

Continual Deep Learning on the Edge via Stochastic Local Competition among Subnetworks

Theodoros Christophides

Kyriakos Tolias

S. Chatzis

254

15 Jul 2024

From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories, and Applications

282

15 Jul 2024

Dynamic Encoder Size Based on Data-Driven Layer-wise Pruning for Speech Recognition

Jingjing Xu

Wei Zhou

Zijian Yang

Eugen Beck

Ralf Schlueter

292

10 Jul 2024

LDGCN: An Edge-End Lightweight Dual GCN Based on Single-Channel EEG for Driver Drowsiness Monitoring

Antoni Grau

183

08 Jul 2024

The Impact of Quantization and Pruning on Deep Reinforcement Learning Models

Heng Lu

Mehdi Alemi

Reza Rawassizadeh

222

05 Jul 2024

Revealing the Utilized Rank of Subspaces of Learning in Neural Networks

135

05 Jul 2024

Sparsest Models Elude Pruning: An Exposé of Pruning's Current Capabilities

Stephen Zhang

Vardan Papyan

255

04 Jul 2024

SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning

379

03 Jul 2024

Efficient DNN-Powered Software with Fair Sparse Models

Chao Shen

214

03 Jul 2024

LPViT: Low-Power Semi-structured Pruning for Vision Transformers

Zhe Wang

Min Wu

Xiaoli Li

Weisi Lin

ViT VLM

658

02 Jul 2024

Increasing Model Capacity for Free: A Simple Strategy for Parameter Efficient Fine-tuning

Haobo Song

Hao Zhao

Soumajit Majumder

Tao Lin

183

01 Jul 2024

How Does Overparameterization Affect Features?

Ahmet Cagri Duzgun

Samy Jelassi

Yuanzhi Li

174

01 Jul 2024

Neural Networks Trained by Weight Permutation are Universal Approximators

Yongqiang Cai

Gaohang Chen

Zhonghua Qiao

533

01 Jul 2024

RepAct: The Re-parameterizable Adaptive Activation Function

Xian Wu

Qingchuan Tao

Shuang Wang

315

28 Jun 2024

FedMap: Iterative Magnitude-Based Pruning for Communication-Efficient Federated Learning

136

27 Jun 2024

Infinite Width Models That Work: Why Feature Learning Doesn't Matter as Much as You Think

Luke Sernau

120

27 Jun 2024