Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1803.03635
Cited By
v1
v2
v3
v4
v5 (latest)
The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks
9 March 2018
Jonathan Frankle
Michael Carbin
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks"
50 / 2,187 papers shown
What Happens When Small Is Made Smaller? Exploring the Impact of Compression on Small Data Pretrained Language Models
Busayo Awobade
Mardiyyah Oduwole
Steven Kolawole
200
1
0
06 Apr 2024
Spectral Graph Pruning Against Over-Squashing and Over-Smoothing
Adarsh Jamadandi
Celia Rubio-Madrigal
R. Burkholz
309
14
0
06 Apr 2024
FFN-SkipLLM: A Hidden Gem for Autoregressive Decoding with Adaptive Feed Forward Skipping
Ajay Jaiswal
Bodun Hu
Lu Yin
Yeonju Ro
Shiwei Liu
Tianlong Chen
Aditya Akella
294
21
0
05 Apr 2024
Learning smooth functions in high dimensions: from sparse polynomials to deep neural networks
Ben Adcock
Simone Brugiapaglia
N. Dexter
S. Moraga
214
8
0
04 Apr 2024
FedSelect: Personalized Federated Learning with Customized Selection of Parameters for Fine-Tuning
Computer Vision and Pattern Recognition (CVPR), 2024
Rishub Tamirisa
Chulin Xie
Wenxuan Bao
Andy Zhou
Ron Arel
Aviv Shamsian
249
36
0
03 Apr 2024
Rethinking Pruning for Vision-Language Models: Strategies for Effective Sparsity and Performance Restoration
Shwai He
Ang Li
Tianlong Chen
VLM
282
3
0
03 Apr 2024
Accelerating Transformer Pre-training with 2:4 Sparsity
International Conference on Machine Learning (ICML), 2024
Yuezhou Hu
Kang Zhao
Weiyu Huang
Jianfei Chen
Jun Zhu
288
17
0
02 Apr 2024
DRIVE: Dual Gradient-Based Rapid Iterative Pruning
Dhananjay Saikumar
Blesson Varghese
183
3
0
01 Apr 2024
Effectively Prompting Small-sized Language Models for Cross-lingual Tasks via Winning Tickets
Mingqi Li
Feng Luo
LRM
158
0
0
01 Apr 2024
Jointly Training and Pruning CNNs via Learnable Agent Guidance and Alignment
Alireza Ganjdanesh
Shangqian Gao
Heng-Chiao Huang
328
14
0
28 Mar 2024
The Unreasonable Ineffectiveness of the Deeper Layers
Andrey Gromov
Kushal Tirumala
Hassan Shapourian
Paolo Glorioso
Daniel A. Roberts
428
158
0
26 Mar 2024
Block Selective Reprogramming for On-device Training of Vision Transformers
Sreetama Sarkar
Souvik Kundu
Kai Zheng
Peter A. Beerel
227
5
0
25 Mar 2024
Insights into the Lottery Ticket Hypothesis and Iterative Magnitude Pruning
Tausifa Jan Saleem
Ramanjit Ahuja
Surendra Prasad
Brejesh Lall
315
0
0
22 Mar 2024
Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey
Zeyu Han
Chao Gao
Jinyang Liu
Jeff Zhang
Sai Qian Zhang
797
707
0
21 Mar 2024
A Bag of Tricks for Few-Shot Class-Incremental Learning
Shuvendu Roy
Chunjong Park
Aldi Fahrezi
Ali Etemad
CLL
266
6
0
21 Mar 2024
Debiasing surgeon: fantastic weights and how to find them
Rémi Nahon
Ivan Luiz De Moura Matos
Van-Tam Nguyen
Enzo Tartaglione
229
1
0
21 Mar 2024
Advancing IIoT with Over-the-Air Federated Learning: The Role of Iterative Magnitude Pruning
Fazal Muhammad Ali Khan
Hatem Abou-Zeid
Aryan Kaushik
Syed Ali Hassan
160
3
0
21 Mar 2024
Accelerating ViT Inference on FPGA through Static and Dynamic Pruning
Dhruv Parikh
Shouyi Li
Bingyi Zhang
Rajgopal Kannan
Carl E. Busart
Viktor Prasanna
258
7
0
21 Mar 2024
Searching Search Spaces: Meta-evolving a Geometric Encoding for Neural Networks
Tarek Kunze
Paul Templier
Dennis G. Wilson
173
0
0
20 Mar 2024
Unifews: You Need Fewer Operations for Efficient Graph Neural Networks
Ningyi Liao
Zihao Yu
Ruixiao Zeng
Siqiang Luo
GNN
198
0
0
20 Mar 2024
MELTing point: Mobile Evaluation of Language Transformers
Stefanos Laskaridis
Kleomenis Katevas
Lorenzo Minto
Hamed Haddadi
301
34
0
19 Mar 2024
SEVEN: Pruning Transformer Model by Reserving Sentinels
IEEE International Joint Conference on Neural Network (IJCNN), 2024
Jinying Xiao
Ping Li
Jie Nie
Zhe Tang
193
3
0
19 Mar 2024
LNPT: Label-free Network Pruning and Training
IEEE International Joint Conference on Neural Network (IJCNN), 2024
Jinying Xiao
Ping Li
Zhe Tang
Jie Nie
226
3
0
19 Mar 2024
Graph Expansion in Pruned Recurrent Neural Network Layers Preserve Performance
Suryam Arnav Kalra
Arindam Biswas
Pabitra Mitra
Biswajit Basu
GNN
190
0
0
17 Mar 2024
Federated Learning based on Pruning and Recovery
Chengjie Ma
FedML
142
1
0
16 Mar 2024
Adversarial Fine-tuning of Compressed Neural Networks for Joint Improvement of Robustness and Efficiency
Hallgrimur Thorsteinsson
Valdemar J Henriksen
Tong Chen
Raghavendra Selvan
AAML
235
1
0
14 Mar 2024
Towards a theory of model distillation
Enric Boix-Adserà
FedML
VLM
235
14
0
14 Mar 2024
Random Search as a Baseline for Sparse Neural Network Architecture Search
Rezsa Farahani
270
0
0
13 Mar 2024
Self-Regulated Neurogenesis for Online Data-Incremental Learning
Murat Onur Yildirim
Elif Ceren Gok Yildirim
Decebal Constantin Mocanu
Joaquin Vanschoren
CLL
326
0
0
13 Mar 2024
CHAI: Clustered Head Attention for Efficient LLM Inference
International Conference on Machine Learning (ICML), 2024
Saurabh Agarwal
Bilge Acun
Basil Homer
Mostafa Elhoushi
Yejin Lee
Shivaram Venkataraman
Dimitris Papailiopoulos
Carole-Jean Wu
254
12
0
12 Mar 2024
Maxwell's Demon at Work: Efficient Pruning by Leveraging Saturation of Neurons
Simon Dufort-Labbé
P. DÓro
Evgenii Nikishin
Razvan Pascanu
Pierre-Luc Bacon
A. Baratin
357
4
0
12 Mar 2024
Enhanced Sparsification via Stimulative Training
European Conference on Computer Vision (ECCV), 2024
Shengji Tang
Weihao Lin
Hancheng Ye
Peng Ye
Chong Yu
Baopu Li
Tao Chen
160
2
0
11 Mar 2024
Revisiting Edge Perturbation for Graph Neural Network in Graph Data Augmentation and Attack
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2024
Xin Liu
Yuxiang Zhang
Meng Wu
Yurui Lai
Kun He
Wei Yan
Shirui Pan
Xiaochun Ye
Xiaochun Ye
AAML
246
4
0
10 Mar 2024
POV: Prompt-Oriented View-Agnostic Learning for Egocentric Hand-Object Interaction in the Multi-View World
ACM Multimedia (ACM MM), 2023
Boshen Xu
Sipeng Zheng
Qin Jin
192
14
0
09 Mar 2024
A Survey of Lottery Ticket Hypothesis
Bohan Liu
Zijie Zhang
Peixiong He
Zhensen Wang
Yang Xiao
Ruimeng Ye
Yang Zhou
Wei-Shinn Ku
Bo Hui
UQCV
278
22
0
07 Mar 2024
MAP: MAsk-Pruning for Source-Free Model Intellectual Property Protection
Boyang Peng
Sanqing Qu
Yong Wu
Tianpei Zou
Lianghua He
Alois Knoll
Guang Chen
Changjun Jiang
AAML
194
5
0
07 Mar 2024
Training-Free Pretrained Model Merging
Zhenxing Xu
Ke Yuan
Huiqiong Wang
Yong Wang
Weilong Dai
Mingli Song
MoMe
398
24
0
04 Mar 2024
Merging Text Transformer Models from Different Initializations
Neha Verma
Maha Elbayad
MoMe
360
12
0
01 Mar 2024
Motif distribution and function of sparse deep neural networks
Olivia Zahn
Thomas L. Daniel
J. Nathan Kutz
103
0
0
01 Mar 2024
MediSwift: Efficient Sparse Pre-trained Biomedical Language Models
Vithursan Thangarasa
Mahmoud Salem
Shreyas Saxena
Kevin Leong
Joel Hestness
Sean Lie
MedIm
237
2
0
01 Mar 2024
Masks, Signs, And Learning Rate Rewinding
Advait Gadhikar
R. Burkholz
229
14
0
29 Feb 2024
How do Large Language Models Handle Multilingualism?
Yiran Zhao
Wenxuan Zhang
Guizhen Chen
Kenji Kawaguchi
Lidong Bing
LRM
374
131
0
29 Feb 2024
NeuroPrune: A Neuro-inspired Topological Sparse Training Algorithm for Large Language Models
Amit Dhurandhar
Tejaswini Pedapati
Ronny Luss
Soham Dan
Aurélie C. Lozano
Payel Das
Georgios Kollias
369
3
0
28 Feb 2024
Ef-QuantFace: Streamlined Face Recognition with Small Data and Low-Bit Precision
William Gazali
Jocelyn Michelle Kho
Joshua Santoso
Williem
CVBM
MQ
205
0
0
28 Feb 2024
REPrune: Channel Pruning via Kernel Representative Selection
Mincheol Park
Dongjin Kim
Cheonjun Park
Yuna Park
Gyeong Eun Gong
Won Woo Ro
Suhyun Kim
VLM
268
2
0
27 Feb 2024
SequentialAttention++ for Block Sparsification: Differentiable Pruning Meets Combinatorial Optimization
T. Yasuda
Kyriakos Axiotis
Gang Fu
M. Bateni
Vahab Mirrokni
433
2
0
27 Feb 2024
Asymmetry in Low-Rank Adapters of Foundation Models
Jiacheng Zhu
Kristjan Greenewald
Kimia Nadjahi
Haitz Sáez de Ocáriz Borde
Rickard Brüel-Gabrielsson
Leshem Choshen
Elisa Kreiss
Mikhail Yurochkin
Justin Solomon
332
66
0
26 Feb 2024
C
3
C^3
C
3
: Confidence Calibration Model Cascade for Inference-Efficient Cross-Lingual Natural Language Understanding
Taixi Lu
Haoyu Wang
Huajie Shao
Jing Gao
Huaxiu Yao
170
0
0
25 Feb 2024
Shaving Weights with Occam's Razor: Bayesian Sparsification for Neural Networks Using the Marginal Likelihood
Rayen Dhahri
Alexander Immer
Bertrand Charpentier
Stephan Günnemann
Vincent Fortuin
BDL
UQCV
209
7
0
25 Feb 2024
Sparse MeZO: Less Parameters for Better Performance in Zeroth-Order LLM Fine-Tuning
Yong Liu
Zirui Zhu
Chaoyu Gong
Minhao Cheng
Cho-Jui Hsieh
Yang You
MoE
241
36
0
24 Feb 2024
Previous
1
2
3
...
11
12
13
...
42
43
44
Next