ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.03635
  4. Cited By
The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks
v1v2v3v4v5 (latest)

The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks

9 March 2018
Jonathan Frankle
Michael Carbin
ArXiv (abs)PDFHTML

Papers citing "The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks"

50 / 2,187 papers shown
Unveiling Linguistic Regions in Large Language Models
Unveiling Linguistic Regions in Large Language Models
Zhihao Zhang
Jun Zhao
Tao Gui
Tao Gui
Xuanjing Huang
306
17
0
22 Feb 2024
NeuroFlux: Memory-Efficient CNN Training Using Adaptive Local Learning
NeuroFlux: Memory-Efficient CNN Training Using Adaptive Local Learning
Dhananjay Saikumar
Blesson Varghese
234
2
0
21 Feb 2024
In value-based deep reinforcement learning, a pruned network is a good
  network
In value-based deep reinforcement learning, a pruned network is a good network
J. Obando-Ceron
Rameswar Panda
Pablo Samuel Castro
OffRL
480
31
0
19 Feb 2024
Model Tailor: Mitigating Catastrophic Forgetting in Multi-modal Large
  Language Models
Model Tailor: Mitigating Catastrophic Forgetting in Multi-modal Large Language Models
Didi Zhu
Zhongyi Sun
Zexi Li
Zhenyuan Zhang
Ke Yan
Shouhong Ding
Kun Kuang
Chao Wu
CLLKELMMoMe
222
45
0
19 Feb 2024
Revisiting Zeroth-Order Optimization for Memory-Efficient LLM
  Fine-Tuning: A Benchmark
Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark
Yihua Zhang
Pingzhi Li
Junyuan Hong
Jiaxiang Li
Yimeng Zhang
...
Wotao Yin
Mingyi Hong
Zinan Lin
Sijia Liu
Tianlong Chen
413
100
0
18 Feb 2024
Training Bayesian Neural Networks with Sparse Subspace Variational
  Inference
Training Bayesian Neural Networks with Sparse Subspace Variational Inference
Junbo Li
Zichen Miao
Qiang Qiu
Ruqi Zhang
BDLUQCV
203
10
0
16 Feb 2024
Transformers Can Achieve Length Generalization But Not Robustly
Transformers Can Achieve Length Generalization But Not Robustly
Yongchao Zhou
Uri Alon
Xinyun Chen
Xuezhi Wang
Rishabh Agarwal
Denny Zhou
286
66
0
14 Feb 2024
Graph Inference Acceleration by Learning MLPs on Graphs without
  Supervision
Graph Inference Acceleration by Learning MLPs on Graphs without Supervision
Zehong Wang
Zheyuan Zhang
Chuxu Zhang
Yanfang Ye
177
0
0
14 Feb 2024
Towards Meta-Pruning via Optimal Transport
Towards Meta-Pruning via Optimal Transport
Alexander Theus
Olin Geimer
Friedrich Wicke
Thomas Hofmann
Sotiris Anagnostidis
Sidak Pal Singh
MoMe
393
6
0
12 Feb 2024
Continual Learning on Graphs: A Survey
Continual Learning on Graphs: A Survey
Zonggui Tian
Duanhao Zhang
Hong-Ning Dai
328
13
0
09 Feb 2024
How Uniform Random Weights Induce Non-uniform Bias: Typical
  Interpolating Neural Networks Generalize with Narrow Teachers
How Uniform Random Weights Induce Non-uniform Bias: Typical Interpolating Neural Networks Generalize with Narrow Teachers
G. Buzaglo
I. Harel
Mor Shpigel Nacson
Alon Brutzkus
Nathan Srebro
Daniel Soudry
335
9
0
09 Feb 2024
The SkipSponge Attack: Sponge Weight Poisoning of Deep Neural Networks
The SkipSponge Attack: Sponge Weight Poisoning of Deep Neural Networks
Jona te Lintelo
Stefanos Koffas
S. Picek
AAML
336
3
0
09 Feb 2024
Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes
Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes
Lucio Dery
Steven Kolawole
Jean-Francois Kagey
Virginia Smith
Graham Neubig
Ameet Talwalkar
283
46
0
08 Feb 2024
Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank
  Modifications
Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications
Boyi Wei
Kaixuan Huang
Yangsibo Huang
Tinghao Xie
Xiangyu Qi
Mengzhou Xia
Prateek Mittal
Mengdi Wang
Peter Henderson
AAML
312
174
0
07 Feb 2024
Progressive Gradient Flow for Robust N:M Sparsity Training in
  Transformers
Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers
Abhimanyu Bambhaniya
Amir Yazdanbakhsh
Suvinay Subramanian
Sheng-Chun Kao
Shivani Agrawal
Utku Evci
Tushar Krishna
314
23
0
07 Feb 2024
Enhance DNN Adversarial Robustness and Efficiency via Injecting Noise to
  Non-Essential Neurons
Enhance DNN Adversarial Robustness and Efficiency via Injecting Noise to Non-Essential Neurons
Zhenyu Liu
Garrett Gagnon
Swagath Venkataramani
Liu Liu
AAML
242
2
0
06 Feb 2024
Analysis of Linear Mode Connectivity via Permutation-Based Weight Matching: With Insights into Other Permutation Search Methods
Analysis of Linear Mode Connectivity via Permutation-Based Weight Matching: With Insights into Other Permutation Search MethodsInternational Conference on Learning Representations (ICLR), 2024
Akira Ito
Masanori Yamada
Atsutoshi Kumagai
MoMe
717
11
0
06 Feb 2024
Single-GPU GNN Systems: Traps and Pitfalls
Single-GPU GNN Systems: Traps and Pitfalls
Yidong Gong
A. Tarafder
Saima Afrin
Pradeep Kumar
GNN
236
2
0
05 Feb 2024
Less is KEN: a Universal and Simple Non-Parametric Pruning Algorithm for
  Large Language Models
Less is KEN: a Universal and Simple Non-Parametric Pruning Algorithm for Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Michele Mastromattei
Fabio Massimo Zanzotto
VLM
236
3
0
05 Feb 2024
Discovering interpretable models of scientific image data with deep
  learning
Discovering interpretable models of scientific image data with deep learning
Christopher J. Soelistyo
Alan R. Lowe
195
8
0
05 Feb 2024
Dynamic Sparse Learning: A Novel Paradigm for Efficient Recommendation
Dynamic Sparse Learning: A Novel Paradigm for Efficient Recommendation
Shuyao Wang
Yongduo Sui
Jiancan Wu
Zhi Zheng
Hui Xiong
143
25
0
05 Feb 2024
KS-Lottery: Finding Certified Lottery Tickets for Multilingual Language
  Models
KS-Lottery: Finding Certified Lottery Tickets for Multilingual Language Models
Fei Yuan
Chang Ma
Shuai Yuan
Qiushi Sun
Lei Li
285
3
0
05 Feb 2024
EXGC: Bridging Efficiency and Explainability in Graph Condensation
EXGC: Bridging Efficiency and Explainability in Graph Condensation
Cunchun Li
Xinglin Li
Yongduo Sui
Yuan Gao
Guibin Zhang
Kun Wang
Xiang Wang
Xiangnan He
DD
285
27
0
05 Feb 2024
On the Role of Initialization on the Implicit Bias in Deep Linear
  Networks
On the Role of Initialization on the Implicit Bias in Deep Linear Networks
Oria Gruber
H. Avron
AI4CE
158
1
0
04 Feb 2024
Defining Neural Network Architecture through Polytope Structures of
  Dataset
Defining Neural Network Architecture through Polytope Structures of Dataset
Sangmin Lee
Abbas Mammadov
Jong Chul Ye
403
1
0
04 Feb 2024
Optimal Parameter and Neuron Pruning for Out-of-Distribution Detection
Optimal Parameter and Neuron Pruning for Out-of-Distribution Detection
Chao Chen
Zhihang Fu
Kai-Chun Liu
Ze Chen
Mingyuan Tao
Jieping Ye
OODD
243
6
0
04 Feb 2024
From PEFT to DEFT: Parameter Efficient Finetuning for Reducing
  Activation Density in Transformers
From PEFT to DEFT: Parameter Efficient Finetuning for Reducing Activation Density in Transformers
Bharat Runwal
Tejaswini Pedapati
Pin-Yu Chen
MoE
412
8
0
02 Feb 2024
Ultrafast jet classification on FPGAs for the HL-LHC
Ultrafast jet classification on FPGAs for the HL-LHC
Patrick Odagiu
Zhiqiang Que
Javier Mauricio Duarte
J. Haller
Gregor Kasieczka
...
Arpita Seksaria
S. Summers
A. Sznajder
A. Tapper
Thea Klæboe Årrestad
224
14
0
02 Feb 2024
TEDDY: Trimming Edges with Degree-based Discrimination strategY
TEDDY: Trimming Edges with Degree-based Discrimination strategY
Hyunjin Seo
Jihun Yun
Eunho Yang
259
3
0
02 Feb 2024
Faster and Lighter LLMs: A Survey on Current Challenges and Way Forward
Faster and Lighter LLMs: A Survey on Current Challenges and Way Forward
Arnav Chavan
Raghav Magazine
Shubham Kushwaha
M. Debbah
Deepak Gupta
298
37
0
02 Feb 2024
No Free Prune: Information-Theoretic Barriers to Pruning at
  Initialization
No Free Prune: Information-Theoretic Barriers to Pruning at Initialization
Tanishq Kumar
Kevin Luo
Mark Sellke
261
9
0
02 Feb 2024
A practical existence theorem for reduced order models based on
  convolutional autoencoders
A practical existence theorem for reduced order models based on convolutional autoencoders
N. R. Franco
Simone Brugiapaglia
AI4CE
311
10
0
01 Feb 2024
EPSD: Early Pruning with Self-Distillation for Efficient Model
  Compression
EPSD: Early Pruning with Self-Distillation for Efficient Model Compression
Dong Chen
Ning Liu
Yichen Zhu
Zhengping Che
Rui Ma
Fachao Zhang
Xiaofeng Mou
Yi Chang
Jian Tang
237
8
0
31 Jan 2024
X-PEFT: eXtremely Parameter-Efficient Fine-Tuning for Extreme
  Multi-Profile Scenarios
X-PEFT: eXtremely Parameter-Efficient Fine-Tuning for Extreme Multi-Profile Scenarios
Namju Kwak
Taesup Kim
MoE
109
0
0
29 Jan 2024
A Comprehensive Survey of Compression Algorithms for Language Models
A Comprehensive Survey of Compression Algorithms for Language Models
Seungcheol Park
Jaehyeon Choi
Sojin Lee
U. Kang
MQ
329
20
0
27 Jan 2024
NACHOS: Neural Architecture Search for Hardware Constrained Early Exit Neural Networks
NACHOS: Neural Architecture Search for Hardware Constrained Early Exit Neural NetworksIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024
Matteo Gambella
Jary Pomponi
Simone Scardapane
Manuel Roveri
357
5
0
24 Jan 2024
Dynamic Layer Tying for Parameter-Efficient Transformers
Dynamic Layer Tying for Parameter-Efficient TransformersInternational Conference on Learning Representations (ICLR), 2024
Tamir David Hay
Lior Wolf
171
11
0
23 Jan 2024
APT: Adaptive Pruning and Tuning Pretrained Language Models for
  Efficient Training and Inference
APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and InferenceInternational Conference on Machine Learning (ICML), 2024
Bowen Zhao
Hannaneh Hajishirzi
Qingqing Cao
356
26
0
22 Jan 2024
Rethinking Centered Kernel Alignment in Knowledge Distillation
Rethinking Centered Kernel Alignment in Knowledge DistillationInternational Joint Conference on Artificial Intelligence (IJCAI), 2024
Zikai Zhou
Chunjiang Ge
Shitong Shao
Linrui Gong
Shaohui Lin
472
12
0
22 Jan 2024
PRILoRA: Pruned and Rank-Increasing Low-Rank Adaptation
PRILoRA: Pruned and Rank-Increasing Low-Rank AdaptationFindings (Findings), 2024
Nadav Benedek
Lior Wolf
163
8
0
20 Jan 2024
Manipulating Sparse Double Descent
Manipulating Sparse Double Descent
Ya Shi Zhang
169
0
0
19 Jan 2024
Enhancing Scalability in Recommender Systems through Lottery Ticket
  Hypothesis and Knowledge Distillation-based Neural Network Pruning
Enhancing Scalability in Recommender Systems through Lottery Ticket Hypothesis and Knowledge Distillation-based Neural Network Pruning
R. Rajaram
Manoj Bharadhwaj
VS Vasan
N. Pervin
118
1
0
19 Jan 2024
Model Compression Techniques in Biometrics Applications: A Survey
Model Compression Techniques in Biometrics Applications: A Survey
Eduarda Caldeira
Pedro C. Neto
Marco Huber
Naser Damer
Ana F. Sequeira
294
15
0
18 Jan 2024
FedLoGe: Joint Local and Generic Federated Learning under Long-tailed
  Data
FedLoGe: Joint Local and Generic Federated Learning under Long-tailed DataInternational Conference on Learning Representations (ICLR), 2024
Zikai Xiao
Zihan Chen
Liyinglan Liu
Yang Feng
Jian Wu
Wanlu Liu
Qiufeng Wang
Howard H. Yang
Zuo-Qiang Liu
FedML
292
13
0
17 Jan 2024
Stochastic Subnetwork Annealing: A Regularization Technique for Fine
  Tuning Pruned Subnetworks
Stochastic Subnetwork Annealing: A Regularization Technique for Fine Tuning Pruned Subnetworks
Tim Whitaker
Darrell Whitley
295
1
0
16 Jan 2024
Convolutional Neural Network Compression via Dynamic Parameter Rank
  Pruning
Convolutional Neural Network Compression via Dynamic Parameter Rank PruningIEEE Access (IEEE Access), 2024
Manish Sharma
Jamison Heard
Eli Saber
Panos P. Markopoulos
240
3
0
15 Jan 2024
Harnessing the Power of Beta Scoring in Deep Active Learning for
  Multi-Label Text Classification
Harnessing the Power of Beta Scoring in Deep Active Learning for Multi-Label Text ClassificationAAAI Conference on Artificial Intelligence (AAAI), 2024
Wei Tan
Ngoc Dang Nguyen
Lan Du
Wray Buntine
189
5
0
15 Jan 2024
Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized
  Large Language Models
Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Zhengxin Zhang
Dan Zhao
Xupeng Miao
Xupeng Miao
Qing Li
Yong Jiang
Zhihao Jia
MQ
198
11
0
13 Jan 2024
Always-Sparse Training by Growing Connections with Guided Stochastic Exploration
Always-Sparse Training by Growing Connections with Guided Stochastic Exploration
Mike Heddes
Narayan Srinivasa
T. Givargis
Alexandru Nicolau
524
1
0
12 Jan 2024
A Survey on Efficient Federated Learning Methods for Foundation Model
  Training
A Survey on Efficient Federated Learning Methods for Foundation Model TrainingInternational Joint Conference on Artificial Intelligence (IJCAI), 2024
Herbert Woisetschläger
Alexander Isenko
Shiqiang Wang
R. Mayer
Hans-Arno Jacobsen
FedML
293
39
0
09 Jan 2024
Previous
123...121314...424344
Next