The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks

9 March 2018
Jonathan Frankle
Michael Carbin

Papers citing "The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks"

50 / 2,184 papers shown
A Thorough Performance Benchmarking on Lightweight Embedding-based Recommender Systems
Hung Vinh Tran
Tong Chen
Quoc Viet Hung Nguyen
Zi-Rui Huang
Lizhen Cui
Hongzhi Yin
393
7
0
25 Jun 2024
Unlocking Continual Learning Abilities in Language Models
Wenyu Du
Shuang Cheng
Tongxu Luo
Zihan Qiu
Zeyu Huang
Ka Chun Cheung
Reynold Cheng
Jie Fu
KELM, CLL
245
18
0
25 Jun 2024
Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs
Ashwinee Panda
Berivan Isik
Xiangyu Qi
Sanmi Koyejo
Tsachy Weissman
Prateek Mittal
MoMe
397
27
0
24 Jun 2024
ShadowLLM: Predictor-based Contextual Sparsity for Large Language Models
Yash Akhauri
Ahmed F. AbouElhamayed
Jordan Dotzel
Zhiru Zhang
Alexander M Rush
Safeen Huda
Mohamed S. Abdelfattah
184
7
0
24 Jun 2024
Building on Efficient Foundations: Effectively Training LLMs with Structured Feedforward Layers
Xiuying Wei
Skander Moalla
Razvan Pascanu
Çağlar Gülçehre
298
3
0
24 Jun 2024
Towards Lightweight Graph Neural Network Search with Curriculum Graph Sparsification
Beini Xie
Heng Chang
Ziwei Zhang
Zeyang Zhang
Simin Wu
Xin Wang
Yuan Meng
Wenwu Zhu
238
9
0
24 Jun 2024
Synergizing Foundation Models and Federated Learning: A Survey
Shenghui Li
Fanghua Ye
Meng Fang
Jiaxu Zhao
Yun-Hin Chan
Edith C. -H. Ngai
Thiemo Voigt
AI4CE
256
9
0
18 Jun 2024
Sparsifying dimensionality reduction of PDE solution data with Bregman learning
SIAM Journal on Scientific Computing (SISC), 2024
T. J. Heeringa
Christoph Brune
Mengwu Guo
151
1
0
18 Jun 2024
Low-Redundant Optimization for Large Language Model Alignment
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Zhipeng Chen
Kun Zhou
Wayne Xin Zhao
Jingyuan Wang
Ji-Rong Wen
206
0
0
18 Jun 2024
MOYU: A Theoretical Study on Massive Over-activation Yielded Uplifts in LLMs
Chi Ma
Mincong Huang
Chao Wang
Yujie Wang
Lei Yu
AI4CE
221
0
0
18 Jun 2024
Finding Task-specific Subnetworks in Multi-task Spoken Language Understanding Model
Hayato Futami
Siddhant Arora
Yosuke Kashiwagi
E. Tsunoo
Shinji Watanabe
220
1
0
18 Jun 2024
Endor: Hardware-Friendly Sparse Format for Offloaded LLM Inference
Donghyeon Joo
Ramyad Hadidi
Soheil Feizi
Bahar Asgari
MQ
125
3
0
17 Jun 2024
DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling
Pala Tej Deep
Rishabh Bhardwaj
Soujanya Poria
MoMe
288
45
0
17 Jun 2024
Scale Equivariant Graph Metanetworks
Ioannis Kalogeropoulos
Giorgos Bouritsas
Yannis Panagakis
372
15
0
15 Jun 2024
DIET: Customized Slimming for Incompatible Networks in Sequential Recommendation
Kairui Fu
Shengyu Zhang
Zheqi Lv
Jingyuan Chen
Jiwei Li
188
7
0
13 Jun 2024
Pre-Training Identification of Graph Winning Tickets in Adaptive Spatial-Temporal Graph Neural Networks
Wenying Duan
Tianxiang Fang
Hong Rao
Xiaoxi He
184
2
0
12 Jun 2024
MoreauPruner: Robust Pruning of Large Language Models against Weight Perturbations
Zixiao Wang
Jingwei Zhang
Wenqian Zhao
Farzan Farnia
Bei Yu
AAML
162
4
0
11 Jun 2024
Sparse Bayesian Networks: Efficient Uncertainty Quantification in Medical Image Analysis
Zeinab Abboud
Herve Lombaert
Samuel Kadoury
UQCV
187
2
0
11 Jun 2024
Reinforced Compressive Neural Architecture Search for Versatile Adversarial Robustness
Dingrong Wang
Hitesh Sapkota
Zhiqiang Tao
Qi Yu
AAML
206
3
0
10 Jun 2024
Forget Sharpness: Perturbed Forgetting of Model Biases Within SAM Dynamics
Ankit Vani
Frederick Tung
Gabriel L. Oliveira
Hossein Sharifi-Noghabi
AAML
303
0
0
10 Jun 2024
Interpretability of Language Models via Task Spaces
Lucas Weber
Jaap Jumelet
Elia Bruni
Dieuwke Hupkes
201
4
0
10 Jun 2024
Geometric sparsification in recurrent neural networks
Wyatt Mackey
Ioannis Schizas
Jared Deighton
David L. Boothe, Jr.
Vasileios Maroulas
163
0
0
10 Jun 2024
Compute Better Spent: Replacing Dense Layers with Structured Matrices
Shikai Qiu
Andres Potapczynski
Marc Finzi
Micah Goldblum
Andrew Gordon Wilson
208
23
0
10 Jun 2024
Evaluating Zero-Shot Long-Context LLM Compression
Chenyu Wang
Yihan Wang
Kai Li
241
0
0
10 Jun 2024
Optimal Eye Surgeon: Finding Image Priors through Sparse Generators at Initialization
International Conference on Machine Learning (ICML), 2024
Avrajit Ghosh
Xitong Zhang
Kenneth K. Sun
Qing Qu
S. Ravishankar
Rongrong Wang
MedIm
247
6
0
07 Jun 2024
Federated LoRA with Sparse Communication
Kevin Kuo
Arian Raje
Kousik Rajesh
Virginia Smith
337
22
0
07 Jun 2024
Designs for Enabling Collaboration in Human-Machine Teaming via Interactive and Explainable Systems
Neural Information Processing Systems (NeurIPS), 2024
Rohan R. Paleja
Michael Munje
K. Chang
Reed Jensen
Matthew C. Gombolay
264
6
0
07 Jun 2024
Optimal Recurrent Network Topologies for Dynamical Systems Reconstruction
International Conference on Machine Learning (ICML), 2024
Christoph Jürgen Hemmer
Manuel Brenner
Florian Hess
Daniel Durstewitz
250
5
0
07 Jun 2024
MeGA: Merging Multiple Independently Trained Neural Networks Based on Genetic Algorithm
Daniel Yun
FedML, MoMe
228
1
0
07 Jun 2024
Concurrent Training and Layer Pruning of Deep Neural Networks
Valentin Frank Ingmar Guenter
Athanasios Sideris
3DPC
203
5
0
06 Jun 2024
Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning
Naibin Gu
Peng Fu
Xiyu Liu
Bowen Shen
Zheng Lin
Weiping Wang
185
15
0
06 Jun 2024
Zeroth-Order Fine-Tuning of LLMs with Extreme Sparsity
Wentao Guo
Jikai Long
Yimeng Zeng
Zirui Liu
Xinyu Yang
...
Osbert Bastani
Christopher De Sa
Xiaodong Yu
Beidi Chen
Zhaozhuo Xu
247
31
0
05 Jun 2024
Feature contamination: Neural networks learn uncorrelated features and fail to generalize
Tianren Zhang
Chujie Zhao
Guanyu Chen
Yizhou Jiang
Feng Chen
OOD, MLT, OODD
384
9
0
05 Jun 2024
Cyclic Sparse Training: Is it Enough?
Advait Gadhikar
Sree Harsha Nelaturu
R. Burkholz
CLL
398
1
0
04 Jun 2024
Linguistic Fingerprint in Transformer Models: How Language Variation Influences Parameter Selection in Irony Detection
Michele Mastromattei
Fabio Massimo Zanzotto
183
0
0
04 Jun 2024
SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining
Andi Han
Jiaxiang Li
Wei Huang
Mingyi Hong
Akiko Takeda
Pratik Jawanpuria
Bamdev Mishra
256
30
0
04 Jun 2024
Finding Lottery Tickets in Vision Models via Data-driven Spectral Foresight Pruning
Leonardo Iurada
Marco Ciccone
Tatiana Tommasi
189
12
0
03 Jun 2024
Sparser, Better, Deeper, Stronger: Improving Sparse Training with Exact Orthogonal Initialization
A. Nowak
Lukasz Gniecki
Filip Szatkowski
Jacek Tabor
243
2
0
03 Jun 2024
Tilting the Odds at the Lottery: the Interplay of Overparameterisation and Curricula in Neural Networks
Stefano Sarao Mannelli
Yaraslau Ivashinka
Andrew M. Saxe
Luca Saglietti
215
8
0
03 Jun 2024
What makes unlearning hard and what to do about it
Kairan Zhao
M. Kurmanji
George-Octavian Barbulescu
Eleni Triantafillou
Peter Triantafillou
MU
271
49
0
03 Jun 2024
Diverse Subset Selection via Norm-Based Sampling and Orthogonality
Noga Bar
Raja Giryes
CVBM
342
1
0
03 Jun 2024
Efficient Neural Light Fields (ENeLF) for Mobile Devices
Austin Peng
167
1
0
02 Jun 2024
Effective Interplay between Sparsity and Quantization: From Theory to Practice
Simla Burcu Harma
Ayan Chakraborty
Elizaveta Kostenok
Danila Mishin
Dongho Ha
...
Martin Jaggi
Ming Liu
Yunho Oh
Suvinay Subramanian
Amir Yazdanbakhsh
MQ
309
19
0
31 May 2024
Investigating Calibration and Corruption Robustness of Post-hoc Pruned Perception CNNs: An Image Classification Benchmark Study
Pallavi Mitra
Gesina Schwalbe
Nadja Klein
AAML
185
4
0
31 May 2024
The EarlyBird Gets the WORM: Heuristically Accelerating EarlyBird Convergence
Adithya Vasudev
181
0
0
31 May 2024
Enhancing Generative Molecular Design via Uncertainty-guided Fine-tuning of Variational Autoencoders
Nafiz Abeer
Sanket Jantre
Nathan M. Urban
Byung-Jun Yoon
270
1
0
31 May 2024
Occam Gradient Descent
B. N. Kausik
ODL, VLM
307
0
0
30 May 2024
Dual sparse training framework: inducing activation map sparsity via Transformed $\ell1$ regularization
Xiaolong Yu
Cong Tian
172
3
0
30 May 2024
ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning
Ruchika Chavhan
Da Li
Timothy M. Hospedales
233
28
0
29 May 2024
EdgeSight: Enabling Modeless and Cost-Efficient Inference at the Edge
ChonLam Lao
Jiaqi Gao
Ganesh Ananthanarayanan
Aditya Akella
Minlan Yu
VLM
201
0
0
29 May 2024