ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.03635
  4. Cited By
The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks
v1v2v3v4v5 (latest)

The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks

9 March 2018
Jonathan Frankle
Michael Carbin
ArXiv (abs)PDFHTML

Papers citing "The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks"

50 / 2,186 papers shown
Efficient Training of Large Vision Models via Advanced Automated
  Progressive Learning
Efficient Training of Large Vision Models via Advanced Automated Progressive Learning
Changlin Li
Jiawei Zhang
Sihao Lin
Zongxin Yang
Junwei Liang
Xiaodan Liang
Xiaojun Chang
VLM
265
2
0
06 Sep 2024
WaterMAS: Sharpness-Aware Maximization for Neural Network Watermarking
WaterMAS: Sharpness-Aware Maximization for Neural Network WatermarkingInternational Conference on Pattern Recognition (ICPR), 2024
Carl De Sousa Trias
Mihai P. Mitrea
Attilio Fiandrotti
Marco Cagnazzo
Sumanta Chaudhuri
Enzo Tartaglione
AAML
230
1
0
05 Sep 2024
Modularity in Transformers: Investigating Neuron Separability &
  Specialization
Modularity in Transformers: Investigating Neuron Separability & Specialization
Nicholas Pochinkov
Thomas Jones
Mohammed Rashidur Rahman
175
0
0
30 Aug 2024
Learning effective pruning at initialization from iterative pruning
Learning effective pruning at initialization from iterative pruning
Shengkai Liu
Yaofeng Cheng
Fusheng Zha
Wei Guo
Lining Sun
Zhenshan Bing
Chenguang Yang
240
1
0
27 Aug 2024
3D Point Cloud Network Pruning: When Some Weights Do not Matter
3D Point Cloud Network Pruning: When Some Weights Do not MatterBritish Machine Vision Conference (BMVC), 2024
Amrijit Biswas
M. Hossain
M. M. L. Elahi
A. Cheraghian
Fuad Rahman
Nabeel Mohammed
Shafin Rahman
3DPC
293
5
0
26 Aug 2024
Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune
  CNNs and Transformers
Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers
Sayed Mohammad Vakilzadeh Hatefi
Maximilian Dreyer
Reduan Achtibat
Thomas Wiegand
Wojciech Samek
Sebastian Lapuschkin
ViT
229
10
0
22 Aug 2024
Weight Scope Alignment: A Frustratingly Easy Method for Model Merging
Weight Scope Alignment: A Frustratingly Easy Method for Model MergingEuropean Conference on Artificial Intelligence (ECAI), 2024
Yichu Xu
Xin-Chun Li
Le Gan
De-Chuan Zhan
MoMe
293
2
0
22 Aug 2024
A Tighter Complexity Analysis of SparseGPT
A Tighter Complexity Analysis of SparseGPT
Xiaoyu Li
Yingyu Liang
Zhenmei Shi
Zhao Song
306
25
0
22 Aug 2024
A Greedy Hierarchical Approach to Whole-Network Filter-Pruning in CNNs
A Greedy Hierarchical Approach to Whole-Network Filter-Pruning in CNNs
Kiran Purohit
Anurag Parvathgari
Sourangshu Bhattacharya
VLM
238
1
0
22 Aug 2024
Transformers As Approximations of Solomonoff Induction
Transformers As Approximations of Solomonoff InductionInternational Conference on Neural Information Processing (ICONIP), 2024
Nathan Young
Michael Witbrock
108
1
0
22 Aug 2024
Approaching Deep Learning through the Spectral Dynamics of Weights
Approaching Deep Learning through the Spectral Dynamics of Weights
David Yunis
Kumar Kshitij Patel
Samuel Wheeler
Pedro H. P. Savarese
Gal Vardi
Karen Livescu
Michael Maire
Matthew R. Walter
327
13
0
21 Aug 2024
On Learnable Parameters of Optimal and Suboptimal Deep Learning Models
On Learnable Parameters of Optimal and Suboptimal Deep Learning ModelsInternational Conference on Neural Information Processing (ICONIP), 2024
Ziwei Zheng
Huizhi Liang
V. Snás̃el
Vito Latora
Panos Pardalos
Giuseppe Nicosia
Varun Ojha
126
2
0
21 Aug 2024
First Activations Matter: Training-Free Methods for Dynamic Activation
  in Large Language Models
First Activations Matter: Training-Free Methods for Dynamic Activation in Large Language Models
Chi Ma
Mincong Huang
Ying Zhang
Chao Wang
Yujie Wang
Lei Yu
Chuan Liu
Wei Lin
AI4CELLMSV
250
3
0
21 Aug 2024
Mask in the Mirror: Implicit Sparsification
Mask in the Mirror: Implicit SparsificationInternational Conference on Learning Representations (ICLR), 2024
Tom Jacobs
R. Burkholz
488
6
0
19 Aug 2024
Activated Parameter Locating via Causal Intervention for Model Merging
Activated Parameter Locating via Causal Intervention for Model Merging
Fanshuang Kong
Richong Zhang
Ziqiao Wang
MoMe
162
3
0
18 Aug 2024
Antidote: Post-fine-tuning Safety Alignment for Large Language Models against Harmful Fine-tuning
Antidote: Post-fine-tuning Safety Alignment for Large Language Models against Harmful Fine-tuning
Tiansheng Huang
Gautam Bhattacharya
Pratik Joshi
Josh Kimball
Ling Liu
AAMLMoMe
592
48
0
18 Aug 2024
Compress and Compare: Interactively Evaluating Efficiency and Behavior
  Across ML Model Compression Experiments
Compress and Compare: Interactively Evaluating Efficiency and Behavior Across ML Model Compression ExperimentsIEEE Transactions on Visualization and Computer Graphics (TVCG), 2024
Angie Boggust
Venkatesh Sivaraman
Yannick Assogba
Donghao Ren
Dominik Moritz
Fred Hohman
VLM
224
10
0
06 Aug 2024
Masked Random Noise for Communication Efficient Federaetd Learning
Masked Random Noise for Communication Efficient Federaetd LearningACM Multimedia (MM), 2024
Shiwei Li
Yingyi Cheng
Haozhao Wang
Xing Tang
Shijie Xu
Weihong Luo
Yuhua Li
Dugang Liu
Xiuqiang He
and Ruixuan Li
FedML
208
9
0
06 Aug 2024
Latent-INR: A Flexible Framework for Implicit Representations of Videos
  with Discriminative Semantics
Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative SemanticsEuropean Conference on Computer Vision (ECCV), 2024
Shishira R. Maiya
Anubhav Gupta
M. Gwilliam
Max Ehrlich
Abhinav Shrivastava
195
6
1
05 Aug 2024
Pruning Large Language Models with Semi-Structural Adaptive Sparse
  Training
Pruning Large Language Models with Semi-Structural Adaptive Sparse Training
Weiyu Huang
Yuezhou Hu
Guohao Jian
Jun Zhu
Jianfei Chen
323
17
0
30 Jul 2024
Machine Unlearning in Generative AI: A Survey
Machine Unlearning in Generative AI: A Survey
Zheyuan Liu
Guangyao Dou
Zhaoxuan Tan
Yijun Tian
Meng Jiang
MU
327
45
0
30 Jul 2024
ThinK: Thinner Key Cache by Query-Driven Pruning
ThinK: Thinner Key Cache by Query-Driven PruningInternational Conference on Learning Representations (ICLR), 2024
Yuhui Xu
Zhanming Jie
Hanze Dong
Lei Wang
Xudong Lu
Aojun Zhou
Amrita Saha
Caiming Xiong
Doyen Sahoo
525
41
0
30 Jul 2024
FIARSE: Model-Heterogeneous Federated Learning via Importance-Aware
  Submodel Extraction
FIARSE: Model-Heterogeneous Federated Learning via Importance-Aware Submodel ExtractionNeural Information Processing Systems (NeurIPS), 2024
Feijie Wu
Xingchen Wang
Yaqing Wang
Tianci Liu
Lu Su
Jing Gao
FedML
309
23
0
28 Jul 2024
Towards the Dynamics of a DNN Learning Symbolic Interactions
Towards the Dynamics of a DNN Learning Symbolic InteractionsNeural Information Processing Systems (NeurIPS), 2024
Qihan Ren
Yang Xu
Junpeng Zhang
Yue Xin
Dongrui Liu
Quanshi Zhang
234
12
0
27 Jul 2024
Greedy Output Approximation: Towards Efficient Structured Pruning for
  LLMs Without Retraining
Greedy Output Approximation: Towards Efficient Structured Pruning for LLMs Without Retraining
Jianwei Li
Yijun Dong
Qi Lei
374
8
0
26 Jul 2024
Finite Neural Networks as Mixtures of Gaussian Processes: From Provable Error Bounds to Prior Selection
Finite Neural Networks as Mixtures of Gaussian Processes: From Provable Error Bounds to Prior Selection
Steven Adams
A. Patané
Morteza Lahijanian
Luca Laurenti
BDL
374
8
0
26 Jul 2024
Nerva: a Truly Sparse Implementation of Neural Networks
Nerva: a Truly Sparse Implementation of Neural Networks
Wieger Wesselink
Bram Grooten
Qiao Xiao
Cássio Machado de Campos
Mykola Pechenizkiy
172
4
0
24 Jul 2024
Self-driving lab discovers principles for steering spontaneous emission
Self-driving lab discovers principles for steering spontaneous emission
Saaketh Desai
S. Addamane
Jeffery Y. Tsao
Igal Brener
Rémi Dingreville
Prasad P Iyer
150
0
0
22 Jul 2024
Knowledge Mechanisms in Large Language Models: A Survey and Perspective
Knowledge Mechanisms in Large Language Models: A Survey and Perspective
Meng Wang
Yunzhi Yao
Ziwen Xu
Shuofei Qiao
Shumin Deng
...
Yong Jiang
Pengjun Xie
Fei Huang
Huajun Chen
Ningyu Zhang
332
60
0
22 Jul 2024
Out of spuriousity: Improving robustness to spurious correlations
  without group annotations
Out of spuriousity: Improving robustness to spurious correlations without group annotations
Phuong Quynh Le
Jorg Schlotterer
Christin Seifert
CMLOOD
187
4
0
20 Jul 2024
LORTSAR: Low-Rank Transformer for Skeleton-based Action Recognition
LORTSAR: Low-Rank Transformer for Skeleton-based Action Recognition
Soroush Oraki
Harry Zhuang
Jie Liang
156
2
0
19 Jul 2024
Training-Free Model Merging for Multi-target Domain Adaptation
Training-Free Model Merging for Multi-target Domain Adaptation
Wenyi Li
Huan-ang Gao
Mingju Gao
Beiwen Tian
Rong Zhi
Hao Zhao
MoMe
238
11
0
18 Jul 2024
Accurate Mapping of RNNs on Neuromorphic Hardware with Adaptive Spiking
  Neurons
Accurate Mapping of RNNs on Neuromorphic Hardware with Adaptive Spiking Neurons
Gauthier Boeshertz
Giacomo Indiveri
M. Nair
Alpha Renner
163
3
0
18 Jul 2024
Tiled Bit Networks: Sub-Bit Neural Network Compression Through Reuse of
  Learnable Binary Vectors
Tiled Bit Networks: Sub-Bit Neural Network Compression Through Reuse of Learnable Binary Vectors
Matt Gorbett
Hossein Shirazi
Indrakshi Ray
MQ
307
1
0
16 Jul 2024
Continual Deep Learning on the Edge via Stochastic Local Competition
  among Subnetworks
Continual Deep Learning on the Edge via Stochastic Local Competition among Subnetworks
Theodoros Christophides
Kyriakos Tolias
S. Chatzis
254
0
0
15 Jul 2024
From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories, and Applications
From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories, and Applications
Ajay Jaiswal
Yifan Wang
Zhenyu Zhang
Shiwei Liu
Runjin Chen
Jiawei Zhao
A. Grama
Yuandong Tian
Zinan Lin
282
15
0
15 Jul 2024
Dynamic Encoder Size Based on Data-Driven Layer-wise Pruning for Speech
  Recognition
Dynamic Encoder Size Based on Data-Driven Layer-wise Pruning for Speech Recognition
Jingjing Xu
Wei Zhou
Zijian Yang
Eugen Beck
Ralf Schlueter
292
5
0
10 Jul 2024
LDGCN: An Edge-End Lightweight Dual GCN Based on Single-Channel EEG for
  Driver Drowsiness Monitoring
LDGCN: An Edge-End Lightweight Dual GCN Based on Single-Channel EEG for Driver Drowsiness Monitoring
Jingwei Huang
Chuansheng Wang
Jiayan Huang
Haoyi Fan
Antoni Grau
Fuquan Zhang
183
0
0
08 Jul 2024
The Impact of Quantization and Pruning on Deep Reinforcement Learning
  Models
The Impact of Quantization and Pruning on Deep Reinforcement Learning Models
Heng Lu
Mehdi Alemi
Reza Rawassizadeh
222
5
0
05 Jul 2024
Revealing the Utilized Rank of Subspaces of Learning in Neural Networks
Revealing the Utilized Rank of Subspaces of Learning in Neural Networks
Isha Garg
Christian Koguchi
Eshan Verma
Daniel Ulbricht
135
1
0
05 Jul 2024
Sparsest Models Elude Pruning: An Exposé of Pruning's Current
  Capabilities
Sparsest Models Elude Pruning: An Exposé of Pruning's Current Capabilities
Stephen Zhang
Vardan Papyan
255
0
0
04 Jul 2024
SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning
SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning
Bac Nguyen
Stefan Uhlich
Fabien Cardinaux
Lukas Mauch
Marzieh Edraki
Aaron Courville
OODDCLLVLM
379
9
0
03 Jul 2024
Efficient DNN-Powered Software with Fair Sparse Models
Efficient DNN-Powered Software with Fair Sparse Models
Xuanqi Gao
Weipeng Jiang
Juan Zhai
Shiqing Ma
Xiaoyu Zhang
Chao Shen
214
0
0
03 Jul 2024
LPViT: Low-Power Semi-structured Pruning for Vision Transformers
LPViT: Low-Power Semi-structured Pruning for Vision Transformers
Kaixin Xu
Zhe Wang
Chunyun Chen
Xue Geng
Jie Lin
Xulei Yang
Ruibing Jin
Min Wu
Xiaoli Li
Weisi Lin
ViTVLM
658
17
0
02 Jul 2024
Increasing Model Capacity for Free: A Simple Strategy for Parameter
  Efficient Fine-tuning
Increasing Model Capacity for Free: A Simple Strategy for Parameter Efficient Fine-tuning
Haobo Song
Hao Zhao
Soumajit Majumder
Tao Lin
183
7
0
01 Jul 2024
How Does Overparameterization Affect Features?
How Does Overparameterization Affect Features?
Ahmet Cagri Duzgun
Samy Jelassi
Yuanzhi Li
174
0
0
01 Jul 2024
Neural Networks Trained by Weight Permutation are Universal Approximators
Neural Networks Trained by Weight Permutation are Universal Approximators
Yongqiang Cai
Gaohang Chen
Zhonghua Qiao
533
2
0
01 Jul 2024
RepAct: The Re-parameterizable Adaptive Activation Function
RepAct: The Re-parameterizable Adaptive Activation Function
Xian Wu
Qingchuan Tao
Shuang Wang
315
0
0
28 Jun 2024
FedMap: Iterative Magnitude-Based Pruning for Communication-Efficient
  Federated Learning
FedMap: Iterative Magnitude-Based Pruning for Communication-Efficient Federated Learning
Alexander Herzog
Robbie Southam
Ioannis Mavromatis
Aftab Khan
136
2
0
27 Jun 2024
Infinite Width Models That Work: Why Feature Learning Doesn't Matter as
  Much as You Think
Infinite Width Models That Work: Why Feature Learning Doesn't Matter as Much as You Think
Luke Sernau
120
0
0
27 Jun 2024
Previous
123...8910...424344
Next