1803.03635
The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks
9 March 2018
Jonathan Frankle
Michael Carbin
Papers citing "The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks"
50 / 2,187 papers shown
Random Sparse Lifts: Construction, Analysis and Convergence of finite sparse networks
International Conference on Learning Representations (ICLR), 2025
David A. R. Robin
Kevin Scaman
Marc Lelarge
128
0
0
10 Jan 2025
Tailored-LLaMA: Optimizing Few-Shot Learning in Pruned LLaMA Models with Task-Specific Prompts
European Conference on Artificial Intelligence (ECAI), 2024
Danyal Aftab
Steven Davy
ALM
272
3
0
10 Jan 2025
Vision Transformer Neural Architecture Search for Out-of-Distribution Generalization: Benchmark and Insights
Neural Information Processing Systems (NeurIPS), 2025
Sy-Tuyen Ho
Tuan Van Vo
Somayeh Ebrahimkhani
Ngai-Man Cheung
300
1
0
08 Jan 2025
Hierarchical Light Transformer Ensembles for Multimodal Trajectory Forecasting
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Adrien Lafage
Mathieu Barbier
Gianni Franchi
David Filliat
423
7
0
08 Jan 2025
Training-free Heterogeneous Model Merging
Zhengqi Xu
Han Zheng
Jie Song
Li Sun
Weilong Dai
MoMe
492
2
0
03 Jan 2025
Lillama: Large Language Models Compression via Low-Rank Feature Distillation
Yaya Sy
Christophe Cerisara
Irina Illina
MQ
310
0
0
31 Dec 2024
Improving Quantization-aware Training of Low-Precision Network via Block Replacement on Full-Precision Counterpart
Chengting Yu
Shu Yang
Fengzhao Zhang
Hanzhi Ma
Aili Wang
Er-ping Li
MQ
271
7
0
20 Dec 2024
A Comparative Study of Pruning Methods in Transformer-based Time Series Forecasting
Nicholas Kiefer
Arvid Weyrauch
Muhammed Öz
Achim Streit
Markus Gotz
Charlotte Debus
AI4TS
270
1
0
17 Dec 2024
No More Adam: Learning Rate Scaling at Initialization is All You Need
Minghao Xu
Lichuan Xiang
Xu Cai
Hongkai Wen
341
4
0
16 Dec 2024
RWKV-edge: Deeply Compressed RWKV for Resource-Constrained Devices
Wonkyo Choe
Yangfeng Ji
F. Lin
512
1
0
14 Dec 2024
MOFHEI: Model Optimizing Framework for Fast and Efficient Homomorphically Encrypted Neural Network Inference
International Conference on Trust, Privacy and Security in Intelligent Systems and Applications (ICPSISA), 2024
Parsa Ghazvinian
Robert Podschwadt
Prajwal Panzade
Mohammad H. Rafiei
Daniel Takabi
264
0
0
10 Dec 2024
Fast Track to Winning Tickets: Repowering One-Shot Pruning for Graph Neural Networks
AAAI Conference on Artificial Intelligence (AAAI), 2024
Xinfeng Li
Guibin Zhang
Haoran Yang
Dawei Cheng
308
0
0
10 Dec 2024
Training MLPs on Graphs without Supervision
Web Search and Data Mining (WSDM), 2024
Zehong Wang
Zheyuan Zhang
Chuxu Zhang
Yanfang Ye
265
14
0
05 Dec 2024
Is Oracle Pruning the True Oracle?
Sicheng Feng
Keda Tao
Haoyu Wang
VLM
351
2
0
28 Nov 2024
On the Effectiveness of Incremental Training of Large Language Models
Miles Q. Li
Benjamin C. M. Fung
Shih-Chia Huang
CLL
AIFin
168
1
0
27 Nov 2024
Preserving Deep Representations In One-Shot Pruning: A Hessian-Free Second-Order Optimization Framework
International Conference on Learning Representations (ICLR), 2024
Ryan Lucas
Rahul Mazumder
313
6
0
27 Nov 2024
Multi-Label Bayesian Active Learning with Inter-Label Relationships
Conference on Uncertainty in Artificial Intelligence (UAI), 2024
Yuanyuan Qi
Jueqing Lu
Xiaohao Yang
Joanne Enticott
Lan Du
487
0
0
26 Nov 2024
DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Hexuan Deng
Wenxiang Jiao
Xuebo Liu
Min Zhang
Zhaopeng Tu
VLM
573
2
0
21 Nov 2024
Pushing the Limits of Sparsity: A Bag of Tricks for Extreme Pruning
Andy Li
A. Durrant
Milan Markovic
Lu Yin
Souvik Kundu
Tianlong Chen
Georgios Leontidis
452
1
0
20 Nov 2024
Probe-Me-Not: Protecting Pre-trained Encoders from Malicious Probing
Network and Distributed System Security Symposium (NDSS), 2024
Ruyi Ding
Tong Zhou
Lili Su
A. A. Ding
Xiaolin Xu
Yunsi Fei
AAML
351
2
0
19 Nov 2024
FGP: Feature-Gradient-Prune for Efficient Convolutional Layer Pruning
Qingsong Lv
Jiasheng Sun
Sheng Zhou
Wei Wei
Liangcheng Li
Yun Gao
Sun Qiao
Jie Song
Jiajun Bu
AAML
246
2
0
19 Nov 2024
Electrostatic Force Regularization for Neural Structured Pruning
Abdesselam Ferdi
A. Taleb-Ahmed
A. Nakib
Youcef Ferdi
335
1
0
17 Nov 2024
RedTest: Towards Measuring Redundancy in Deep Neural Networks Effectively
Yao Lu
Peixin Zhang
Jingyi Wang
Lei Ma
Xiaoniu Yang
Qi Xuan
240
7
0
15 Nov 2024
On the Surprising Effectiveness of Attention Transfer for Vision Transformers
Neural Information Processing Systems (NeurIPS), 2024
Alexander C. Li
Yuandong Tian
Bin Chen
Deepak Pathak
Xinlei Chen
208
9
0
14 Nov 2024
Complexity-Aware Training of Deep Neural Networks for Optimal Structure Discovery
Valentin Frank Ingmar Guenter
Athanasios Sideris
CVBM
292
1
0
14 Nov 2024
FRUGAL: Memory-Efficient Optimization by Reducing State Overhead for Scalable Training
Philip Zmushko
Aleksandr Beznosikov
Martin Takáč
Samuel Horváth
304
4
0
12 Nov 2024
Zeroth-Order Adaptive Neuron Alignment Based Pruning without Re-Training
Elia Cunegatti
Leonardo Lucio Custode
Giovanni Iacca
641
2
0
11 Nov 2024
Towards Establishing Guaranteed Error for Learned Database Operations
International Conference on Learning Representations (ICLR), 2024
Sepanta Zeighami
Cyrus Shahabi
136
4
0
09 Nov 2024
Poor Man's Training on MCUs: A Memory-Efficient Quantized Back-Propagation-Free Approach
Yequan Zhao
Hai Li
Ian Young
Zheng Zhang
MQ
381
3
0
07 Nov 2024
Finding Strong Lottery Ticket Networks with Genetic Algorithms
International Joint Conference on Computational Intelligence (IJCCI), 2024
Philipp Altmann
Julian Schonberger
Maximilian Zorn
Thomas Gabor
204
3
0
07 Nov 2024
Neural Fingerprints for Adversarial Attack Detection
Haim Fisher
Moni Shahar
Yehezkel S. Resheff
AAML
152
1
0
07 Nov 2024
Learning Morphisms with Gauss-Newton Approximation for Growing Networks
Neal Lawton
Aram Galstyan
Greg Ver Steeg
205
0
0
07 Nov 2024
Sparse Orthogonal Parameters Tuning for Continual Learning
Hai-Jian Ke
Kun-Peng Ning
Yu-Yang Liu
Jia-Yu Yao
Yong-Hong Tian
Li Yuan
CLL
306
2
0
05 Nov 2024
Navigating Extremes: Dynamic Sparsity in Large Output Spaces
Neural Information Processing Systems (NeurIPS), 2024
Nasib Ullah
Erik Schultheis
Mike Lasby
Yani Andrew Ioannou
Rohit Babbar
413
2
0
05 Nov 2024
Expanding Sparse Tuning for Low Memory Usage
Neural Information Processing Systems (NeurIPS), 2024
Shufan Shen
Junshu Sun
Xiangyang Ji
Qingming Huang
Shuhui Wang
329
9
0
04 Nov 2024
Double Descent Meets Out-of-Distribution Detection: Theoretical Insights and Empirical Analysis on the role of model complexity
Mouin Ben Ammar
David Brellmann
Arturo Mendoza
Antoine Manzanera
Gianni Franchi
OODD
395
0
0
04 Nov 2024
Decoupling Dark Knowledge via Block-wise Logit Distillation for Feature-level Alignment
IEEE Transactions on Artificial Intelligence (IEEE TAI), 2024
Chengting Yu
Fengzhao Zhang
Ruizhe Chen
Zuozhu Liu
Shurun Tan
Er-ping Li
Aili Wang
348
5
0
03 Nov 2024
Magnitude Pruning of Large Pretrained Transformer Models with a Mixture Gaussian Prior
Journal of Data Science (JDS), 2024
Mingxuan Zhang
Y. Sun
F. Liang
298
0
0
01 Nov 2024
Chasing Better Deep Image Priors between Over- and Under-parameterization
Qiming Wu
Xiaohan Chen
Lezhi Li
Zhangyang Wang
333
1
0
31 Oct 2024
Neural Network Matrix Product Operator: A Multi-Dimensionally Integrable Machine Learning Potential
Physical Review Research (PRR), 2024
Kentaro Hino
Yuki Kurashige
341
0
0
31 Oct 2024
Mutual Information Preserving Neural Network Pruning
Charles Westphal
Stephen Hailes
Mirco Musolesi
469
4
0
31 Oct 2024
BLAST: Block-Level Adaptive Structured Matrices for Efficient Deep Neural Network Inference
Neural Information Processing Systems (NeurIPS), 2024
Changwoo Lee
Soo Min Kwon
Qing Qu
Hun-Seok Kim
285
2
0
28 Oct 2024
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA
International Conference on Learning Representations (ICLR), 2024
Sangmin Bae
Adam Fisch
Hrayr Harutyunyan
Ziwei Ji
Seungyeon Kim
Tal Schuster
KELM
396
20
0
28 Oct 2024
FuseFL: One-Shot Federated Learning through the Lens of Causality with Progressive Model Fusion
Neural Information Processing Systems (NeurIPS), 2024
Zhenheng Tang
Yonggang Zhang
Peijie Dong
Yiu-ming Cheung
Amelie Chi Zhou
Bo Han
Xiaowen Chu
FedML
MoMe
AI4CE
329
15
0
27 Oct 2024
Uncovering Capabilities of Model Pruning in Graph Contrastive Learning
ACM Multimedia (MM), 2024
Wu Junran
Chen Xueyuan
Li Shangzhe
254
2
0
27 Oct 2024
Model merging with SVD to tie the Knots
International Conference on Learning Representations (ICLR), 2024
George Stoica
Pratik Ramesh
B. Ecsedi
Leshem Choshen
Judy Hoffman
MoMe
333
45
0
25 Oct 2024
Expose Before You Defend: Unifying and Enhancing Backdoor Defenses via Exposed Models
Yige Li
Hanxun Huang
Jiaming Zhang
Xingjun Ma
Yu-Gang Jiang
AAML
190
2
0
25 Oct 2024
CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Activation
Qinsi Wang
Saeed Vahidian
Hancheng Ye
Jianyang Gu
Jianyi Zhang
Yiran Chen
108
12
0
23 Oct 2024
Local Contrastive Editing of Gender Stereotypes
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Marlene Lutz
Rochelle Choenni
M. Strohmaier
Anne Lauscher
295
2
0
23 Oct 2024
Beware of Calibration Data for Pruning Large Language Models
International Conference on Learning Representations (ICLR), 2024
Yixin Ji
Yang Xiang
Juntao Li
Qingrong Xia
Ping Li
Xinyu Duan
Zhefeng Wang
Min Zhang
322
7
0
23 Oct 2024