1803.03635
The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks
9 March 2018
Jonathan Frankle
Michael Carbin
Papers citing
"The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks"
50 / 2,185 papers shown
Title
Mitigating Catastrophic Forgetting in Target Language Adaptation of LLMs via Source-Shielded Updates
Atsuki Yamaguchi
Terufumi Morishita
Aline Villavicencio
Nikolaos Aletras
CLL
169
0
0
04 Dec 2025
Lean Unet: A Compact Model for Image Segmentation
Ture Hassler
Ida Åkerholm
Marcus Nordström
Gabriele Balletti
Orcun Goksel
0
0
0
03 Dec 2025
Understanding and Harnessing Sparsity in Unified Multimodal Models
Shwai He
Chaorui Deng
Ang Li
Shen Yan
MoE
204
1
0
02 Dec 2025
Parameter Reduction Improves Vision Transformers: A Comparative Study of Sharing and Width Reduction
Anantha Padmanaban Krishna Kumar
ViT
44
0
0
30 Nov 2025
Forgetting by Pruning: Data Deletion in Join Cardinality Estimation
Chaowei He
Yuanjun Liu
Qingzhi Ma
Shenyuan Ren
Xizhao Luo
Lei Zhao
An Liu
MU
156
0
0
25 Nov 2025
ModHiFi: Identifying High Fidelity predictive components for Model Modification
Dhruva Kashyap
Chaitanya Murti
Pranav K Nayak
Tanay Narshana
Chiranjib Bhattacharyya
116
0
0
24 Nov 2025
Exploiting the Experts: Unauthorized Compression in MoE-LLMs
Pinaki Prasad Guha Neogi
Ahmad Mohammadshirazi
Dheeraj Kulshrestha
R. Ramnath
MoE
120
0
0
22 Nov 2025
Layer-wise Weight Selection for Power-Efficient Neural Network Acceleration
Jiaxun Fang
Grace Li Zhang
Shaoyi Huang
MQ
275
0
0
21 Nov 2025
E³-Pruner: Towards Efficient, Economical, and Effective Layer Pruning for Large Language Models
Tao Yuan
Haoli Bai
Yinfei Pan
Xuyang Cao
Tianyu Zhang
Lu Hou
Ting Hu
Xianzhi Yu
VLM
195
0
0
21 Nov 2025
Teacher-Guided One-Shot Pruning via Context-Aware Knowledge Distillation
Md. Samiul Alim
Sharjil Khan
Amrijit Biswas
Fuad Rahman
Shafin Rahman
Nabeel Mohammed
VLM
147
0
0
20 Nov 2025
Breaking Expert Knowledge Limits: Self-Pruning for Large Language Models
Haidong Kang
Lihong Lin
Enneng Yang
Hongning Dai
Hao Wang
LRM
201
0
0
19 Nov 2025
Dynamic Black-box Backdoor Attacks on IoT Sensory Data
International Conference on Trust, Privacy and Security in Intelligent Systems and Applications (ICPSISA), 2024
Ajesh Koyatan Chathoth
Stephen Lee
AAML
156
2
0
18 Nov 2025
Weight-sparse transformers have interpretable circuits
Leo Gao
Achyuta Rajaram
Jacob Coxon
Soham V. Govande
Bowen Baker
Dan Mossing
MILM
216
4
0
17 Nov 2025
Efficient Mathematical Reasoning Models via Dynamic Pruning and Knowledge Distillation
Fengming Yu
Qingyu Meng
Haiwei Pan
Kejia Zhang
LRM
132
0
0
15 Nov 2025
Which Sparse Autoencoder Features Are Real? Model-X Knockoffs for False Discovery Rate Control
Tsogt-Ochir Enkhbayar
116
1
0
12 Nov 2025
StableMorph: High-Quality Face Morph Generation with Stable Diffusion
Wassim Kabbani
Kiran Raja
Raghavendra Ramachandra
C. Busch
80
0
0
11 Nov 2025
Hardware-Aware YOLO Compression for Low-Power Edge AI on STM32U5 for Weeds Detection in Digital Agriculture
Charalampos S. Kouzinopoulos
Yuri Manna
188
0
0
11 Nov 2025
CAMP-HiVe: Cyclic Pair Merging based Efficient DNN Pruning with Hessian-Vector Approximation for Resource-Constrained Systems
M. H. Uddin
Sai Krishna Ghanta
Liam Seymour
S. Baidya
195
0
0
09 Nov 2025
Models Got Talent: Identifying High Performing Wearable Human Activity Recognition Models Without Training
Richard Goldman
Varun Komperla
Thomas Ploetz
H. Haresamudram
142
0
0
08 Nov 2025
SiamMM: A Mixture Model Perspective on Deep Unsupervised Learning
Xiaodong Wang
Jing Huang
Kevin J. Liang
SSL
433
0
0
07 Nov 2025
APP: Accelerated Path Patching with Task-Specific Pruning
Frauke Andersen
William Rudman
Ruochen Zhang
Carsten Eickhoff
64
0
0
07 Nov 2025
The Strong Lottery Ticket Hypothesis for Multi-Head Attention Mechanisms
Hikari Otsuka
Daiki Chijiwa
Yasuyuki Okoshi
Daichi Fujiki
Susumu Takeuchi
Masato Motomura
164
0
0
06 Nov 2025
Sharp Minima Can Generalize: A Loss Landscape Perspective On Data
Raymond Fan
Bryce Sandlund
Lin Myat Ko
96
0
0
06 Nov 2025
TwIST: Rigging the Lottery in Transformers with Independent Subnetwork Training
Michael Menezes
Barbara Su
Xinze Feng
Yehya Farhat
Hamza Shili
Anastasios Kyrillidis
164
1
0
06 Nov 2025
Random Initialization of Gated Sparse Adapters
Vi Retault
Yohaï-Eliel Berreby
CLL
MoE
196
0
0
03 Nov 2025
AI Progress Should Be Measured by Capability-Per-Resource, Not Scale Alone: A Framework for Gradient-Guided Resource Allocation in LLMs
David McCoy
Yulun Wu
Zachary Butzin-Dozier
108
0
0
02 Nov 2025
Diluting Restricted Boltzmann Machines
C. Díaz-Faloh
R. Mulet
110
0
0
01 Nov 2025
Spatio-temporal Multivariate Time Series Forecast with Chosen Variables
Zibo Liu
Zhe Jiang
Zelin Xu
Tingsong Xiao
Yupu Zhang
Zhengkun Xiao
Haibo Wang
Shigang Chen
AI4TS
132
0
0
28 Oct 2025
Kernelized Sparse Fine-Tuning with Bi-level Parameter Competition for Vision Models
Shufan Shen
Junshu Sun
Shuhui Wang
Qingming Huang
136
0
0
28 Oct 2025
Adaptive Training of INRs via Pruning and Densification
Diana Aldana
João Paulo Lima
Daniel Csillag
Daniel Perazzo
Haoan Feng
Luiz Velho
Tiago Novello
99
0
0
27 Oct 2025
Frustratingly Easy Task-aware Pruning for Large Language Models
Yuanhe Tian
Junjie Liu
Xican Yang
Haishan Ye
Yan Song
133
1
0
26 Oct 2025
Pruning and Quantization Impact on Graph Neural Networks
Khatoon Khedri
Reza Rawassizadeh
Qifu Wen
M. Hosseinzadeh
GNN
190
0
0
24 Oct 2025
A flexible framework for structural plasticity in GPU-accelerated sparse spiking neural networks
James C. Knight
Johanna Senk
Thomas Nowotny
112
1
0
22 Oct 2025
A Survey on Cache Methods in Diffusion Models: Toward Efficient Multi-Modal Generation
Jiacheng Liu
Xinyu Wang
Yuqi Lin
Zhikai Wang
P. Wang
...
Zexuan Yan
Zhengyi Shi
Chang Zou
Yue Ma
Linfeng Zhang
367
2
0
22 Oct 2025
Towards Unsupervised Open-Set Graph Domain Adaptation via Dual Reprogramming
Zhen Zhang
Bingsheng He
OOD
152
0
0
21 Oct 2025
C-SWAP: Explainability-Aware Structured Pruning for Efficient Neural Networks Compression
Baptiste Bauvin
Loïc Baret
Ola Ahmad
104
0
0
21 Oct 2025
S2AP: Score-space Sharpness Minimization for Adversarial Pruning
Giorgio Piras
Qi Zhao
Fabio Brau
Maura Pintor
Christian Wressnegger
Battista Biggio
AAML
125
0
0
21 Oct 2025
The Graphon Limit Hypothesis: Understanding Neural Network Pruning via Infinite Width Analysis
Hoang Pham
T. Ta
Tom Jacobs
R. Burkholz
Long Tran-Thanh
116
0
0
20 Oct 2025
From Local to Global: Revisiting Structured Pruning Paradigms for Large Language Models
Ziyan Wang
Enmao Diao
Qi Le
Pu Wang
Minwoo Lee
Shu-ping Yeh
Evgeny Stupachenko
Hao Feng
Li Yang
128
1
0
20 Oct 2025
Neuronal Group Communication for Efficient Neural representation
Zhengqi Pei
Qingming Huang
Shuhui Wang
107
0
0
19 Oct 2025
Pruning Overparameterized Multi-Task Networks for Degraded Web Image Restoration
Thomas Katraouras
Dimitrios Rafailidis
VLM
100
0
0
16 Oct 2025
Efficient Dynamic Structured Sparse Training with Learned Shuffles
Abhishek Tyagi
Arjun Iyer
Liam Young
William H Renninger
Christopher Kanan
Yuhao Zhu
91
0
0
16 Oct 2025
Convergence, design and training of continuous-time dropout as a random batch method
Antonio Álvarez-López
Martín Hernández
88
0
0
15 Oct 2025
Structured Sparsity and Weight-adaptive Pruning for Memory and Compute efficient Whisper models
Prasenjit K Mudi
Anshi Sachan
Dahlia Devapriya
Sheetal Kalyani
60
0
0
14 Oct 2025
Compressibility Measures Complexity: Minimum Description Length Meets Singular Learning Theory
Einar Urdshals
Edmund Lau
Jesse Hoogland
Stan van Wingerden
Daniel Murfet
103
1
0
14 Oct 2025
SMEC: Rethinking Matryoshka Representation Learning for Retrieval Embedding Compression
Biao Zhang
Lixin Chen
Tong Liu
Bo Zheng
120
0
0
14 Oct 2025
Pruning Cannot Hurt Robustness: Certified Trade-offs in Reinforcement Learning
James Pedley
Benjamin Etheridge
Stephen J. Roberts
Francesco Quinzan
OffRL
AAML
109
0
0
14 Oct 2025
Medical Interpretability and Knowledge Maps of Large Language Models
Razvan Marinescu
Victoria-Elisabeth Gruber
Diego Fajardo
FAtt
AI4MH
222
0
0
13 Oct 2025
FOSSIL: Harnessing Feedback on Suboptimal Samples for Data-Efficient Generalisation with Imitation Learning for Embodied Vision-and-Language Tasks
Sabrina McCallum
Amit Parekh
Alessandro Suglia
LM&Ro
116
0
0
13 Oct 2025
SQS: Bayesian DNN Compression through Sparse Quantized Sub-distributions
Ziyi Wang
Nan Jiang
Guang Lin
Qifan Song
MQ
197
0
0
10 Oct 2025