Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1803.03635
Cited By
v1
v2
v3
v4
v5 (latest)
The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks
9 March 2018
Jonathan Frankle
Michael Carbin
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks"
50 / 2,187 papers shown
DiP-GO: A Diffusion Pruner via Few-step Gradient Optimization
Neural Information Processing Systems (NeurIPS), 2024
Haowei Zhu
Dehua Tang
Ji Liu
Mingjie Lu
Jintu Zheng
...
Spandan Tiwari
Ashish Sirasao
Jun-Hai Yong
Bin Wang
E. Barsoum
DiffM
168
24
0
22 Oct 2024
Influential Language Data Selection via Gradient Trajectory Pursuit
Zhiwei Deng
Tao Li
Yang Li
213
1
0
22 Oct 2024
Generalized Multimodal Fusion via Poisson-Nernst-Planck Equation
Jiayu Xiong
Jing Wang
Hengjing Xiang
Jun Xue
Chen Xu
Zhouqiang Jiang
189
0
0
20 Oct 2024
The Propensity for Density in Feed-forward Models
European Conference on Artificial Intelligence (ECAI), 2024
Nandi Schoots
Alex Jackson
Ali Kholmovaia
Peter McBurney
Murray Shanahan
CVBM
161
0
0
18 Oct 2024
Linguistically Grounded Analysis of Language Models using Shapley Head Values
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Marcell Richard Fekete
Johannes Bjerva
414
1
0
17 Oct 2024
The Non-Local Model Merging Problem: Permutation Symmetries and Variance Collapse
Ekansh Sharma
Daniel M. Roy
Gintare Karolina Dziugaite
MoMe
276
5
0
16 Oct 2024
FusionLLM: A Decentralized LLM Training System on Geo-distributed GPUs with Adaptive Compression
Zhenheng Tang
Xueze Kang
Yiming Yin
Xinglin Pan
Yuxin Wang
...
Shaohuai Shi
Amelie Chi Zhou
Bo Li
Bingsheng He
Xiaowen Chu
AI4CE
220
10
0
16 Oct 2024
Deep Model Merging: The Sister of Neural Network Interpretability -- A Survey
A. Khan
Todd Nief
Nathaniel Hudson
Mansi Sakarvadia
Daniel Grzenda
Aswathy Ajith
Jordan Pettyjohn
Kyle Chard
Ian Foster
MoMe
205
1
0
16 Oct 2024
MoE-Pruner: Pruning Mixture-of-Experts Large Language Model using the Hints from Its Router
Yanyue Xie
Zhi Zhang
Ding Zhou
Cong Xie
Ziang Song
Xin Liu
Yanzhi Wang
Xue Lin
An Xu
LLMAG
230
24
0
15 Oct 2024
PaSTe: Improving the Efficiency of Visual Anomaly Detection at the Edge
Manuel Barusco
Francesco Borsatti
Davide Dalle Pezze
Francesco Paissan
Elisabetta Farella
Gian Antonio Susto
228
8
0
15 Oct 2024
AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models
Neural Information Processing Systems (NeurIPS), 2024
Haiquan Lu
Yefan Zhou
Shiwei Liu
Zhangyang Wang
Michael W. Mahoney
Yaoqing Yang
146
23
0
14 Oct 2024
RoCoFT: Efficient Finetuning of Large Language Models with Row-Column Updates
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Md. Kowsher
Tara Esmaeilbeig
Chun-Nam Yu
Chen Chen
Mojtaba Soltanalian
Niloofar Yousefi
358
3
0
14 Oct 2024
ALLoRA: Adaptive Learning Rate Mitigates LoRA Fatal Flaws
Hai Huang
Randall Balestriero
200
1
0
13 Oct 2024
Non-transferable Pruning
European Conference on Computer Vision (ECCV), 2024
Ruyi Ding
Lili Su
A. A. Ding
Yunsi Fei
AAML
192
3
0
10 Oct 2024
Neural Metamorphosis
European Conference on Computer Vision (ECCV), 2024
Xingyi Yang
Xinchao Wang
276
5
0
10 Oct 2024
Mitigating Gender Bias in Code Large Language Models via Model Editing
Zhan Qin
Haochuan Wang
Zecheng Wang
Deyuan Liu
Cunhang Fan
Zhao Lv
Zhiying Tu
Dianhui Chu
Dianbo Sui
KELM
199
3
0
10 Oct 2024
Growing Efficient Accurate and Robust Neural Networks on the Edge
Vignesh Sundaresha
Naresh Shanbhag
274
0
0
10 Oct 2024
Bilinear MLPs enable weight-based mechanistic interpretability
International Conference on Learning Representations (ICLR), 2024
Michael T. Pearce
Thomas Dooms
Alice Rigg
José Oramas
Lee Sharkey
233
16
0
10 Oct 2024
More Experts Than Galaxies: Conditionally-overlapping Experts With Biologically-Inspired Fixed Routing
International Conference on Learning Representations (ICLR), 2024
Sagi Shaier
Francisco Pereira
Katharina von der Wense
Lawrence E Hunter
Matt Jones
MoE
698
0
0
10 Oct 2024
Extracting and Combining Abilities For Building Multi-lingual Ability-enhanced Large Language Models
Zhipeng Chen
Liang Song
K. Zhou
Wayne Xin Zhao
Binghai Wang
Weipeng Chen
Ji-Rong Wen
395
0
0
10 Oct 2024
Is C4 Dataset Optimal for Pruning? An Investigation of Calibration Data for LLM Pruning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Abhinav Bandari
L. Yin
Cheng-Yu Hsieh
Ajay Kumar Jaiswal
Tianlong Chen
Li Shen
Ranjay Krishna
Shiwei Liu
193
15
0
09 Oct 2024
Convex Distillation: Efficient Compression of Deep Networks via Convex Optimization
Prateek Varshney
Mert Pilanci
373
0
0
09 Oct 2024
RespDiff: An End-to-End Multi-scale RNN Diffusion Model for Respiratory Waveform Estimation from PPG Signals
Yuyang Miao
Zehua Chen
Chong Li
Danilo Mandic
DiffM
MedIm
344
12
0
06 Oct 2024
Measuring and Controlling Solution Degeneracy across Task-Trained Recurrent Neural Networks
Ann Huang
Satpreet H. Singh
Flavio Martinelli
Kanaka Rajan
378
7
0
04 Oct 2024
Dynamic Sparse Training versus Dense Training: The Unexpected Winner in Image Corruption Robustness
International Conference on Learning Representations (ICLR), 2024
Boqian Wu
Q. Xiao
Shunxin Wang
N. Strisciuglio
Mykola Pechenizkiy
M. V. Keulen
Decebal Constantin Mocanu
Elena Mocanu
OOD
3DH
514
6
0
03 Oct 2024
Efficient Source-Free Time-Series Adaptation via Parameter Subspace Disentanglement
International Conference on Learning Representations (ICLR), 2024
Gaurav Patel
Christopher Sandino
Behrooz Mahasseni
Ellen L. Zippi
Erdrin Azemi
Ali Moin
Juri Minxha
TTA
AI4TS
385
6
0
03 Oct 2024
FedPeWS: Personalized Warmup via Subnetworks for Enhanced Heterogeneous Federated Learning
Nurbek Tastan
Samuel Horváth
Martin Takáč
Karthik Nandakumar
FedML
434
1
0
03 Oct 2024
On the Geometry and Optimization of Polynomial Convolutional Networks
International Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Vahid Shahverdi
Giovanni Luca Marchetti
Kathlén Kohn
259
7
0
01 Oct 2024
Do Influence Functions Work on Large Language Models?
Zhe Li
Wei Zhao
Yige Li
Jun Sun
TDI
228
8
0
30 Sep 2024
EEG Emotion Copilot: Optimizing Lightweight LLMs for Emotional EEG Interpretation with Assisted Medical Record Generation
Neural Networks (NN), 2024
Hongyu Chen
Weiming Zeng
Chong Chen
Luhui Cai
Haiwei Yang
...
Wei Zhang
Yuchen Ren
Hongjie Yan
W. Siok
Nizhuan Wang
337
0
0
30 Sep 2024
Inferring Thunderstorm Occurrence from Vertical Profiles of Convection-Permitting Simulations: Physical Insights from a Physical Deep Learning Model
Artificial Intelligence for the Earth Systems (AI4ES), 2024
Kianusch Vahid Yousefnia
Tobias Bölle
Christoph Metzl
340
0
0
30 Sep 2024
Investigating the Effect of Network Pruning on Performance and Interpretability
Jonathan von Rad
Florian Seuffert
286
3
0
29 Sep 2024
Value-Based Deep Multi-Agent Reinforcement Learning with Dynamic Sparse Training
Neural Information Processing Systems (NeurIPS), 2024
Pihe Hu
Shaolong Li
Zhuoran Li
L. Pan
Longbo Huang
195
1
0
28 Sep 2024
Harmful Fine-tuning Attacks and Defenses for Large Language Models: A Survey
Tiansheng Huang
Sihao Hu
Fatih Ilhan
Selim Furkan Tekin
Ling Liu
AAML
477
75
0
26 Sep 2024
AlterMOMA: Fusion Redundancy Pruning for Camera-LiDAR Fusion Models with Alternative Modality Masking
Neural Information Processing Systems (NeurIPS), 2024
Shiqi Sun
Yantao Lu
Ning Liu
Bo Jiang
JinChao Chen
Ying Zhang
VLM
219
0
0
26 Sep 2024
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models
Neural Information Processing Systems (NeurIPS), 2024
Gongfan Fang
Hongxu Yin
Saurav Muralidharan
Greg Heinrich
Jeff Pool
Jan Kautz
Pavlo Molchanov
Xinchao Wang
174
35
0
26 Sep 2024
Multiplicative Logit Adjustment Approximates Neural-Collapse-Aware Decision Boundary Adjustment
International Conference on Learning Representations (ICLR), 2024
Naoya Hasegawa
Issei Sato
408
1
0
26 Sep 2024
Training Neural Networks for Modularity aids Interpretability
Satvik Golechha
Dylan R. Cope
Nandi Schoots
241
1
0
24 Sep 2024
On Importance of Pruning and Distillation for Efficient Low Resource NLP
Aishwarya Mirashi
Purva Lingayat
Srushti Sonavane
Tejas Padhiyar
Raviraj Joshi
Geetanjali Kale
244
2
0
21 Sep 2024
CFSP: An Efficient Structured Pruning Framework for LLMs with Coarse-to-Fine Activation Information
International Conference on Computational Linguistics (COLING), 2024
Yuxin Wang
Minghua Ma
Zekun Wang
Jingchang Chen
Huiming Fan
Liping Shan
Qing Yang
Dongliang Xu
Ming Liu
Bing Qin
175
6
0
20 Sep 2024
Hidden Activations Are Not Enough: A General Approach to Neural Network Predictions
Samuel Leblanc
Aiky Rasolomanana
Marco Armenta
228
0
0
20 Sep 2024
Cross-Domain Content Generation with Domain-Specific Small Language Models
Ankit Maloo
Abhinav Garg
CLL
214
0
0
19 Sep 2024
Monomial Matrix Group Equivariant Neural Functional Networks
Neural Information Processing Systems (NeurIPS), 2024
Hoang V. Tran
Thieu N. Vo
Tho H. Tran
An T. Nguyen
Tan M. Nguyen
474
13
0
18 Sep 2024
Evaluating the Impact of Compression Techniques on Task-Specific Performance of Large Language Models
Bishwash Khanal
Jeffery M. Capone
267
2
0
17 Sep 2024
Are Sparse Neural Networks Better Hard Sample Learners?
British Machine Vision Conference (BMVC), 2024
Q. Xiao
Boqian Wu
Lu Yin
Christopher Neil Gadzinski
Tianjin Huang
Mykola Pechenizkiy
Decebal Constantin Mocanu
215
1
0
13 Sep 2024
S-STE: Continuous Pruning Function for Efficient 2:4 Sparse Pre-training
Neural Information Processing Systems (NeurIPS), 2024
Yuezhou Hu
Jun-Jie Zhu
Jianfei Chen
410
5
0
13 Sep 2024
A framework for measuring the training efficiency of a neural architecture
Artificial Intelligence Review (Artif Intell Rev), 2024
Eduardo Cueto-Mendoza
John D. Kelleher
255
4
0
12 Sep 2024
Self-Masking Networks for Unsupervised Adaptation
German Conference on Pattern Recognition (DAGM), 2024
Alfonso Taboada Warmerdam
Mathilde Caron
Yuki M. Asano
305
2
0
11 Sep 2024
HESSO: Towards Automatic Efficient and User Friendly Any Neural Network Training and Pruning
Tianyi Chen
Xiaoyi Qu
David Aponte
Colby R. Banbury
Jongwoo Ko
Tianyu Ding
Yong Ma
Vladimir Lyapunov
Ilya Zharkov
Luming Liang
446
2
0
11 Sep 2024
LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation
European Conference on Computer Vision (ECCV), 2024
Archana Swaminathan
Anubhav Gupta
Kamal Gupta
Shishira R. Maiya
Vatsal Agarwal
Abhinav Shrivastava
229
14
0
10 Sep 2024
Previous
1
2
3
...
7
8
9
...
42
43
44
Next