Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2406.17989
Cited By
Learning Neural Networks with Sparse Activations
26 June 2024
Pranjal Awasthi
Nishanth Dikkala
Pritish Kamath
Raghu Meka
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Learning Neural Networks with Sparse Activations"
5 / 5 papers shown
Universal Properties of Activation Sparsity in Modern Large Language Models
Filip Szatkowski
Patryk Bedkowski
Alessio Devoto
Jan Dubiñski
Pasquale Minervini
Mikołaj Piórczyński
Simone Scardapane
Bartosz Wójcik
159
1
0
30 Aug 2025
Spark Transformer: Reactivating Sparsity in FFN and Attention
Chong You
Kan Wu
Zhipeng Jia
Lin Chen
Srinadh Bhojanapalli
...
Felix X. Yu
Prateek Jain
David Culler
Henry M. Levy
Sanjiv Kumar
223
2
0
07 Jun 2025
COUNTDOWN: Contextually Sparse Activation Filtering Out Unnecessary Weights in Down Projection
Jaewon Cheon
Pilsung Kang
348
0
0
23 May 2025
Mixture of Experts Made Intrinsically Interpretable
Xingyi Yang
Constantin Venhoff
Ashkan Khakzar
Christian Schroeder de Witt
P. Dokania
Adel Bibi
Juil Sock
MoE
326
9
0
05 Mar 2025
GIFT: Unlocking Full Potential of Labels in Distilled Dataset at Near-zero Cost
International Conference on Learning Representations (ICLR), 2024
Xinyi Shang
Peng Sun
Tao Lin
336
9
0
23 May 2024
1