VENOM: A Vectorized N:M Format for Unleashing the Power of Sparse Tensor
CoresInternational Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2023 |
Scaling Laws for Sparsely-Connected Foundation ModelsInternational Conference on Learning Representations (ICLR), 2023 |
How Does Pruning Impact Long-Tailed Multi-Label Medical Image
Classifiers?International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2023 |
Towards Automated Circuit Discovery for Mechanistic InterpretabilityNeural Information Processing Systems (NeurIPS), 2023 |
Bias in Pruned Vision Models: In-Depth Analysis and CountermeasuresComputer Vision and Pattern Recognition (CVPR), 2023 |
ZipLM: Inference-Aware Structured Pruning of Language ModelsNeural Information Processing Systems (NeurIPS), 2023 |
SparseGPT: Massive Language Models Can Be Accurately Pruned in One-ShotInternational Conference on Machine Learning (ICML), 2023 Elias Frantar Dan Alistarh |