Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2409.13652
Cited By
OATS: Outlier-Aware Pruning Through Sparse and Low Rank Decomposition
20 September 2024
Stephen Zhang
V. Papyan
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"OATS: Outlier-Aware Pruning Through Sparse and Low Rank Decomposition"
1 / 1 papers shown
Title
Efficient LLM Inference using Dynamic Input Pruning and Cache-Aware Masking
Marco Federici
Davide Belli
M. V. Baalen
Amir Jalalirad
Andrii Skliar
Bence Major
Markus Nagel
Paul N. Whatmough
76
0
0
02 Dec 2024
1