Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2411.10958
Cited By
SageAttention2: Efficient Attention with Thorough Outlier Smoothing and Per-thread INT4 Quantization
17 November 2024
Jintao Zhang
Haofeng Huang
Pengle Zhang
Jia wei
Jun-Jie Zhu
Jianfei Chen
VLM
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SageAttention2: Efficient Attention with Thorough Outlier Smoothing and Per-thread INT4 Quantization"
Title
No papers