Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2409.16997
Cited By
INT-FlashAttention: Enabling Flash Attention for INT8 Quantization
25 September 2024
Shimao Chen
Zirui Liu
Zhiying Wu
Ce Zheng
Peizhuang Cong
Zihan Jiang
Yuhan Wu
Lei Su
Tong Yang
MQ
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"INT-FlashAttention: Enabling Flash Attention for INT8 Quantization"
Title
No papers