Papers
Communities
Organizations
Events
Blog
Pricing
Feedback
Contact Sales
Search
Open menu
Home
Papers
2505.11594
Cited By
SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training
16 May 2025
Jintao Zhang
Jia Wei
Pengle Zhang
Xiaoming Xu
Haofeng Huang
Haoxu Wang
Kai Jiang
Jun Zhu
Jianfei Chen
MQ
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (75 upvotes)
Papers citing
"SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training"
6 / 6 papers shown
Title
SageAttention2++: A More Efficient Implementation of SageAttention2
Jintao Zhang
Xiaoming Xu
Jia Wei
Haofeng Huang
Pengle Zhang
Chendong Xiang
Jun Zhu
Jianfei Chen
MQ
VLM
202
9
0
27 May 2025
R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing
Tianyu Fu
Yi Ge
Yichen You
Enshu Liu
Zhihang Yuan
Guohao Dai
Shengen Yan
Huazhong Yang
Yu Wang
MoE
LRM
146
4
0
27 May 2025
Accurate INT8 Training Through Dynamic Block-Level Fallback
Pengle Zhang
Jia Wei
Jintao Zhang
Jun-Jie Zhu
Jianfei Chen
MQ
245
9
0
11 Mar 2025
AMXFP4: Taming Activation Outliers with Asymmetric Microscaling Floating-Point for 4-bit LLM Inference
Janghwan Lee
Jiwoong Park
Jinseok Kim
Yongjik Kim
Jungju Oh
Jinwook Oh
Jungwook Choi
133
3
0
15 Nov 2024
FrameBridge: Improving Image-to-Video Generation with Bridge Models
Yuji Wang
Zehua Chen
Xiaoyu Chen
Jun-Jie Zhu
Jianfei Chen
Jianfei Chen
DiffM
VGen
705
5
0
20 Oct 2024
SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration
Jintao Zhang
Jia Wei
Pengle Zhang
Jun-Jie Zhu
Jun Zhu
Jianfei Chen
VLM
MQ
288
52
0
03 Oct 2024
1