arXiv:2503.08040
Accurate INT8 Training Through Dynamic Block-Level Fallback
11 March 2025
Pengle Zhang
Jia Wei
Jintao Zhang
Jun-Jie Zhu
Jianfei Chen
Tags: MQ
Links: arXiv (abs) · PDF · HTML · HuggingFace · GitHub

Papers citing "Accurate INT8 Training Through Dynamic Block-Level Fallback" (5 papers)

PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models
Tianchen Zhao, Ke Hong, Xinhao Yang, Xuefeng Xiao, Huixia Li, ..., Ruiqi Xie, Siqi Chen, Hongyu Zhu, Xicheng Zhang, Yu Wang
Tags: MQ, VGen · 19 Jun 2025

SageAttention2++: A More Efficient Implementation of SageAttention2
Jintao Zhang, Xiaoming Xu, Jia Wei, Haofeng Huang, Pengle Zhang, Chendong Xiang, Jun Zhu, Jianfei Chen
Tags: MQ, VLM · 27 May 2025

Scaling Law for Quantization-Aware Training
Mengzhao Chen, Chaoyi Zhang, Jing Liu, Yutao Zeng, Zeyue Xue, ..., Yunshui Li, Jin Ma, Jie Huang, Xun Zhou, Ping Luo
Tags: MQ · 20 May 2025

SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration
International Conference on Learning Representations (ICLR), 2024
Jintao Zhang, Jia Wei, Pengle Zhang, Jun-Jie Zhu, Jun Zhu, Jianfei Chen
Tags: VLM, MQ · 03 Oct 2024

QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving
Chengyue Wu, Haotian Tang, Shang Yang, Zhekai Zhang, Guangxuan Xiao, Chuang Gan, Song Han
07 May 2024