Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2311.12359
Cited By
Shedding the Bits: Pushing the Boundaries of Quantization with Minifloats on FPGAs
21 November 2023
Shivam Aggarwal
Hans Jakob Damsgaard
Alessandro Pappalardo
Giuseppe Franco
Thomas B. Preußer
Michaela Blott
Tulika Mitra
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Shedding the Bits: Pushing the Boundaries of Quantization with Minifloats on FPGAs"
3 / 3 papers shown
Title
HALO: Hardware-aware quantization with low critical-path-delay weights for LLM acceleration
Rohan Juneja
Shivam Aggarwal
Safeen Huda
Tulika Mitra
L. Peh
42
0
0
27 Feb 2025
DAOP: Data-Aware Offloading and Predictive Pre-Calculation for Efficient MoE Inference
Yujie Zhang
Shivam Aggarwal
T. Mitra
MoE
72
0
0
16 Dec 2024
A2Q+: Improving Accumulator-Aware Weight Quantization
Ian Colbert
Alessandro Pappalardo
Jakoba Petri-Koenig
Yaman Umuroglu
MQ
21
4
0
19 Jan 2024
1