2407.10115
A Bag of Tricks for Scaling CPU-based Deep FFMs to more than 300m Predictions per Second
14 July 2024
Blaž Škrlj
Benjamin Ben-Shalom
Grega Gaspersic
Adi Schwartz
Ramzi Hoseisi
Naama Ziporin
Davorin Kopic
Andraz Tori
Papers citing "A Bag of Tricks for Scaling CPU-based Deep FFMs to more than 300m Predictions per Second"
1 / 1 papers shown
Title
Towards Efficient Post-training Quantization of Pre-trained Language Models
Haoli Bai
Lu Hou
Lifeng Shang
Xin Jiang
Irwin King
M. Lyu
MQ
73
47
0
30 Sep 2021