2402.03158
Optimal and Near-Optimal Adaptive Vector Quantization
5 February 2024
Ran Ben-Basat
Y. Ben-Itzhak
Michael Mitzenmacher
S. Vargaftik
MQ
Papers citing
"Optimal and Near-Optimal Adaptive Vector Quantization"
5 / 5 papers shown
Title
FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU
Ying Sheng
Lianmin Zheng
Binhang Yuan
Zhuohan Li
Max Ryabinin
...
Joseph E. Gonzalez
Percy Liang
Christopher Ré
Ion Stoica
Ce Zhang
144
366
0
13 Mar 2023
QUIC-FL: Quick Unbiased Compression for Federated Learning
Ran Ben-Basat
S. Vargaftik
Amit Portnoy
Gil Einziger
Y. Ben-Itzhak
Michael Mitzenmacher
FedML
64
13
0
26 May 2022
DRIVE: One-bit Distributed Mean Estimation
S. Vargaftik
Ran Ben-Basat
Amit Portnoy
Gal Mendelson
Y. Ben-Itzhak
Michael Mitzenmacher
OOD
FedML
68
51
0
18 May 2021
NUQSGD: Provably Communication-efficient Data-parallel SGD via Nonuniform Quantization
Ali Ramezani-Kebrya
Fartash Faghri
Ilya Markov
V. Aksenov
Dan Alistarh
Daniel M. Roy
MQ
57
30
0
28 Apr 2021
Q-BERT: Hessian Based Ultra Low Precision Quantization of BERT
Sheng Shen
Zhen Dong
Jiayu Ye
Linjian Ma
Z. Yao
A. Gholami
Michael W. Mahoney
Kurt Keutzer
MQ
225
575
0
12 Sep 2019