ResearchTrend.AI
Optimal and Near-Optimal Adaptive Vector Quantization

5 February 2024
Ran Ben-Basat
Y. Ben-Itzhak
Michael Mitzenmacher
S. Vargaftik
    MQ

Papers citing "Optimal and Near-Optimal Adaptive Vector Quantization"

5 / 5 papers shown
FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU
Ying Sheng
Lianmin Zheng
Binhang Yuan
Zhuohan Li
Max Ryabinin
...
Joseph E. Gonzalez
Percy Liang
Christopher Ré
Ion Stoica
Ce Zhang
144
366
0
13 Mar 2023
QUIC-FL: Quick Unbiased Compression for Federated Learning
Ran Ben-Basat
S. Vargaftik
Amit Portnoy
Gil Einziger
Y. Ben-Itzhak
Michael Mitzenmacher
FedML
64
13
0
26 May 2022
DRIVE: One-bit Distributed Mean Estimation
S. Vargaftik
Ran Ben-Basat
Amit Portnoy
Gal Mendelson
Y. Ben-Itzhak
Michael Mitzenmacher
OOD
FedML
66
51
0
18 May 2021
NUQSGD: Provably Communication-efficient Data-parallel SGD via Nonuniform Quantization
Ali Ramezani-Kebrya
Fartash Faghri
Ilya Markov
V. Aksenov
Dan Alistarh
Daniel M. Roy
MQ
57
30
0
28 Apr 2021
Q-BERT: Hessian Based Ultra Low Precision Quantization of BERT
Sheng Shen
Zhen Dong
Jiayu Ye
Linjian Ma
Z. Yao
A. Gholami
Michael W. Mahoney
Kurt Keutzer
MQ
225
574
0
12 Sep 2019