1810.06401
Rate Distortion For Model Compression: From Theory To Practice
9 October 2018
Weihao Gao
Yu-Han Liu
Chong-Jun Wang
Sewoong Oh
Papers citing "Rate Distortion For Model Compression: From Theory To Practice"
6 / 6 papers shown
BackSlash: Rate Constrained Optimized Training of Large Language Models
Jun Wu
Jiangtao Wen
Yuxing Han
34
0
0
23 Apr 2025
Rotation Invariant Quantization for Model Compression
Dor-Joseph Kampeas
Yury Nahshan
Hanoch Kremer
Gil Lederman
Shira Zaloshinski
Zheng Li
E. Haleva
MQ
16
0
0
03 Mar 2023
Minimax Optimal Quantization of Linear Models: Information-Theoretic Limits and Efficient Algorithms
R. Saha
Mert Pilanci
Andrea J. Goldsmith
MQ
17
3
0
23 Feb 2022
An Information-Theoretic Justification for Model Pruning
Berivan Isik
Tsachy Weissman
Albert No
84
35
0
16 Feb 2021
Transform Quantization for CNN (Convolutional Neural Network) Compression
Sean I. Young
Wang Zhe
David S. Taubman
B. Girod
MQ
25
69
0
02 Sep 2020
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,743
0
26 Sep 2016