Rate Distortion For Model Compression: From Theory To Practice

9 October 2018

Papers citing "Rate Distortion For Model Compression: From Theory To Practice"

Title | Authors | Tag | Counts | Date
BackSlash: Rate Constrained Optimized Training of Large Language Models | Jun Wu, Jiangtao Wen, Yuxing Han | | 34 0 0 | 23 Apr 2025
Rotation Invariant Quantization for Model Compression | Dor-Joseph Kampeas, Yury Nahshan, Hanoch Kremer, Gil Lederman, Shira Zaloshinski, Zheng Li, E. Haleva | MQ | 16 0 0 | 03 Mar 2023
Minimax Optimal Quantization of Linear Models: Information-Theoretic Limits and Efficient Algorithms | R. Saha, Mert Pilanci, Andrea J. Goldsmith | MQ | 17 3 0 | 23 Feb 2022
An Information-Theoretic Justification for Model Pruning | Berivan Isik, Tsachy Weissman, Albert No | | 84 35 0 | 16 Feb 2021
Transform Quantization for CNN (Convolutional Neural Network) Compression | Sean I. Young, Wang Zhe, David S. Taubman, B. Girod | MQ | 25 69 0 | 02 Sep 2020
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation | Yonghui Wu, M. Schuster, Z. Chen, Quoc V. Le, Mohammad Norouzi, ..., Alex Rudnick, Oriol Vinyals, G. Corrado, Macduff Hughes, J. Dean | AIMat | 716 6,743 0 | 26 Sep 2016