Few-Bit Backward: Quantized Gradients of Activation Functions for Memory Footprint Reduction

1 February 2022

Papers citing "Few-Bit Backward: Quantized Gradients of Activation Functions for Memory Footprint Reduction"

6 / 6 papers shown

Title
COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training Haocheng Xi Han Cai Ligeng Zhu Y. Lu Kurt Keutzer Jianfei Chen Song Han MQ 63 9 0 25 Oct 2024
Eco2AI: carbon emissions tracking of machine learning models as the first step towards sustainable AI S. Budennyy V. Lazarev N. Zakharenko A. Korovin Olga Plosskaya ... Ivan V. Oseledets I. Barsola Ilya M. Egorov A. Kosterina L. Zhukov 26 89 0 31 Jul 2022
Survey on Large Scale Neural Network Training Julia Gusak Daria Cherniuk Alena Shilova A. Katrutsa Daniel Bershatsky ... Lionel Eyraud-Dubois Oleg Shlyazhko Denis Dimitrov Ivan V. Oseledets Olivier Beaumont 22 10 0 21 Feb 2022
Emojich -- zero-shot emoji generation using Russian language: a technical report Alex Shonenkov Daria Bakshandaeva Denis Dimitrov Aleks D. Nikolich VLM 27 5 0 04 Dec 2021
Zero-Shot Text-to-Image Generation Aditya A. Ramesh Mikhail Pavlov Gabriel Goh Scott Gray Chelsea Voss Alec Radford Mark Chen Ilya Sutskever VLM 253 4,774 0 24 Feb 2021
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding Alex Jinpeng Wang Amanpreet Singh Julian Michael Felix Hill Omer Levy Samuel R. Bowman ELM 294 6,950 0 20 Apr 2018