Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2308.03290
Cited By
FLIQS: One-Shot Mixed-Precision Floating-Point and Integer Quantization Search
7 August 2023
Jordan Dotzel
Gang Wu
Andrew Li
M. Umar
Yun Ni
Mohamed S. Abdelfattah
Zhiru Zhang
Liqun Cheng
Martin G. Dixon
N. Jouppi
Quoc V. Le
Sheng R. Li
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"FLIQS: One-Shot Mixed-Precision Floating-Point and Integer Quantization Search"
5 / 5 papers shown
Title
Phoeni6: a Systematic Approach for Evaluating the Energy Consumption of Neural Networks
Antônio Oliveira-Filho
Wellington Silva-de-Souza
Carlos Alberto Valderrama Sakuyama
Samuel Xavier-de-Souza
57
0
0
25 Feb 2025
Learning from Students: Applying t-Distributions to Explore Accurate and Efficient Formats for LLMs
Jordan Dotzel
Yuzong Chen
Bahaa Kotb
Sushma Prasad
Gang Wu
Sheng R. Li
Mohamed S. Abdelfattah
Zhiru Zhang
21
7
0
06 May 2024
SimQ-NAS: Simultaneous Quantization Policy and Neural Architecture Search
S. N. Sridhar
Maciej Szankin
Fang Chen
Sairam Sundaresan
Anthony Sarah
MQ
11
0
0
19 Dec 2023
FP8 Formats for Deep Learning
Paulius Micikevicius
Dusan Stosic
N. Burgess
Marius Cornea
Pradeep Dubey
...
Naveen Mellempudi
S. Oberman
M. Shoeybi
Michael Siu
Hao Wu
BDL
VLM
MQ
67
119
0
12 Sep 2022
ReLeQ: A Reinforcement Learning Approach for Deep Quantization of Neural Networks
Ahmed T. Elthakeb
Prannoy Pilligundla
Fatemehsadat Mireshghallah
Amir Yazdanbakhsh
H. Esmaeilzadeh
MQ
49
68
0
05 Nov 2018
1