2310.09259
Cited By
QUIK: Towards End-to-End 4-Bit Inference on Generative Large Language Models
13 October 2023
Saleh Ashkboos, Ilia Markov, Elias Frantar, Tingxuan Zhong, Xincheng Wang, Jie Ren, Torsten Hoefler, Dan Alistarh
MQ
SyDa
Papers citing
"QUIK: Towards End-to-End 4-Bit Inference on Generative Large Language Models"
3 / 3 papers shown
Trends in AI Supercomputers
Konstantin Pilz, James Sanders, Robi Rahman, Lennart Heim
GNN
ELM
19
0
0
22 Apr 2025
Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective
Jinhao Li, Jiaming Xu, Shan Huang, Yonghua Chen, Wen Li, ..., Jiayi Pan, Li Ding, Hao Zhou, Yu Wang, Guohao Dai
41
13
0
06 Oct 2024
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
Leo Gao, Stella Biderman, Sid Black, Laurence Golding, Travis Hoppe, ..., Horace He, Anish Thite, Noa Nabeshima, Shawn Presser, Connor Leahy
AIMat
236
1,508
0
31 Dec 2020