ResearchTrend.AI
QUIK: Towards End-to-End 4-Bit Inference on Generative Large Language Models

13 October 2023
Saleh Ashkboos
Ilia Markov
Elias Frantar
Tingxuan Zhong
Xincheng Wang
Jie Ren
Torsten Hoefler
Dan Alistarh
    MQ
    SyDa

Papers citing "QUIK: Towards End-to-End 4-Bit Inference on Generative Large Language Models"

3 / 3 papers shown
Trends in AI Supercomputers
Konstantin Pilz
James Sanders
Robi Rahman
Lennart Heim
GNN
ELM
19
0
0
22 Apr 2025
Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective
Jinhao Li
Jiaming Xu
Shan Huang
Yonghua Chen
Wen Li
...
Jiayi Pan
Li Ding
Hao Zhou
Yu Wang
Guohao Dai
41
13
0
06 Oct 2024
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
Leo Gao
Stella Biderman
Sid Black
Laurence Golding
Travis Hoppe
...
Horace He
Anish Thite
Noa Nabeshima
Shawn Presser
Connor Leahy
AIMat
236
1,508
0
31 Dec 2020