Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2309.00964
Cited By
eDKM: An Efficient and Accurate Train-time Weight Clustering for Large Language Models
2 September 2023
Minsik Cho
Keivan Alizadeh Vahid
Qichen Fu
Saurabh N. Adya
C. C. D. Mundo
Mohammad Rastegari
Devang Naik
Peter Zatloukal
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"eDKM: An Efficient and Accurate Train-time Weight Clustering for Large Language Models"
3 / 3 papers shown
Title
On-Device LLMs for SMEs: Challenges and Opportunities
Jeremy Stephen Gabriel Yee
Pai Chet Ng
Zhengkui Wang
Ian McLoughlin
Aik Beng Ng
Simon See
27
1
0
21 Oct 2024
KV-Runahead: Scalable Causal LLM Inference by Parallel Key-Value Cache Generation
Minsik Cho
Mohammad Rastegari
Devang Naik
18
4
0
08 May 2024
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
1