2404.15778
Cited By
BASS: Batched Attention-optimized Speculative Sampling
24 April 2024
Haifeng Qian, Sujan Kumar Gonugondla, Sungsoo Ha, Mingyue Shang, Sanjay Krishna Gouda, Ramesh Nallapati, Sudipta Sengupta, Xiaofei Ma, Anoop Deoras
BDL
Papers citing "BASS: Batched Attention-optimized Speculative Sampling"
2 / 2 papers shown
Title
Characterizing and Efficiently Accelerating Multimodal Generation Model Inference
Yejin Lee, Anna Y. Sun, Basil Hosmer, Bilge Acun, Can Balioglu, ..., Ram Pasunuru, Scott Yih, Sravya Popuri, Xing Liu, Carole-Jean Wu
45
2
0
30 Sep 2024
Scaling Laws for Neural Language Models
Jared Kaplan, Sam McCandlish, T. Henighan, Tom B. Brown, B. Chess, R. Child, Scott Gray, Alec Radford, Jeff Wu, Dario Amodei
220
3,054
0
23 Jan 2020