2308.08061
Cited By
The Costly Dilemma: Generalization, Evaluation and Cost-Optimal Deployment of Large Language Models
15 August 2023
Abi Aryan
Aakash Kumar Nain
Andrew McMahon
Lucas Augusto Meyer
Harpreet Sahota
Papers citing
"The Costly Dilemma: Generalization, Evaluation and Cost-Optimal Deployment of Large Language Models"
2 / 2 papers shown
Title
Improving the Serving Performance of Multi-LoRA Large Language Models via Efficient LoRA and KV Cache Management
Hang Zhang
Jiuchen Shi
Yixiao Wang
Quan Chen
Yizhou Shan
Minyi Guo
25
0
0
19 Apr 2025
JurEE not Judges: safeguarding llm interactions with small, specialised Encoder Ensembles
Dom Nasrabadi
24
0
0
11 Oct 2024