Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2405.02134
Cited By
Optimising Calls to Large Language Models with Uncertainty-Based Two-Tier Selection
3 May 2024
Guillem Ramírez
Alexandra Birch
Ivan Titov
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Optimising Calls to Large Language Models with Uncertainty-Based Two-Tier Selection"
12 / 12 papers shown
Title
Invoke Interfaces Only When Needed: Adaptive Invocation for Large Language Models in Question Answering
Jihao Zhao
Chunlai Zhou
Biao Qin
45
0
0
05 May 2025
RouterEval: A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in LLMs
Zhongzhan Huang
Guoming Ling
Vincent S. Liang
Yupei Lin
Yandong Chen
Shanshan Zhong
Hefeng Wu
Liang Lin
LRM
52
1
0
08 Mar 2025
Optimizing Model Selection for Compound AI Systems
Lingjiao Chen
Jared Quincy Davis
Boris Hanin
Peter Bailis
Matei A. Zaharia
James Y. Zou
Ion Stoica
42
0
0
20 Feb 2025
A Unified Approach to Routing and Cascading for LLMs
Jasper Dekoninck
Maximilian Baader
Martin Vechev
60
2
0
17 Feb 2025
Lossless Acceleration of Large Language Models with Hierarchical Drafting based on Temporal Locality in Speculative Decoding
Sukmin Cho
S. Choi
T. Hwang
Jeongyeon Seo
Soyeong Jeong
Huije Lee
Hoyun Song
Jong C. Park
Youngjin Kwon
46
0
0
08 Feb 2025
A Survey on the Honesty of Large Language Models
Siheng Li
Cheng Yang
Taiqiang Wu
Chufan Shi
Yuji Zhang
...
Jie Zhou
Yujiu Yang
Ngai Wong
Xixin Wu
Wai Lam
HILM
22
4
0
27 Sep 2024
CLAVE: An Adaptive Framework for Evaluating Values of LLM Generated Responses
Jing Yao
Xiaoyuan Yi
Xing Xie
ELM
ALM
16
7
0
15 Jul 2024
Hybrid LLM: Cost-Efficient and Quality-Aware Query Routing
Dujian Ding
Ankur Mallick
Chi Wang
Robert Sim
Subhabrata Mukherjee
Victor Rühle
L. Lakshmanan
Ahmed Hassan Awadallah
80
73
0
22 Apr 2024
A Survey on Effective Invocation Methods of Massive LLM Services
Can Wang
Bolin Zhang
Dianbo Sui
Zhiying Tu
Xiaoyu Liu
Jiabao Kang
34
5
0
05 Feb 2024
Cache & Distil: Optimising API Calls to Large Language Models
Guillem Ramírez
Matthias Lindemann
Alexandra Birch
Ivan Titov
19
3
0
20 Oct 2023
AutoMix: Automatically Mixing Language Models
Pranjal Aggarwal
Aman Madaan
Ankit Anand
Srividya Pranavi Potharaju
Swaroop Mishra
...
Karthik Kappaganthu
Yiming Yang
Shyam Upadhyay
Manaal Faruqui
Mausam
37
17
0
19 Oct 2023
Language Models as Knowledge Bases?
Fabio Petroni
Tim Rocktaschel
Patrick Lewis
A. Bakhtin
Yuxiang Wu
Alexander H. Miller
Sebastian Riedel
KELM
AI4MH
393
2,216
0
03 Sep 2019
1