ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2405.15130
  4. Cited By
OptLLM: Optimal Assignment of Queries to Large Language Models

OptLLM: Optimal Assignment of Queries to Large Language Models

24 May 2024
Yueyue Liu
Hongyu Zhang
Yuantian Miao
Van-Hoang Le
Zhiqiang Li
ArXiv (abs)PDFHTML

Papers citing "OptLLM: Optimal Assignment of Queries to Large Language Models"

7 / 7 papers shown
Title
LLM Routing with Dueling Feedback
LLM Routing with Dueling Feedback
Chao-Kai Chiang
Takashi Ishida
Masashi Sugiyama
76
0
0
01 Oct 2025
RouterArena: An Open Platform for Comprehensive Comparison of LLM Routers
RouterArena: An Open Platform for Comprehensive Comparison of LLM Routers
Yifan Lu
Rixin Liu
Jiayi Yuan
Xingqi Cui
Shenrun Zhang
Hongyi Liu
Jiarong Xing
ELM
267
0
0
30 Sep 2025
One Head, Many Models: Cross-Attention Routing for Cost-Aware LLM Selection
One Head, Many Models: Cross-Attention Routing for Cost-Aware LLM Selection
Roshini Pulishetty
Mani Kishan Ghantasala
Keerthy Kaushik Dasoju
Niti Mangwani
Vishal Garimella
...
Somya Chatterjee
Yue Kang
Ehi Nosakhare
Sadid Hasan
Soundar Srinivasan
3DV
92
0
0
11 Sep 2025
Cost-Aware Contrastive Routing for LLMs
Cost-Aware Contrastive Routing for LLMs
Reza Shirkavand
Shangqian Gao
Qi He
Heng-Chiao Huang
163
1
0
17 Aug 2025
PPMI: Privacy-Preserving LLM Interaction with Socratic Chain-of-Thought Reasoning and Homomorphically Encrypted Vector Databases
PPMI: Privacy-Preserving LLM Interaction with Socratic Chain-of-Thought Reasoning and Homomorphically Encrypted Vector Databases
Yubeen Bae
Minchan Kim
Jaejin Lee
Sangbum Kim
Jaehyung Kim
Yejin Choi
Niloofar Mireshghallah
108
3
0
19 Jun 2025
Towards Efficient Multi-LLM Inference: Characterization and Analysis of LLM Routing and Hierarchical Techniques
Towards Efficient Multi-LLM Inference: Characterization and Analysis of LLM Routing and Hierarchical Techniques
Adarsh Prasad Behera
J. Champati
Roberto Morabito
Sasu Tarkoma
J. Gross
148
5
0
06 Jun 2025
A Unified Approach to Routing and Cascading for LLMs
A Unified Approach to Routing and Cascading for LLMs
Jasper Dekoninck
Maximilian Baader
Martin Vechev
370
18
0
14 Oct 2024
1