ResearchTrend.AI

Tryage: Real-time, intelligent Routing of User Prompts to Large Language Models

22 August 2023
S. N. Hari
Matt Thomson
arXiv: 2308.11601

Papers citing "Tryage: Real-time, intelligent Routing of User Prompts to Large Language Models"

12 / 12 papers shown
A Unified Approach to Routing and Cascading for LLMs
Jasper Dekoninck
Maximilian Baader
Martin Vechev
60
2
0
17 Feb 2025
Efficiently Deploying LLMs with Controlled Risk
Michael J. Zellinger
Matt Thomson
36
1
0
03 Oct 2024
MetaLLM: A High-performant and Cost-efficient Dynamic Framework for Wrapping LLMs
Quang H. Nguyen
Duy C. Hoang
Juliette Decugis
Saurav Manchanda
Nitesh V. Chawla
Khoa D. Doan
37
6
0
15 Jul 2024
Meta Reasoning for Large Language Models
Peizhong Gao
Ao Xie
Shaoguang Mao
Wenshan Wu
Yan Xia
Haipeng Mi
Furu Wei
ReLM
LLMAG
LRM
43
7
0
17 Jun 2024
Leveraging Open-Source Large Language Models for encoding Social Determinants of Health using an Intelligent Router
Akul Goel
S. N. Hari
Belinda Waltman
Matt Thomson
18
0
0
30 May 2024
Cascade-Aware Training of Language Models
Congchao Wang
Sean Augenstein
Keith Rush
Wittawat Jitkrittum
Harikrishna Narasimhan
A. S. Rawat
A. Menon
Alec Go
28
4
0
29 May 2024
Language Model Cascades: Token-level uncertainty and beyond
Neha Gupta
Harikrishna Narasimhan
Wittawat Jitkrittum
A. S. Rawat
A. Menon
Sanjiv Kumar
UQLM
41
42
0
15 Apr 2024
RouterBench: A Benchmark for Multi-LLM Routing System
Qitian Jason Hu
Jacob Bieker
Xiuyu Li
Nan Jiang
Benjamin Keigwin
Gaurav Ranganath
Kurt Keutzer
Shriyash Kaustubh Upadhyay
42
36
0
18 Mar 2024
Herd: Using multiple, smaller LLMs to match the performances of proprietary, large LLMs via an intelligent composer
S. N. Hari
Matt Thomson
10
0
0
30 Oct 2023
Towards Robust Multi-Modal Reasoning via Model Selection
Xiangyan Liu
Rongxue Li
Wei Ji
Tao Lin
LLMAG
LRM
27
3
0
12 Oct 2023
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
258
7,412
0
11 Nov 2021
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
Leo Gao
Stella Biderman
Sid Black
Laurence Golding
Travis Hoppe
...
Horace He
Anish Thite
Noa Nabeshima
Shawn Presser
Connor Leahy
AIMat
245
1,986
0
31 Dec 2020