Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2303.04673
Cited By
Cost-Effective Hyperparameter Optimization for Large Language Model Generation Inference
8 March 2023
Chi Wang
Susan Liu
Ahmed Hassan Awadallah
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Cost-Effective Hyperparameter Optimization for Large Language Model Generation Inference"
28 / 28 papers shown
Title
Faster, Cheaper, Better: Multi-Objective Hyperparameter Optimization for LLM and RAG Systems
Matthew Barker
Andrew Bell
Evan Thomas
James Carr
Thomas Andrews
Umang Bhatt
80
1
0
25 Feb 2025
A statistically consistent measure of Semantic Variability using Language Models
Yi Liu
69
0
0
01 Feb 2025
From Text to Pose to Image: Improving Diffusion Model Control and Quality
Clément Bonnet
Ariel N. Lee
Franck Wertel
Antoine Tamano
Tanguy Cizain
Pablo Ducru
DiffM
68
0
0
19 Nov 2024
Synthesize, Partition, then Adapt: Eliciting Diverse Samples from Foundation Models
Yeming Wen
Swarat Chaudhuri
29
0
0
11 Nov 2024
EcoAct: Economic Agent Determines When to Register What Action
Shaokun Zhang
Jieyu Zhang
Dujian Ding
Mirian Hipolito Garcia
Ankur Mallick
Daniel Madrigal
Menglin Xia
Victor Rühle
Qingyun Wu
Chi Wang
LLMAG
45
4
0
03 Nov 2024
Scaling LLM Inference with Optimized Sample Compute Allocation
Kexun Zhang
Shang Zhou
Danqing Wang
William Yang Wang
Lei Li
50
7
0
29 Oct 2024
Cognitive Overload Attack:Prompt Injection for Long Context
Bibek Upadhayay
Vahid Behzadan
Amin Karbasi
AAML
28
2
0
15 Oct 2024
Problem Solving Through Human-AI Preference-Based Cooperation
Subhabrata Dutta
Timo Kaufmann
Goran Glavas
Ivan Habernal
Kristian Kersting
Frauke Kreuter
Mira Mezini
Iryna Gurevych
Eyke Hüllermeier
Hinrich Schuetze
82
1
0
14 Aug 2024
Non-Determinism of "Deterministic" LLM Settings
Berk Atil
Alexa Chittams
Liseng Fu
Ferhan Ture
Lixinyu Xu
...
Tomasz Tudrej
Ferhan Ture
Zhe Wu
Lixinyu Xu
Breck Baldwin
26
0
0
06 Aug 2024
The Impact of Hyperparameters on Large Language Model Inference Performance: An Evaluation of vLLM and HuggingFace Pipelines
Agathe Balayn
45
2
0
02 Aug 2024
AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation
Jia Fu
Xiaoting Qin
Fangkai Yang
Lu Wang
Jue Zhang
Qingwei Lin
Yubo Chen
Dongmei Zhang
Saravan Rajmohan
Qi Zhang
32
3
0
27 Jun 2024
FunBO: Discovering Acquisition Functions for Bayesian Optimization with FunSearch
Virginia Aglietti
Ira Ktena
Jessica Schrouff
Eleni Sgouritsa
Francisco J. R. Ruiz
Alan Malek
Alexis Bellot
Silvia Chiappa
32
3
0
07 Jun 2024
LLMParser: An Exploratory Study on Using Large Language Models for Log Parsing
Zeyang Ma
A. Chen
Dong Jae Kim
Tse-Husn Chen
Shaowei Wang
27
44
0
27 Apr 2024
CEval: A Benchmark for Evaluating Counterfactual Text Generation
Van Bach Nguyen
Jorg Schlotterer
Christin Seifert
29
5
0
26 Apr 2024
Return of EM: Entity-driven Answer Set Expansion for QA Evaluation
Dongryeol Lee
Minwoo Lee
Kyungmin Min
Joonsuk Park
Kyomin Jung
47
1
0
24 Apr 2024
Large Language Models as Test Case Generators: Performance Evaluation and Enhancement
Ke-Shen Li
Yuan Yuan
LLMAG
20
12
0
20 Apr 2024
Guiding Large Language Models to Generate Computer-Parsable Content
Jiaye Wang
26
3
0
08 Apr 2024
Hallucinations or Attention Misdirection? The Path to Strategic Value Extraction in Business Using Large Language Models
Aline Ioste
35
1
0
21 Feb 2024
LLM can Achieve Self-Regulation via Hyperparameter Aware Generation
Siyin Wang
Shimin Li
Tianxiang Sun
Jinlan Fu
Qinyuan Cheng
Jiasheng Ye
Junjie Ye
Xipeng Qiu
Xuanjing Huang
16
4
0
17 Feb 2024
CigaR: Cost-efficient Program Repair with LLMs
Dávid Hidvégi
K. Etemadi
Sofia Bobadilla
Martin Monperrus
20
20
0
09 Feb 2024
The Effect of Sampling Temperature on Problem Solving in Large Language Models
Matthew Renze
Erhan Guven
42
71
0
07 Feb 2024
IoT in the Era of Generative AI: Vision and Challenges
Xin Wang
Zhongwei Wan
Arvin Hekmati
M. Zong
Samiul Alam
Mi Zhang
Bhaskar Krishnamachari
27
15
0
03 Jan 2024
How Many Validation Labels Do You Need? Exploring the Design Space of Label-Efficient Model Ranking
Zhengyu Hu
Jieyu Zhang
Yue Yu
Yuchen Zhuang
Hui Xiong
19
5
0
04 Dec 2023
EcoAssistant: Using LLM Assistant More Affordably and Accurately
Jieyu Zhang
Ranjay Krishna
Ahmed Hassan Awadallah
Chi Wang
30
33
0
03 Oct 2023
AutoML in the Age of Large Language Models: Current Challenges, Future Opportunities and Risks
Alexander Tornede
Difan Deng
Theresa Eimer
Joseph Giovanelli
Aditya Mohan
...
Sarah Segel
Daphne Theodorakopoulos
Tanja Tornede
Henning Wachsmuth
Marius Lindauer
28
22
0
13 Jun 2023
PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization
Yidong Wang
Zhuohao Yu
Zhengran Zeng
Linyi Yang
Cunxiang Wang
...
Jindong Wang
Xingxu Xie
Wei Ye
Shi-Bo Zhang
Yue Zhang
ALM
ELM
48
222
0
08 Jun 2023
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
303
11,881
0
04 Mar 2022
Measuring Coding Challenge Competence With APPS
Dan Hendrycks
Steven Basart
Saurav Kadavath
Mantas Mazeika
Akul Arora
...
Collin Burns
Samir Puranik
Horace He
D. Song
Jacob Steinhardt
ELM
AIMat
ALM
194
623
0
20 May 2021
1