ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2402.01742
  4. Cited By
Towards Optimizing the Costs of LLM Usage

Towards Optimizing the Costs of LLM Usage

29 January 2024
Shivanshu Shekhar
Tanishq Dubey
Koyel Mukherjee
Apoorv Saxena
Atharv Tyagi
Nishanth Kotla
ArXiv (abs)PDFHTML

Papers citing "Towards Optimizing the Costs of LLM Usage"

16 / 16 papers shown
Catch Me If You Can? Not Yet: LLMs Still Struggle to Imitate the Implicit Writing Styles of Everyday Authors
Catch Me If You Can? Not Yet: LLMs Still Struggle to Imitate the Implicit Writing Styles of Everyday Authors
Zhengxiang Wang
Nafis Irtiza Tripto
Solha Park
Zhenzhen Li
Jiawei Zhou
100
1
0
18 Sep 2025
eMamba: Efficient Acceleration Framework for Mamba Models in Edge Computing
eMamba: Efficient Acceleration Framework for Mamba Models in Edge ComputingACM Transactions on Embedded Computing Systems (ACM TECS), 2025
Jiyong Kim
J. Lee
Jiahao Lin
Alish Kanani
Miao Sun
Ümit Y. Ogras
Jaehyun Park
Mamba
179
2
0
14 Aug 2025
FAA Framework: A Large Language Model-Based Approach for Credit Card Fraud Investigations
FAA Framework: A Large Language Model-Based Approach for Credit Card Fraud Investigations
Shaun Shuster
Eyal Zaloof
A. Shabtai
Rami Puzis
178
0
0
13 Jun 2025
LegalEval-Q: A New Benchmark for The Quality Evaluation of LLM-Generated Legal Text
LegalEval-Q: A New Benchmark for The Quality Evaluation of LLM-Generated Legal Text
Li yunhan
Wu gengshen
AILawELMALM
432
1
0
30 May 2025
RedactOR: An LLM-Powered Framework for Automatic Clinical Data De-Identification
RedactOR: An LLM-Powered Framework for Automatic Clinical Data De-Identification
Praphul Singh
Charlotte Dzialo
Jangwon Kim
Sumana Srivatsa
Irfan Bulu
Sri Gadde
Krishnaram Kenthapadi
157
0
0
23 May 2025
Causal LLM Routing: End-to-End Regret Minimization from Observational Data
Causal LLM Routing: End-to-End Regret Minimization from Observational Data
Asterios Tsiourvas
Wei-Ju Sun
Georgia Perakis
154
4
0
21 May 2025
Transforming Decoder-Only Transformers for Accurate WiFi-Telemetry Based Indoor Localization
Transforming Decoder-Only Transformers for Accurate WiFi-Telemetry Based Indoor Localization
Nayan Sanjay Bhatia
Katia Obraczka
151
1
0
16 May 2025
Optimizing Large Language Models: Metrics, Energy Efficiency, and Case Study Insights
Optimizing Large Language Models: Metrics, Energy Efficiency, and Case Study InsightsConference on Algebraic Informatics (AI), 2025
Tahniat Khan
Soroor Motie
Sedef Akinli Kocak
Shaina Raza
MQ
235
5
0
07 Apr 2025
Think Together and Work Better: Combining Humans' and LLMs' Think-Aloud Outcomes for Effective Text Evaluation
Think Together and Work Better: Combining Humans' and LLMs' Think-Aloud Outcomes for Effective Text EvaluationInternational Conference on Human Factors in Computing Systems (CHI), 2024
SeongYeub Chu
JongWoo Kim
MunYong Yi
354
14
0
21 Feb 2025
Optimizing Model Selection for Compound AI Systems
Optimizing Model Selection for Compound AI Systems
Lingjiao Chen
Jared Quincy Davis
Boris Hanin
Peter Bailis
Matei A. Zaharia
James Zou
Ion Stoica
421
17
0
20 Feb 2025
Software Performance Engineering for Foundation Model-Powered Software
  (FMware)
Software Performance Engineering for Foundation Model-Powered Software (FMware)
Haoxiang Zhang
Shi Chang
Arthur Leung
Kishanthan Thangarajah
Boyuan Chen
Hanan Lutfiyya
Ahmed E. Hassan
575
3
0
14 Nov 2024
Engineering Trustworthy AI: A Developer Guide for Empirical Risk
  Minimization
Engineering Trustworthy AI: A Developer Guide for Empirical Risk MinimizationIEEE Transactions on Artificial Intelligence (IEEE TAI), 2024
Diana Pfau
Alexander Jung
270
1
0
25 Oct 2024
LLMBridge: Reducing Costs to Access LLMs in a Prompt-Centric Internet
LLMBridge: Reducing Costs to Access LLMs in a Prompt-Centric Internet
Noah Martin
Abdullah Bin Faisal
Hiba Eltigani
Rukhshan Haroon
Swaminathan Lamelas
Fahad R. Dogar
1.1K
3
0
04 Oct 2024
Concise Thoughts: Impact of Output Length on LLM Reasoning and Cost
Concise Thoughts: Impact of Output Length on LLM Reasoning and Cost
Sania Nayab
Giulio Rossolini
Giorgio Buttazzo
Nicolamaria Manes
F. Giacomelli
Nicolamaria Manes
Fabrizio Giacomelli
LRM
429
79
0
29 Jul 2024
Foundation Models for Autonomous Robots in Unstructured Environments
Foundation Models for Autonomous Robots in Unstructured Environments
Hossein Naderi
Alireza Shojaei
Lifu Huang
LM&Ro
291
5
0
19 Jul 2024
KG-RAG: Bridging the Gap Between Knowledge and Creativity
KG-RAG: Bridging the Gap Between Knowledge and Creativity
Diego Sanmartin
RALM
275
77
0
20 May 2024
1