2308.04624
Benchmarking LLM powered Chatbots: Methods and Metrics
8 August 2023
D. Banerjee
Pooja Singh
Arjun Avadhanam
Shashank Srivastava
LLMAG
AI4MH
Papers citing
"Benchmarking LLM powered Chatbots: Methods and Metrics"
14 / 14 papers shown
From Prompts to Power: Measuring the Energy Footprint of LLM Inference
Francisco Caravaca
Ángel Cuevas
R. Cuevas
112
0
0
05 Nov 2025
Random-Set Large Language Models
Muhammad Mubashar
Shireen Kudukkil Manchingal
Fabio Cuzzolin
337
2
0
25 Apr 2025
LLM-DaaS: LLM-driven Drone-as-a-Service Operations from Text User Requests
Lillian Wassim
Kamal Mohamed
Ali Hamdi
209
5
0
16 Dec 2024
Is Our Chatbot Telling Lies? Assessing Correctness of an LLM-based Dutch Support Chatbot
Herman Lassche
Michiel Overeem
Ayushi Rastogi
305
0
0
29 Oct 2024
Design of a Quality Management System based on the EU Artificial Intelligence Act
Henryk Mustroph
Stefanie Rinderle-Ma
161
2
0
08 Aug 2024
A Comparison of LLM Finetuning Methods & Evaluation Metrics with Travel Chatbot Use Case
Sonia Meyer
Shreya Singh
Bertha Tam
Christopher Ton
Angel Ren
315
10
0
07 Aug 2024
LEXI: Large Language Models Experimentation Interface
Guy Laban
Tomer Laban
Hatice Gunes
266
9
0
01 Jul 2024
Few-shot Personalization of LLMs with Mis-aligned Responses
Jaehyung Kim
Yiming Yang
390
24
0
26 Jun 2024
Aligning Agents like Large Language Models
Adam Jelley
Yuhan Cao
Dave Bignell
Sam Devlin
Tabish Rashid
LM&Ro
237
1
0
06 Jun 2024
Evaluating Large Language Models for Structured Science Summarization in the Open Research Knowledge Graph
Vladyslav Nechakhin
Jennifer D'Souza
Steffen Eger
238
6
0
03 May 2024
Enhancing Summarization Performance through Transformer-Based Prompt Engineering in Automated Medical Reporting
International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC), 2023
Daphne van Zandvoort
Laura Wiersema
Tom Huibers
S. Dulmen
S. Brinkkemper
LM&MA
MedIm
161
11
0
22 Nov 2023
Bridging Code Semantic and LLMs: Semantic Chain-of-Thought Prompting for Code Generation
Yingwei Ma
Yue Yu
Shanshan Li
Yu Jiang
Yong Guo
Yuanliang Zhang
Yutao Xie
Xiangke Liao
177
13
0
16 Oct 2023
An In-depth Survey of Large Language Model-based Artificial Intelligence Agents
Pengyu Zhao
Zijian Jin
Ning Cheng
LLMAG
193
37
0
23 Sep 2023
A Survey on Large Language Model based Autonomous Agents
Lei Wang
Chengbang Ma
Xueyang Feng
Zeyu Zhang
Hao-ran Yang
...
Xu Chen
Yankai Lin
Wayne Xin Zhao
Zhewei Wei
Ji-Rong Wen
LLMAG
AI4CE
LM&Ro
661
2,132
0
22 Aug 2023