Benchmarking LLM powered Chatbots: Methods and Metrics

8 August 2023
D. Banerjee
Pooja Singh
Arjun Avadhanam
Shashank Srivastava
LLMAG, AI4MH

Papers citing "Benchmarking LLM powered Chatbots: Methods and Metrics"

14 / 14 papers shown
From Prompts to Power: Measuring the Energy Footprint of LLM Inference
Francisco Caravaca
Ángel Cuevas
R. Cuevas
112
0
0
05 Nov 2025
Random-Set Large Language Models
Muhammad Mubashar
Shireen Kudukkil Manchingal
Fabio Cuzzolin
337
2
0
25 Apr 2025
LLM-DaaS: LLM-driven Drone-as-a-Service Operations from Text User Requests
Lillian Wassim
Kamal Mohamed
Ali Hamdi
209
5
0
16 Dec 2024
Is Our Chatbot Telling Lies? Assessing Correctness of an LLM-based Dutch Support Chatbot
Herman Lassche
Michiel Overeem
Ayushi Rastogi
305
0
0
29 Oct 2024
Design of a Quality Management System based on the EU Artificial Intelligence Act
Henryk Mustroph
Stefanie Rinderle-Ma
161
2
0
08 Aug 2024
A Comparison of LLM Finetuning Methods & Evaluation Metrics with Travel Chatbot Use Case
Sonia Meyer
Shreya Singh
Bertha Tam
Christopher Ton
Angel Ren
315
10
0
07 Aug 2024
LEXI: Large Language Models Experimentation Interface
Guy Laban
Tomer Laban
Hatice Gunes
266
9
0
01 Jul 2024
Few-shot Personalization of LLMs with Mis-aligned Responses
Jaehyung Kim
Yiming Yang
390
24
0
26 Jun 2024
Aligning Agents like Large Language Models
Adam Jelley
Yuhan Cao
Dave Bignell
Sam Devlin
Tabish Rashid
LM&Ro
237
1
0
06 Jun 2024
Evaluating Large Language Models for Structured Science Summarization in the Open Research Knowledge Graph
Vladyslav Nechakhin
Jennifer D'Souza
Steffen Eger
238
6
0
03 May 2024
Enhancing Summarization Performance through Transformer-Based Prompt Engineering in Automated Medical Reporting
International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC), 2023
Daphne van Zandvoort
Laura Wiersema
Tom Huibers
S. Dulmen
S. Brinkkemper
LM&MA, MedIm
161
11
0
22 Nov 2023
Bridging Code Semantic and LLMs: Semantic Chain-of-Thought Prompting for Code Generation
Yingwei Ma
Yue Yu
Shanshan Li
Yu Jiang
Yong Guo
Yuanliang Zhang
Yutao Xie
Xiangke Liao
177
13
0
16 Oct 2023
An In-depth Survey of Large Language Model-based Artificial Intelligence Agents
Pengyu Zhao
Zijian Jin
Ning Cheng
LLMAG
193
37
0
23 Sep 2023
A Survey on Large Language Model based Autonomous Agents
Lei Wang
Chengbang Ma
Xueyang Feng
Zeyu Zhang
Hao-ran Yang
...
Xu Chen
Yankai Lin
Wayne Xin Zhao
Zhewei Wei
Ji-Rong Wen
LLMAG, AI4CE, LM&Ro
661
2,132
0
22 Aug 2023