ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2406.09972
  4. Cited By
A Better LLM Evaluator for Text Generation: The Impact of Prompt Output
  Sequencing and Optimization

A Better LLM Evaluator for Text Generation: The Impact of Prompt Output Sequencing and Optimization

14 June 2024
Kuanchao Chu
Yi-Pei Chen
Hideki Nakayama
ArXivPDFHTML

Papers citing "A Better LLM Evaluator for Text Generation: The Impact of Prompt Output Sequencing and Optimization"

8 / 8 papers shown
Title
Decision Information Meets Large Language Models: The Future of Explainable Operations Research
Decision Information Meets Large Language Models: The Future of Explainable Operations Research
Yansen Zhang
Qingcan Kang
Wing-Yin Yu
Hailei Gong
Xiaojin Fu
Xiongwei Han
Tao Zhong
Chen Ma
OffRL
37
1
0
14 Feb 2025
A review of faithfulness metrics for hallucination assessment in Large Language Models
Ben Malin
Tatiana Kalganova
Nikoloas Boulgouris
HILM
59
2
0
03 Jan 2025
Difficult Task Yes but Simple Task No: Unveiling the Laziness in
  Multimodal LLMs
Difficult Task Yes but Simple Task No: Unveiling the Laziness in Multimodal LLMs
Sihang Zhao
Youliang Yuan
Xiaoying Tang
Pinjia He
24
2
0
15 Oct 2024
PersoBench: Benchmarking Personalized Response Generation in Large
  Language Models
PersoBench: Benchmarking Personalized Response Generation in Large Language Models
Saleh Afzoon
Usman Naseem
Amin Beheshti
Zahra Jamali
26
1
0
04 Oct 2024
DocKD: Knowledge Distillation from LLMs for Open-World Document
  Understanding Models
DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models
Sungnyun Kim
Haofu Liao
Srikar Appalaraju
Peng Tang
Zhuowen Tu
R. Satzoda
R. Manmatha
Vijay Mahadevan
Stefano Soatto
34
0
0
04 Oct 2024
MemSim: A Bayesian Simulator for Evaluating Memory of LLM-based Personal
  Assistants
MemSim: A Bayesian Simulator for Evaluating Memory of LLM-based Personal Assistants
Zeyu Zhang
Quanyu Dai
Luyu Chen
Zeren Jiang
Rui Li
Jieming Zhu
Xu Chen
Yi Xie
Zhenhua Dong
Ji-Rong Wen
LLMAG
23
4
0
30 Sep 2024
SAGED: A Holistic Bias-Benchmarking Pipeline for Language Models with
  Customisable Fairness Calibration
SAGED: A Holistic Bias-Benchmarking Pipeline for Language Models with Customisable Fairness Calibration
Xin Guan
Nathaniel Demchak
Saloni Gupta
Ze Wang
Ediz Ertekin Jr.
Adriano Soares Koshiyama
Emre Kazim
Zekun Wu
32
2
0
17 Sep 2024
Generative Agents: Interactive Simulacra of Human Behavior
Generative Agents: Interactive Simulacra of Human Behavior
J. Park
Joseph C. O'Brien
Carrie J. Cai
Meredith Ringel Morris
Percy Liang
Michael S. Bernstein
LM&Ro
AI4CE
215
1,701
0
07 Apr 2023
1