2406.09972
Cited By
A Better LLM Evaluator for Text Generation: The Impact of Prompt Output Sequencing and Optimization
14 June 2024
Kuanchao Chu
Yi-Pei Chen
Hideki Nakayama
Papers citing
"A Better LLM Evaluator for Text Generation: The Impact of Prompt Output Sequencing and Optimization"
8 / 8 papers shown
Decision Information Meets Large Language Models: The Future of Explainable Operations Research
Yansen Zhang
Qingcan Kang
Wing-Yin Yu
Hailei Gong
Xiaojin Fu
Xiongwei Han
Tao Zhong
Chen Ma
OffRL
37
1
0
14 Feb 2025
A review of faithfulness metrics for hallucination assessment in Large Language Models
Ben Malin
Tatiana Kalganova
Nikolaos Boulgouris
HILM
59
2
0
03 Jan 2025
Difficult Task Yes but Simple Task No: Unveiling the Laziness in Multimodal LLMs
Sihang Zhao
Youliang Yuan
Xiaoying Tang
Pinjia He
24
2
0
15 Oct 2024
PersoBench: Benchmarking Personalized Response Generation in Large Language Models
Saleh Afzoon
Usman Naseem
Amin Beheshti
Zahra Jamali
26
1
0
04 Oct 2024
DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models
Sungnyun Kim
Haofu Liao
Srikar Appalaraju
Peng Tang
Zhuowen Tu
R. Satzoda
R. Manmatha
Vijay Mahadevan
Stefano Soatto
34
0
0
04 Oct 2024
MemSim: A Bayesian Simulator for Evaluating Memory of LLM-based Personal Assistants
Zeyu Zhang
Quanyu Dai
Luyu Chen
Zeren Jiang
Rui Li
Jieming Zhu
Xu Chen
Yi Xie
Zhenhua Dong
Ji-Rong Wen
LLMAG
23
4
0
30 Sep 2024
SAGED: A Holistic Bias-Benchmarking Pipeline for Language Models with Customisable Fairness Calibration
Xin Guan
Nathaniel Demchak
Saloni Gupta
Ze Wang
Ediz Ertekin Jr.
Adriano Soares Koshiyama
Emre Kazim
Zekun Wu
32
2
0
17 Sep 2024
Generative Agents: Interactive Simulacra of Human Behavior
J. Park
Joseph C. O'Brien
Carrie J. Cai
Meredith Ringel Morris
Percy Liang
Michael S. Bernstein
LM&Ro
AI4CE
215
1,701
0
07 Apr 2023