2410.12869
Cited By
Language Model Preference Evaluation with Multiple Weak Evaluators
14 October 2024
Zhengyu Hu, Jieyu Zhang, Zhihan Xiong, Alexander Ratner, Hui Xiong, Ranjay Krishna
Papers citing "Language Model Preference Evaluation with Multiple Weak Evaluators"
2 / 2 papers shown
Title
Which Agent Causes Task Failures and When? On Automated Failure Attribution of LLM Multi-Agent Systems
Shaokun Zhang, Ming Yin, Jieyu Zhang, J. H. Liu, Zhiguang Han, ..., Beibin Li, Chi Wang, H. Wang, Y. Chen, Qingyun Wu
47
0
0
30 Apr 2025
Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks
Yixin Cao, Shibo Hong, X. Li, Jiahao Ying, Yubo Ma, ..., Juanzi Li, Aixin Sun, Xuanjing Huang, Tat-Seng Chua, Yu Jiang
ALM
ELM
84
0
0
26 Apr 2025