2409.04168
From Calculation to Adjudication: Examining LLM judges on Mathematical Reasoning Tasks
6 September 2024
Andreas Stephan
D. Zhu
Matthias Aßenmacher
Xiaoyu Shen
Benjamin Roth
ELM
Papers citing
"From Calculation to Adjudication: Examining LLM judges on Mathematical Reasoning Tasks"
3 / 3 papers shown
Title
Brains vs. Bytes: Evaluating LLM Proficiency in Olympiad Mathematics
Hamed Mahdavi
Alireza Hashemi
Majid Daliri
Pegah Mohammadipour
Alireza Farhadi
Samira Malek
Yekta Yazdanifard
Amir Khasahmadi
V. Honavar
ELM
LRM
38
1
0
01 Apr 2025
Improving Preference Extraction In LLMs By Identifying Latent Knowledge Through Classifying Probes
Sharan Maiya
Yinhong Liu
Ramit Debnath
Anna Korhonen
25
0
0
22 Mar 2025
UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation
Qihui Zhang
Munan Ning
Zheyuan Liu
Yanbo Wang
Jiayi Ye
Yue Huang
Shuo Yang
Xiao Chen
Y. Song
Li Yuan
LRM
56
0
0
19 Mar 2025