Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2506.13639
Cited By
An Empirical Study of LLM-as-a-Judge: How Design Choices Impact Evaluation Reliability
16 June 2025
Yusuke Yamauchi
Taro Yano
Masafumi Oyamada
ELM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"An Empirical Study of LLM-as-a-Judge: How Design Choices Impact Evaluation Reliability"
3 / 3 papers shown
The Artificial Intelligence Cognitive Examination: A Survey on the Evolution of Multimodal Evaluation from Recognition to Reasoning
Mayank Ravishankara
Varindra V. Persad Maharaj
ELM
206
1
0
05 Oct 2025
Uncovering Vulnerabilities of LLM-Assisted Cyber Threat Intelligence
Y. Meng
Luoxi Tang
Feiyang Yu
Jinyuan Jia
Guanhua Yan
Ping Yang
Zhaohan Xi
168
0
0
28 Sep 2025
CancerGUIDE: Cancer Guideline Understanding via Internal Disagreement Estimation
Alyssa Unell
Noel C. F. Codella
Sam Preston
Peniel Argaw
Wen-wai Yim
...
Jiachen Li
Shrey Jain
Mu-Hsin Wei
M. Lungren
Hoifung Poon
AI4TS
267
0
0
09 Sep 2025
1
Page 1 of 1