ResearchTrend.AI
An Empirical Study of LLM-as-a-Judge: How Design Choices Impact Evaluation Reliability

16 June 2025
Yusuke Yamauchi
Taro Yano
Masafumi Oyamada
    ELM

Papers citing "An Empirical Study of LLM-as-a-Judge: How Design Choices Impact Evaluation Reliability"

3 / 3 papers shown
The Artificial Intelligence Cognitive Examination: A Survey on the Evolution of Multimodal Evaluation from Recognition to Reasoning
Mayank Ravishankara
Varindra V. Persad Maharaj
ELM
05 Oct 2025
Uncovering Vulnerabilities of LLM-Assisted Cyber Threat Intelligence
Y. Meng
Luoxi Tang
Feiyang Yu
Jinyuan Jia
Guanhua Yan
Ping Yang
Zhaohan Xi
28 Sep 2025
CancerGUIDE: Cancer Guideline Understanding via Internal Disagreement Estimation
Alyssa Unell
Noel C. F. Codella
Sam Preston
Peniel Argaw
Wen-wai Yim
...
Jiachen Li
Shrey Jain
Mu-Hsin Wei
M. Lungren
Hoifung Poon
AI4TS
09 Sep 2025