Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2506.13639
Cited By

An Empirical Study of LLM-as-a-Judge: How Design Choices Impact Evaluation Reliability

An Empirical Study of LLM-as-a-Judge: How Design Choices Impact Evaluation Reliability

16 June 2025

Yusuke Yamauchi

Masafumi Oyamada

ArXiv (abs)PDF HTML

Papers citing "An Empirical Study of LLM-as-a-Judge: How Design Choices Impact Evaluation Reliability"

3 / 3 papers shown

The Artificial Intelligence Cognitive Examination: A Survey on the Evolution of Multimodal Evaluation from Recognition to Reasoning

The Artificial Intelligence Cognitive Examination: A Survey on the Evolution of Multimodal Evaluation from Recognition to Reasoning

Mayank Ravishankara

Varindra V. Persad Maharaj

206

1

0

05 Oct 2025

Uncovering Vulnerabilities of LLM-Assisted Cyber Threat Intelligence

Uncovering Vulnerabilities of LLM-Assisted Cyber Threat Intelligence

168

0

0

28 Sep 2025

CancerGUIDE: Cancer Guideline Understanding via Internal Disagreement Estimation

CancerGUIDE: Cancer Guideline Understanding via Internal Disagreement Estimation

Noel C. F. Codella

...

267

0

0

09 Sep 2025

Page 1 of 1