Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2506.09443
Cited By

LLMs Cannot Reliably Judge (Yet?): A Comprehensive Assessment on the Robustness of LLM-as-a-Judge

v1v2 (latest)

LLMs Cannot Reliably Judge (Yet?): A Comprehensive Assessment on the Robustness of LLM-as-a-Judge

11 June 2025

ArXiv (abs)PDF HTML Github

Papers citing "LLMs Cannot Reliably Judge (Yet?): A Comprehensive Assessment on the Robustness of LLM-as-a-Judge"

4 / 4 papers shown

KEO: Knowledge Extraction on OMIn via Knowledge Graphs and RAG for Safety-Critical Aviation Maintenance

KEO: Knowledge Extraction on OMIn via Knowledge Graphs and RAG for Safety-Critical Aviation Maintenance

Jonathan A. Karr Jr.

214

1

0

10 Apr 2026

Unsupervised Evaluation of Multi-Turn Objective-Driven Interactions

Unsupervised Evaluation of Multi-Turn Objective-Driven Interactions

392

0

0

04 Nov 2025

Distractor Injection Attacks on Large Reasoning Models: Characterization and Defense

Distractor Injection Attacks on Large Reasoning Models: Characterization and Defense

Chandan K. Reddy

169

0

0

17 Oct 2025

Knowledge-Graph Based RAG System Evaluation Framework

Knowledge-Graph Based RAG System Evaluation Framework

Vahid Zolfaghari

182

0

0

02 Oct 2025

Page 1 of 1