Real-Time Evaluation Models for RAG: Who Detects Hallucinations Best?
arXiv:2503.21157 (v3, latest)

27 March 2025
Ashish Sardana
    HILM, VLM

Papers citing "Real-Time Evaluation Models for RAG: Who Detects Hallucinations Best?"

7 / 7 papers shown
HEDGE: Hallucination Estimation via Dense Geometric Entropy for VQA with Vision-Language Models
Sushant Gautam
Michael A. Riegler
Pål Halvorsen
VLM
201
1
0
16 Nov 2025
WebSeer: Training Deeper Search Agents through Reinforcement Learning with Self-Reflection
Guanzhong He
Zhen Yang
Jinxin Liu
Bin Xu
Lei Hou
Juanzi Li
109
1
0
21 Oct 2025
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Seungone Kim
Juyoung Suk
Shayne Longpre
Bill Yuchen Lin
Jamin Shin
Sean Welleck
Graham Neubig
Moontae Lee
Kyungjae Lee
Minjoon Seo
MoMe, ALM, ELM
375
321
0
02 May 2024
Quantifying Uncertainty in Answers from any Language Model and Enhancing their Trustworthiness
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Jiuhai Chen
Jonas W. Mueller
360
113
0
30 Aug 2023
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
Neural Information Processing Systems (NeurIPS), 2023
Lianmin Zheng
Wei-Lin Chiang
Ying Sheng
Siyuan Zhuang
Zhanghao Wu
...
Dacheng Li
Eric Xing
Haotong Zhang
Joseph E. Gonzalez
Ion Stoica
ALM, OSLM, ELM
3.2K
6,617
0
09 Jun 2023
Exploring the Use of Large Language Models for Reference-Free Text Quality Evaluation: An Empirical Study
International Joint Conference on Natural Language Processing (IJCNLP), 2023
Yi Chen
Rui Wang
Haiyun Jiang
Shuming Shi
Ruifeng Xu
LM&MA
417
115
0
03 Apr 2023
ELI5: Long Form Question Answering
Annual Meeting of the Association for Computational Linguistics (ACL), 2019
Angela Fan
Yacine Jernite
Ethan Perez
David Grangier
Jason Weston
Michael Auli
AI4MH, ELM
436
735
0
22 Jul 2019