Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2406.14783
Cited By
Evaluating RAG-Fusion with RAGElo: an Automated Elo-based Framework
20 June 2024
Zackary Rackauckas
Arthur Camara
Jakub Zavrel
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Evaluating RAG-Fusion with RAGElo: an Automated Elo-based Framework"
5 / 5 papers shown
Title
Can LLMs Be Trusted for Evaluating RAG Systems? A Survey of Methods and Datasets
Lorenz Brehme
Thomas Ströhle
Ruth Breu
59
0
0
28 Apr 2025
Varco Arena: A Tournament Approach to Reference-Free Benchmarking Large Language Models
Seonil Son
Ju-Min Oh
Heegon Jin
Cheolhun Jang
Jeongbeom Jeong
Kuntae Kim
44
0
0
20 Feb 2025
From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge
Dawei Li
Bohan Jiang
Liangjie Huang
Alimohammad Beigi
Chengshuai Zhao
...
Canyu Chen
Tianhao Wu
Kai Shu
Lu Cheng
Huan Liu
ELM
AILaw
108
63
0
25 Nov 2024
MIRAGE-Bench: Automatic Multilingual Benchmark Arena for Retrieval-Augmented Generation Systems
Nandan Thakur
Suleman Kazi
Ge Luo
Jimmy J. Lin
Amin Ahmad
VLM
RALM
26
6
0
17 Oct 2024
CREAM: Comparison-Based Reference-Free ELO-Ranked Automatic Evaluation for Meeting Summarization
Ziwei Gong
Lin Ai
Harshsaiprasad Deshpande
Alexander Johnson
Emmy Phung
Zehui Wu
Ahmad Emami
Julia Hirschberg
36
2
0
17 Sep 2024
1