ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2311.09476
  4. Cited By
ARES: An Automated Evaluation Framework for Retrieval-Augmented
  Generation Systems
v1v2 (latest)

ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems

North American Chapter of the Association for Computational Linguistics (NAACL), 2023
16 November 2023
Jon Saad-Falcon
Omar Khattab
Christopher Potts
Matei A. Zaharia
    RALM
ArXiv (abs)PDFHTMLHuggingFace (6 upvotes)

Papers citing "ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems"

25 / 75 papers shown
Rationale-Guided Retrieval Augmented Generation for Medical Question Answering
Rationale-Guided Retrieval Augmented Generation for Medical Question AnsweringNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Jiwoong Sohn
Yein Park
Chanwoong Yoon
Sihyeon Park
Hyeon Hwang
Mujeen Sung
Hyunjae Kim
Jaewoo Kang
RALM
461
30
0
01 Nov 2024
Do RAG Systems Cover What Matters? Evaluating and Optimizing Responses
  with Sub-Question Coverage
Do RAG Systems Cover What Matters? Evaluating and Optimizing Responses with Sub-Question CoverageNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Kaige Xie
Philippe Laban
Prafulla Kumar Choubey
Caiming Xiong
Chien-Sheng Wu
163
3
0
20 Oct 2024
Limits to scalable evaluation at the frontier: LLM as Judge won't beat twice the data
Limits to scalable evaluation at the frontier: LLM as Judge won't beat twice the dataInternational Conference on Learning Representations (ICLR), 2024
Florian E. Dorner
Vivian Y. Nastl
Moritz Hardt
ELMALM
407
22
0
17 Oct 2024
Towards Fair RAG: On the Impact of Fair Ranking in Retrieval-Augmented Generation
Towards Fair RAG: On the Impact of Fair Ranking in Retrieval-Augmented GenerationInternational Conference on the Theory of Information Retrieval (ICTIR), 2024
To Eun Kim
Fernando Diaz
662
11
0
17 Sep 2024
HyPA-RAG: A Hybrid Parameter Adaptive Retrieval-Augmented Generation System for AI Legal and Policy Applications
HyPA-RAG: A Hybrid Parameter Adaptive Retrieval-Augmented Generation System for AI Legal and Policy Applications
Rishi Kalra
Zekun Wu
Ayesha Gulley
Airlie Hilliard
Xin Guan
Adriano Soares Koshiyama
Philip C. Treleaven
RALMAILaw
294
22
0
29 Aug 2024
Can Unconfident LLM Annotations Be Used for Confident Conclusions?
Can Unconfident LLM Annotations Be Used for Confident Conclusions?North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Kristina Gligorić
Tijana Zrnic
Cinoo Lee
Emmanuel J. Candès
Dan Jurafsky
386
26
0
27 Aug 2024
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented
  Generation
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation
Daniel Fleischer
Moshe Berchansky
Moshe Wasserblat
Peter Izsak
3DV
293
8
0
05 Aug 2024
RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework
RAGEval: Scenario Specific RAG Evaluation Dataset Generation FrameworkAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Kunlun Zhu
Yifan Luo
Dingling Xu
Ruobing Wang
Shi Yu
...
Yishan Li
Zhiyuan Liu
Xu Han
Zhiyuan Liu
Maosong Sun
669
38
0
02 Aug 2024
Retrieval-Augmented Generation for Natural Language Processing: A Survey
Retrieval-Augmented Generation for Natural Language Processing: A Survey
Shangyu Wu
Ying Xiong
Yufei Cui
Haolun Wu
Can Chen
...
Lianming Huang
Xue Liu
Tei-Wei Kuo
Nan Guan
Chun Jason Xue
3DVRALM
455
97
0
18 Jul 2024
Evaluation of RAG Metrics for Question Answering in the Telecom Domain
Evaluation of RAG Metrics for Question Answering in the Telecom Domain
Sujoy Roychowdhury
Sumit Soman
H. G. Ranjani
Neeraj Gunda
Vansh Chhabra
Sai Krishna Bala
289
30
0
15 Jul 2024
Grounding and Evaluation for Large Language Models: Practical Challenges
  and Lessons Learned (Survey)
Grounding and Evaluation for Large Language Models: Practical Challenges and Lessons Learned (Survey)
K. Kenthapadi
M. Sameki
Ankur Taly
HILMELMAILaw
219
33
0
10 Jul 2024
BERGEN: A Benchmarking Library for Retrieval-Augmented Generation
BERGEN: A Benchmarking Library for Retrieval-Augmented Generation
David Rau
Hervé Déjean
Nadezhda Chirkova
Thibault Formal
Shuai Wang
Vassilina Nikoulina
Stéphane Clinchant
304
23
0
01 Jul 2024
When Search Engine Services meet Large Language Models: Visions and
  Challenges
When Search Engine Services meet Large Language Models: Visions and Challenges
Haoyi Xiong
Jiang Bian
Yuchen Li
Xuhong Li
Jundong Li
Shuaiqiang Wang
D. Yin
Sumi Helal
353
79
0
28 Jun 2024
Evaluating Quality of Answers for Retrieval-Augmented Generation: A
  Strong LLM Is All You Need
Evaluating Quality of Answers for Retrieval-Augmented Generation: A Strong LLM Is All You Need
Yang Wang
Alberto Garcia Hernandez
Roman Kyslyi
Nicholas S. Kersting
316
7
0
26 Jun 2024
The Challenges of Evaluating LLM Applications: An Analysis of Automated,
  Human, and LLM-Based Approaches
The Challenges of Evaluating LLM Applications: An Analysis of Automated, Human, and LLM-Based Approaches
Bhashithe Abeysinghe
Ruhan Circi
ELM
289
39
0
05 Jun 2024
Luna: An Evaluation Foundation Model to Catch Language Model
  Hallucinations with High Accuracy and Low Cost
Luna: An Evaluation Foundation Model to Catch Language Model Hallucinations with High Accuracy and Low Cost
Masha Belyi
Robert Friel
Shuai Shao
Atindriyo Sanyal
HILMRALM
443
8
0
03 Jun 2024
Evaluation of Retrieval-Augmented Generation: A Survey
Evaluation of Retrieval-Augmented Generation: A Survey
Hao Yu
Aoran Gan
Kai Zhang
Shiwei Tong
Qi Liu
Zhaofeng Liu
3DV
393
195
0
13 May 2024
RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language Processing
RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language Processing
Yucheng Hu
Yuxing Lu
RALM
402
31
0
30 Apr 2024
Unlocking Multi-View Insights in Knowledge-Dense Retrieval-Augmented Generation
Unlocking Multi-View Insights in Knowledge-Dense Retrieval-Augmented GenerationIEEE Transactions on Audio, Speech, and Language Processing (IEEE TASLP), 2024
Guanhua Chen
Wenhan Yu
Xiao Lu
Xiao Zhang
Erli Meng
Lei Sha
3DV
211
1
0
19 Apr 2024
A Survey on Retrieval-Augmented Text Generation for Large Language
  Models
A Survey on Retrieval-Augmented Text Generation for Large Language Models
Yizheng Huang
Jimmy X. Huang
3DVRALM
318
91
0
17 Apr 2024
ClashEval: Quantifying the tug-of-war between an LLM's internal prior and external evidence
ClashEval: Quantifying the tug-of-war between an LLM's internal prior and external evidence
Kevin Wu
Eric Wu
James Zou
AAML
638
77
0
16 Apr 2024
AutoEval Done Right: Using Synthetic Data for Model Evaluation
AutoEval Done Right: Using Synthetic Data for Model Evaluation
Pierre Boyeau
Anastasios Nikolas Angelopoulos
N. Yosef
Jitendra Malik
Michael I. Jordan
SyDa
350
31
0
09 Mar 2024
Prediction-Powered Ranking of Large Language Models
Prediction-Powered Ranking of Large Language Models
Ivi Chatzi
Eleni Straitouri
Suhas Thejaswi
Manuel Gomez Rodriguez
ALM
453
13
0
27 Feb 2024
CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented
  Generation of Large Language Models
CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models
Yuanjie Lyu
Zhiyu Li
Pengnian Qi
Feiyu Xiong
Shichao Song
Wenjin Wang
Huayi Lai
Huan Liu
Tong Xu
Enhong Chen
RALM
300
75
0
30 Jan 2024
Billion-scale similarity search with GPUs
Billion-scale similarity search with GPUsIEEE Transactions on Big Data (TBD), 2017
Jeff Johnson
Matthijs Douze
Edouard Grave
970
4,531
0
28 Feb 2017
Previous
12