ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2309.15217
  4. Cited By
Ragas: Automated Evaluation of Retrieval Augmented Generation

Ragas: Automated Evaluation of Retrieval Augmented Generation

26 September 2023
ES Shahul
Jithin James
Luis Espinosa-Anke
Steven Schockaert
ArXivPDFHTML

Papers citing "Ragas: Automated Evaluation of Retrieval Augmented Generation"

44 / 94 papers shown
Title
HeCiX: Integrating Knowledge Graphs and Large Language Models for
  Biomedical Research
HeCiX: Integrating Knowledge Graphs and Large Language Models for Biomedical Research
Prerana Sanjay Kulkarni
Muskaan Jain
Disha Sheshanarayana
Srinivasan Parthiban
24
0
0
19 Jul 2024
StuGPTViz: A Visual Analytics Approach to Understand Student-ChatGPT
  Interactions
StuGPTViz: A Visual Analytics Approach to Understand Student-ChatGPT Interactions
Zixin Chen
Jiachen Wang
Meng Xia
Kento Shigyo
Dingdong Liu
Rong Zhang
Huamin Qu
34
1
0
17 Jul 2024
Evaluation of RAG Metrics for Question Answering in the Telecom Domain
Evaluation of RAG Metrics for Question Answering in the Telecom Domain
Sujoy Roychowdhury
Sumit Soman
H. G. Ranjani
Neeraj Gunda
Vansh Chhabra
Sai Krishna Bala
28
12
0
15 Jul 2024
Lynx: An Open Source Hallucination Evaluation Model
Lynx: An Open Source Hallucination Evaluation Model
Selvan Sunitha Ravi
B. Mielczarek
Anand Kannappan
Douwe Kiela
Rebecca Qian
VLM
RALM
HILM
25
15
0
11 Jul 2024
FACTS About Building Retrieval Augmented Generation-based Chatbots
FACTS About Building Retrieval Augmented Generation-based Chatbots
Rama Akkiraju
Anbang Xu
Deepak Bora
Tan Yu
Lu An
...
Nave Algarici
Jacob Liberman
Joey Conway
Sonu Nayyar
Justin Boitano
39
6
0
10 Jul 2024
Towards Optimizing and Evaluating a Retrieval Augmented QA Chatbot using
  LLMs with Human in the Loop
Towards Optimizing and Evaluating a Retrieval Augmented QA Chatbot using LLMs with Human in the Loop
Anum Afzal
Alexander Kowsik
Rajna Fani
Florian Matthes
31
6
0
08 Jul 2024
Searching for Best Practices in Retrieval-Augmented Generation
Searching for Best Practices in Retrieval-Augmented Generation
Xiaohua Wang
Zhenghua Wang
Xuan Gao
Feiran Zhang
Yixin Wu
...
Qi Qian
Ruicheng Yin
Changze Lv
Xiaoqing Zheng
Xuanjing Huang
27
1
0
01 Jul 2024
Evaluating Quality of Answers for Retrieval-Augmented Generation: A
  Strong LLM Is All You Need
Evaluating Quality of Answers for Retrieval-Augmented Generation: A Strong LLM Is All You Need
Yang Wang
Alberto Garcia Hernandez
Roman Kyslyi
Nicholas S. Kersting
26
3
0
26 Jun 2024
Enhancing Commentary Strategies for Imperfect Information Card Games: A Study of Large Language Models in Guandan Commentary
Enhancing Commentary Strategies for Imperfect Information Card Games: A Study of Large Language Models in Guandan Commentary
Meiling Tao
Xuechen Liang
Ziyi Wang
Yiling Tao
Yiling Tao
Jianhui Wang
Sun Li Tianyu Shi
32
1
0
23 Jun 2024
NLP-KG: A System for Exploratory Search of Scientific Literature in
  Natural Language Processing
NLP-KG: A System for Exploratory Search of Scientific Literature in Natural Language Processing
Tim Schopf
Florian Matthes
36
0
0
21 Jun 2024
Multi-Head RAG: Solving Multi-Aspect Problems with LLMs
Multi-Head RAG: Solving Multi-Aspect Problems with LLMs
Maciej Besta
Aleš Kubíček
Roman Niggli
Robert Gerstenberger
Lucas Weitzendorf
...
Jürgen Müller
H. Niewiadomski
Marcin Chrapek
Michał Podstawski
Torsten Hoefler
24
1
0
07 Jun 2024
Luna: An Evaluation Foundation Model to Catch Language Model
  Hallucinations with High Accuracy and Low Cost
Luna: An Evaluation Foundation Model to Catch Language Model Hallucinations with High Accuracy and Low Cost
Masha Belyi
Robert Friel
Shuai Shao
Atindriyo Sanyal
HILM
RALM
45
5
0
03 Jun 2024
CMDBench: A Benchmark for Coarse-to-fine Multimodal Data Discovery in
  Compound AI Systems
CMDBench: A Benchmark for Coarse-to-fine Multimodal Data Discovery in Compound AI Systems
Yanlin Feng
Sajjadur Rahman
Aaron Feng
Vincent Chen
Eser Kandogan
28
4
0
02 Jun 2024
Automated Evaluation of Retrieval-Augmented Language Models with
  Task-Specific Exam Generation
Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation
Gauthier Guinet
Behrooz Omidvar-Tehrani
Anoop Deoras
Laurent Callot
RALM
46
14
0
22 May 2024
From Questions to Insightful Answers: Building an Informed Chatbot for
  University Resources
From Questions to Insightful Answers: Building an Informed Chatbot for University Resources
Subash Neupane
Elias Hossain
Jason Keith
Himanshu Tripathi
Farbod Ghiasi
Noorbakhsh Amiri Golilarz
Amin Amirlatifi
Sudip Mittal
Shahram Rahimi
27
7
0
13 May 2024
Evaluation of Retrieval-Augmented Generation: A Survey
Evaluation of Retrieval-Augmented Generation: A Survey
Hao Yu
Aoran Gan
Kai Zhang
Shiwei Tong
Qi Liu
Zhaofeng Liu
3DV
36
1
0
13 May 2024
Automatic Generation of Model and Data Cards: A Step Towards Responsible
  AI
Automatic Generation of Model and Data Cards: A Step Towards Responsible AI
Jiarui Liu
Wenkai Li
Zhijing Jin
Mona T. Diab
SyDa
36
3
0
10 May 2024
PrivComp-KG : Leveraging Knowledge Graph and Large Language Models for
  Privacy Policy Compliance Verification
PrivComp-KG : Leveraging Knowledge Graph and Large Language Models for Privacy Policy Compliance Verification
Leon Garza
Lavanya Elluri
Anantaa Kotal
Aritran Piplai
Deepti Gupta
Anupam Joshi
19
2
0
30 Apr 2024
RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural
  Language Processing
RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language Processing
Yucheng Hu
Yuxing Lu
RALM
42
14
0
30 Apr 2024
GRAMMAR: Grounded and Modular Methodology for Assessment of
  Closed-Domain Retrieval-Augmented Language Model
GRAMMAR: Grounded and Modular Methodology for Assessment of Closed-Domain Retrieval-Augmented Language Model
Xinzhe Li
Ming Liu
Shang Gao
RALM
24
0
0
30 Apr 2024
InspectorRAGet: An Introspection Platform for RAG Evaluation
InspectorRAGet: An Introspection Platform for RAG Evaluation
Kshitij P. Fadnis
Siva Sankalp Patel
O. Boni
Yannis Katsis
Sara Rosenthal
Benjamin Sznajder
Marina Danilevsky
22
1
0
26 Apr 2024
Unlocking Multi-View Insights in Knowledge-Dense Retrieval-Augmented
  Generation
Unlocking Multi-View Insights in Knowledge-Dense Retrieval-Augmented Generation
Guanhua Chen
Wenhan Yu
Lei Sha
3DV
24
0
0
19 Apr 2024
ClashEval: Quantifying the tug-of-war between an LLM's internal prior and external evidence
ClashEval: Quantifying the tug-of-war between an LLM's internal prior and external evidence
Kevin Wu
Eric Wu
James Y. Zou
AAML
31
38
0
16 Apr 2024
CLAPNQ: Cohesive Long-form Answers from Passages in Natural Questions
  for RAG systems
CLAPNQ: Cohesive Long-form Answers from Passages in Natural Questions for RAG systems
Sara Rosenthal
Avirup Sil
Radu Florian
Salim Roukos
28
9
0
02 Apr 2024
Observations on Building RAG Systems for Technical Documents
Observations on Building RAG Systems for Technical Documents
Sumit Soman
Sujoy Roychowdhury
3DGS
25
6
0
31 Mar 2024
MedInsight: A Multi-Source Context Augmentation Framework for Generating
  Patient-Centric Medical Responses using Large Language Models
MedInsight: A Multi-Source Context Augmentation Framework for Generating Patient-Centric Medical Responses using Large Language Models
Subash Neupane
Shaswata Mitra
Sudip Mittal
Noorbakhsh Amiri Golilarz
Shahram Rahimi
Amin Amirlatifi
35
3
0
13 Mar 2024
FaaF: Facts as a Function for the evaluation of generated text
FaaF: Facts as a Function for the evaluation of generated text
Vasileios Katranidis
Gabor Barany
HILM
RALM
23
4
0
06 Mar 2024
To Generate or to Retrieve? On the Effectiveness of Artificial Contexts
  for Medical Open-Domain Question Answering
To Generate or to Retrieve? On the Effectiveness of Artificial Contexts for Medical Open-Domain Question Answering
Giacomo Frisoni
Alessio Cocchieri
Alex Presepi
Gianluca Moro
Zaiqiao Meng
RALM
MedIm
27
13
0
04 Mar 2024
Retrieval-Augmented Generation for AI-Generated Content: A Survey
Retrieval-Augmented Generation for AI-Generated Content: A Survey
Penghao Zhao
Hailin Zhang
Qinhan Yu
Zhengren Wang
Yunteng Geng
Fangcheng Fu
Ling Yang
Wentao Zhang
Jie Jiang
Bin Cui
3DV
66
182
0
29 Feb 2024
Retrieval Augmented Generation Systems: Automatic Dataset Creation,
  Evaluation and Boolean Agent Setup
Retrieval Augmented Generation Systems: Automatic Dataset Creation, Evaluation and Boolean Agent Setup
Tristan Kenneweg
Philip Kenneweg
Barbara Hammer
3DV
21
1
0
26 Feb 2024
RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for
  Short-form Open-Domain Question Answering
RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering
Zihan Zhang
Meng Fang
Ling-Hao Chen
RALM
32
11
0
26 Feb 2024
MoRAL: MoE Augmented LoRA for LLMs' Lifelong Learning
MoRAL: MoE Augmented LoRA for LLMs' Lifelong Learning
Shu Yang
Muhammad Asif Ali
Cheng-Long Wang
Lijie Hu
Di Wang
CLL
MoE
16
1
0
17 Feb 2024
LLM-based NLG Evaluation: Current Status and Challenges
LLM-based NLG Evaluation: Current Status and Challenges
Mingqi Gao
Xinyu Hu
Jie Ruan
Xiao Pu
Xiaojun Wan
ELM
LM&MA
19
1
0
02 Feb 2024
HiQA: A Hierarchical Contextual Augmentation RAG for Massive Documents
  QA
HiQA: A Hierarchical Contextual Augmentation RAG for Massive Documents QA
Xinyue Chen
Pengyu Gao
Jiangjiang Song
Xiaoyang Tan
34
4
0
01 Feb 2024
RAG-Fusion: a New Take on Retrieval-Augmented Generation
RAG-Fusion: a New Take on Retrieval-Augmented Generation
Zackary Rackauckas
16
33
0
31 Jan 2024
MultiHop-RAG: Benchmarking Retrieval-Augmented Generation for Multi-Hop
  Queries
MultiHop-RAG: Benchmarking Retrieval-Augmented Generation for Multi-Hop Queries
Yixuan Tang
Yi Yang
RALM
23
1
0
27 Jan 2024
From RAG to QA-RAG: Integrating Generative AI for Pharmaceutical
  Regulatory Compliance Process
From RAG to QA-RAG: Integrating Generative AI for Pharmaceutical Regulatory Compliance Process
Jaewoong Kim
Moohong Min
23
11
0
26 Jan 2024
LOCALINTEL: Generating Organizational Threat Intelligence from Global and Local Cyber Knowledge
LOCALINTEL: Generating Organizational Threat Intelligence from Global and Local Cyber Knowledge
Shaswata Mitra
Subash Neupane
Trisha Chakraborty
Sudip Mittal
Aritran Piplai
Manas Gaur
Shahram Rahimi
16
20
0
18 Jan 2024
Opening A Pandora's Box: Things You Should Know in the Era of Custom
  GPTs
Opening A Pandora's Box: Things You Should Know in the Era of Custom GPTs
Guanhong Tao
Shuyang Cheng
Zhuo Zhang
Junmin Zhu
Guangyu Shen
Xiangyu Zhang
SILM
20
10
0
31 Dec 2023
Retrieval-Augmented Generation for Large Language Models: A Survey
Retrieval-Augmented Generation for Large Language Models: A Survey
Yunfan Gao
Yun Xiong
Xinyu Gao
Kangxiang Jia
Jinliu Pan
Yuxi Bi
Yi Dai
Jiawei Sun
Meng Wang
Haofen Wang
3DV
RALM
27
1,364
1
18 Dec 2023
PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning
PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning
Zhihan Zhang
Dong-Ho Lee
Yuwei Fang
W. Yu
Mengzhao Jia
Meng-Long Jiang
Francesco Barbieri
ALM
25
26
0
15 Nov 2023
The Internal State of an LLM Knows When It's Lying
The Internal State of an LLM Knows When It's Lying
A. Azaria
Tom Michael Mitchell
HILM
186
192
0
26 Apr 2023
Sparks of Artificial General Intelligence: Early experiments with GPT-4
Sparks of Artificial General Intelligence: Early experiments with GPT-4
Sébastien Bubeck
Varun Chandrasekaran
Ronen Eldan
J. Gehrke
Eric Horvitz
...
Scott M. Lundberg
Harsha Nori
Hamid Palangi
Marco Tulio Ribeiro
Yi Zhang
ELM
AI4MH
AI4CE
ALM
187
2,232
0
22 Mar 2023
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for
  Generative Large Language Models
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
Potsawee Manakul
Adian Liusie
Mark J. F. Gales
HILM
LRM
126
217
0
15 Mar 2023
Previous
12