Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2309.15217
Cited By
Ragas: Automated Evaluation of Retrieval Augmented Generation
26 September 2023
ES Shahul
Jithin James
Luis Espinosa-Anke
Steven Schockaert
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Ragas: Automated Evaluation of Retrieval Augmented Generation"
44 / 94 papers shown
Title
HeCiX: Integrating Knowledge Graphs and Large Language Models for Biomedical Research
Prerana Sanjay Kulkarni
Muskaan Jain
Disha Sheshanarayana
Srinivasan Parthiban
24
0
0
19 Jul 2024
StuGPTViz: A Visual Analytics Approach to Understand Student-ChatGPT Interactions
Zixin Chen
Jiachen Wang
Meng Xia
Kento Shigyo
Dingdong Liu
Rong Zhang
Huamin Qu
34
1
0
17 Jul 2024
Evaluation of RAG Metrics for Question Answering in the Telecom Domain
Sujoy Roychowdhury
Sumit Soman
H. G. Ranjani
Neeraj Gunda
Vansh Chhabra
Sai Krishna Bala
28
12
0
15 Jul 2024
Lynx: An Open Source Hallucination Evaluation Model
Selvan Sunitha Ravi
B. Mielczarek
Anand Kannappan
Douwe Kiela
Rebecca Qian
VLM
RALM
HILM
25
15
0
11 Jul 2024
FACTS About Building Retrieval Augmented Generation-based Chatbots
Rama Akkiraju
Anbang Xu
Deepak Bora
Tan Yu
Lu An
...
Nave Algarici
Jacob Liberman
Joey Conway
Sonu Nayyar
Justin Boitano
39
6
0
10 Jul 2024
Towards Optimizing and Evaluating a Retrieval Augmented QA Chatbot using LLMs with Human in the Loop
Anum Afzal
Alexander Kowsik
Rajna Fani
Florian Matthes
31
6
0
08 Jul 2024
Searching for Best Practices in Retrieval-Augmented Generation
Xiaohua Wang
Zhenghua Wang
Xuan Gao
Feiran Zhang
Yixin Wu
...
Qi Qian
Ruicheng Yin
Changze Lv
Xiaoqing Zheng
Xuanjing Huang
27
1
0
01 Jul 2024
Evaluating Quality of Answers for Retrieval-Augmented Generation: A Strong LLM Is All You Need
Yang Wang
Alberto Garcia Hernandez
Roman Kyslyi
Nicholas S. Kersting
26
3
0
26 Jun 2024
Enhancing Commentary Strategies for Imperfect Information Card Games: A Study of Large Language Models in Guandan Commentary
Meiling Tao
Xuechen Liang
Ziyi Wang
Yiling Tao
Yiling Tao
Jianhui Wang
Sun Li Tianyu Shi
32
1
0
23 Jun 2024
NLP-KG: A System for Exploratory Search of Scientific Literature in Natural Language Processing
Tim Schopf
Florian Matthes
36
0
0
21 Jun 2024
Multi-Head RAG: Solving Multi-Aspect Problems with LLMs
Maciej Besta
Aleš Kubíček
Roman Niggli
Robert Gerstenberger
Lucas Weitzendorf
...
Jürgen Müller
H. Niewiadomski
Marcin Chrapek
Michał Podstawski
Torsten Hoefler
24
1
0
07 Jun 2024
Luna: An Evaluation Foundation Model to Catch Language Model Hallucinations with High Accuracy and Low Cost
Masha Belyi
Robert Friel
Shuai Shao
Atindriyo Sanyal
HILM
RALM
45
5
0
03 Jun 2024
CMDBench: A Benchmark for Coarse-to-fine Multimodal Data Discovery in Compound AI Systems
Yanlin Feng
Sajjadur Rahman
Aaron Feng
Vincent Chen
Eser Kandogan
28
4
0
02 Jun 2024
Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation
Gauthier Guinet
Behrooz Omidvar-Tehrani
Anoop Deoras
Laurent Callot
RALM
46
14
0
22 May 2024
From Questions to Insightful Answers: Building an Informed Chatbot for University Resources
Subash Neupane
Elias Hossain
Jason Keith
Himanshu Tripathi
Farbod Ghiasi
Noorbakhsh Amiri Golilarz
Amin Amirlatifi
Sudip Mittal
Shahram Rahimi
27
7
0
13 May 2024
Evaluation of Retrieval-Augmented Generation: A Survey
Hao Yu
Aoran Gan
Kai Zhang
Shiwei Tong
Qi Liu
Zhaofeng Liu
3DV
36
1
0
13 May 2024
Automatic Generation of Model and Data Cards: A Step Towards Responsible AI
Jiarui Liu
Wenkai Li
Zhijing Jin
Mona T. Diab
SyDa
36
3
0
10 May 2024
PrivComp-KG : Leveraging Knowledge Graph and Large Language Models for Privacy Policy Compliance Verification
Leon Garza
Lavanya Elluri
Anantaa Kotal
Aritran Piplai
Deepti Gupta
Anupam Joshi
19
2
0
30 Apr 2024
RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language Processing
Yucheng Hu
Yuxing Lu
RALM
42
14
0
30 Apr 2024
GRAMMAR: Grounded and Modular Methodology for Assessment of Closed-Domain Retrieval-Augmented Language Model
Xinzhe Li
Ming Liu
Shang Gao
RALM
24
0
0
30 Apr 2024
InspectorRAGet: An Introspection Platform for RAG Evaluation
Kshitij P. Fadnis
Siva Sankalp Patel
O. Boni
Yannis Katsis
Sara Rosenthal
Benjamin Sznajder
Marina Danilevsky
22
1
0
26 Apr 2024
Unlocking Multi-View Insights in Knowledge-Dense Retrieval-Augmented Generation
Guanhua Chen
Wenhan Yu
Lei Sha
3DV
24
0
0
19 Apr 2024
ClashEval: Quantifying the tug-of-war between an LLM's internal prior and external evidence
Kevin Wu
Eric Wu
James Y. Zou
AAML
31
38
0
16 Apr 2024
CLAPNQ: Cohesive Long-form Answers from Passages in Natural Questions for RAG systems
Sara Rosenthal
Avirup Sil
Radu Florian
Salim Roukos
28
9
0
02 Apr 2024
Observations on Building RAG Systems for Technical Documents
Sumit Soman
Sujoy Roychowdhury
3DGS
25
6
0
31 Mar 2024
MedInsight: A Multi-Source Context Augmentation Framework for Generating Patient-Centric Medical Responses using Large Language Models
Subash Neupane
Shaswata Mitra
Sudip Mittal
Noorbakhsh Amiri Golilarz
Shahram Rahimi
Amin Amirlatifi
35
3
0
13 Mar 2024
FaaF: Facts as a Function for the evaluation of generated text
Vasileios Katranidis
Gabor Barany
HILM
RALM
23
4
0
06 Mar 2024
To Generate or to Retrieve? On the Effectiveness of Artificial Contexts for Medical Open-Domain Question Answering
Giacomo Frisoni
Alessio Cocchieri
Alex Presepi
Gianluca Moro
Zaiqiao Meng
RALM
MedIm
27
13
0
04 Mar 2024
Retrieval-Augmented Generation for AI-Generated Content: A Survey
Penghao Zhao
Hailin Zhang
Qinhan Yu
Zhengren Wang
Yunteng Geng
Fangcheng Fu
Ling Yang
Wentao Zhang
Jie Jiang
Bin Cui
3DV
66
182
0
29 Feb 2024
Retrieval Augmented Generation Systems: Automatic Dataset Creation, Evaluation and Boolean Agent Setup
Tristan Kenneweg
Philip Kenneweg
Barbara Hammer
3DV
21
1
0
26 Feb 2024
RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering
Zihan Zhang
Meng Fang
Ling-Hao Chen
RALM
32
11
0
26 Feb 2024
MoRAL: MoE Augmented LoRA for LLMs' Lifelong Learning
Shu Yang
Muhammad Asif Ali
Cheng-Long Wang
Lijie Hu
Di Wang
CLL
MoE
16
1
0
17 Feb 2024
LLM-based NLG Evaluation: Current Status and Challenges
Mingqi Gao
Xinyu Hu
Jie Ruan
Xiao Pu
Xiaojun Wan
ELM
LM&MA
19
1
0
02 Feb 2024
HiQA: A Hierarchical Contextual Augmentation RAG for Massive Documents QA
Xinyue Chen
Pengyu Gao
Jiangjiang Song
Xiaoyang Tan
34
4
0
01 Feb 2024
RAG-Fusion: a New Take on Retrieval-Augmented Generation
Zackary Rackauckas
16
33
0
31 Jan 2024
MultiHop-RAG: Benchmarking Retrieval-Augmented Generation for Multi-Hop Queries
Yixuan Tang
Yi Yang
RALM
23
1
0
27 Jan 2024
From RAG to QA-RAG: Integrating Generative AI for Pharmaceutical Regulatory Compliance Process
Jaewoong Kim
Moohong Min
23
11
0
26 Jan 2024
LOCALINTEL: Generating Organizational Threat Intelligence from Global and Local Cyber Knowledge
Shaswata Mitra
Subash Neupane
Trisha Chakraborty
Sudip Mittal
Aritran Piplai
Manas Gaur
Shahram Rahimi
16
20
0
18 Jan 2024
Opening A Pandora's Box: Things You Should Know in the Era of Custom GPTs
Guanhong Tao
Shuyang Cheng
Zhuo Zhang
Junmin Zhu
Guangyu Shen
Xiangyu Zhang
SILM
20
10
0
31 Dec 2023
Retrieval-Augmented Generation for Large Language Models: A Survey
Yunfan Gao
Yun Xiong
Xinyu Gao
Kangxiang Jia
Jinliu Pan
Yuxi Bi
Yi Dai
Jiawei Sun
Meng Wang
Haofen Wang
3DV
RALM
27
1,364
1
18 Dec 2023
PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning
Zhihan Zhang
Dong-Ho Lee
Yuwei Fang
W. Yu
Mengzhao Jia
Meng-Long Jiang
Francesco Barbieri
ALM
25
26
0
15 Nov 2023
The Internal State of an LLM Knows When It's Lying
A. Azaria
Tom Michael Mitchell
HILM
186
192
0
26 Apr 2023
Sparks of Artificial General Intelligence: Early experiments with GPT-4
Sébastien Bubeck
Varun Chandrasekaran
Ronen Eldan
J. Gehrke
Eric Horvitz
...
Scott M. Lundberg
Harsha Nori
Hamid Palangi
Marco Tulio Ribeiro
Yi Zhang
ELM
AI4MH
AI4CE
ALM
187
2,232
0
22 Mar 2023
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
Potsawee Manakul
Adian Liusie
Mark J. F. Gales
HILM
LRM
126
217
0
15 Mar 2023
Previous
1
2