ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2401.00396
  4. Cited By
RAGTruth: A Hallucination Corpus for Developing Trustworthy
  Retrieval-Augmented Language Models

RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models

31 December 2023
Cheng Niu
Yuanhao Wu
Juno Zhu
Siliang Xu
Kashun Shum
Randy Zhong
Juntong Song
Tong Zhang
    HILM
ArXivPDFHTML

Papers citing "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"

13 / 63 papers shown
Title
Synchronous Faithfulness Monitoring for Trustworthy Retrieval-Augmented
  Generation
Synchronous Faithfulness Monitoring for Trustworthy Retrieval-Augmented Generation
Di Wu
Jia-Chen Gu
Fan Yin
Nanyun Peng
Kai-Wei Chang
HILM
53
1
0
19 Jun 2024
R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval
  Augmented Large Language Models
R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models
Shangqing Tu
Yuanchun Wang
Jifan Yu
Yuyang Xie
Yaran Shi
Xiaozhi Wang
Jing Zhang
Lei Hou
Juanzi Li
ELM
35
3
0
17 Jun 2024
Superhuman performance in urology board questions by an explainable
  large language model enabled for context integration of the European
  Association of Urology guidelines: the UroBot study
Superhuman performance in urology board questions by an explainable large language model enabled for context integration of the European Association of Urology guidelines: the UroBot study
Martin J. Hetz
Nicolas Carl
Sarah Haggenmüller
Christoph Wies
Maurice Stephan Michel
Frederik Wessels
T. Brinker
ELM
34
0
0
03 Jun 2024
Luna: An Evaluation Foundation Model to Catch Language Model
  Hallucinations with High Accuracy and Low Cost
Luna: An Evaluation Foundation Model to Catch Language Model Hallucinations with High Accuracy and Low Cost
Masha Belyi
Robert Friel
Shuai Shao
Atindriyo Sanyal
HILM
RALM
61
5
0
03 Jun 2024
From Generalist to Specialist: Improving Large Language Models for
  Medical Physics Using ARCoT
From Generalist to Specialist: Improving Large Language Models for Medical Physics Using ARCoT
Jace Grandinetti
R. Mcbeth
AI4CE
LRM
LM&MA
30
0
0
17 May 2024
Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on
  Graphs
Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs
Bowen Jin
Chulin Xie
Jiawei Zhang
Kashob Kumar Roy
Yu Zhang
...
Ruirui Li
Xianfeng Tang
Suhang Wang
Yu Meng
Jiawei Han
LRM
RALM
53
37
0
10 Apr 2024
Mafin: Enhancing Black-Box Embeddings with Model Augmented Fine-Tuning
Mafin: Enhancing Black-Box Embeddings with Model Augmented Fine-Tuning
Mingtian Zhang
Shawn Lan
Peter Hayes
David Barber
29
2
0
19 Feb 2024
An Examination on the Effectiveness of Divide-and-Conquer Prompting in
  Large Language Models
An Examination on the Effectiveness of Divide-and-Conquer Prompting in Large Language Models
Yizhou Zhang
Lun Du
Defu Cao
Qiang Fu
Yan Liu
LRM
20
7
0
08 Feb 2024
The Knowledge Alignment Problem: Bridging Human and External Knowledge
  for Large Language Models
The Knowledge Alignment Problem: Bridging Human and External Knowledge for Large Language Models
Shuo Zhang
Liangming Pan
Junzhou Zhao
W. Wang
HILM
26
0
0
23 May 2023
The Internal State of an LLM Knows When It's Lying
The Internal State of an LLM Knows When It's Lying
A. Azaria
Tom Michael Mitchell
HILM
218
299
0
26 Apr 2023
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
311
11,915
0
04 Mar 2022
Entity-Based Knowledge Conflicts in Question Answering
Entity-Based Knowledge Conflicts in Question Answering
Shayne Longpre
Kartik Perisetla
Anthony Chen
Nikhil Ramesh
Chris DuBois
Sameer Singh
HILM
245
236
0
10 Sep 2021
Evaluating Attribution in Dialogue Systems: The BEGIN Benchmark
Evaluating Attribution in Dialogue Systems: The BEGIN Benchmark
Nouha Dziri
Hannah Rashkin
Tal Linzen
David Reitter
ALM
187
79
0
30 Apr 2021
Previous
12