Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.09848
Cited By
Evaluating Verifiability in Generative Search Engines
19 April 2023
Nelson F. Liu
Tianyi Zhang
Percy Liang
HILM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Evaluating Verifiability in Generative Search Engines"
50 / 158 papers shown
Title
Atomic Consistency Preference Optimization for Long-Form Question Answering
Jingfeng Chen
Raghuveer Thirukovalluru
Junlin Wang
Kaiwei Luo
Bhuwan Dhingra
KELM
HILM
20
0
0
14 May 2025
CiteFix: Enhancing RAG Accuracy Through Post-Processing Citation Correction
Harsh Maheshwari
Srikanth Tenneti
Alwarappan Nakkiran
3DV
29
0
0
22 Apr 2025
Transparentize the Internal and External Knowledge Utilization in LLMs with Trustworthy Citation
Jiajun Shen
Tong Zhou
Yubo Chen
Delai Qiu
Shengping Liu
Kang-Jun Liu
Jun Zhao
HILM
RALM
86
0
0
21 Apr 2025
Support Evaluation for the TREC 2024 RAG Track: Comparing Human versus LLM Judges
Nandan Thakur
Ronak Pradeep
Shivani Upadhyay
Daniel Fernando Campos
Nick Craswell
Jimmy Lin
ELM
38
0
0
21 Apr 2025
ECKGBench: Benchmarking Large Language Models in E-commerce Leveraging Knowledge Graph
Langming Liu
Haibin Chen
Yuhao Wang
Yujin Yuan
Shilei Liu
Wenbo Su
Xiangyu Zhao
Bo Zheng
RALM
58
0
0
20 Mar 2025
MCiteBench: A Benchmark for Multimodal Citation Text Generation in MLLMs
Caiyu Hu
Yikai Zhang
Tinghui Zhu
Yiwei Ye
Yanghua Xiao
83
0
0
04 Mar 2025
Rate, Explain and Cite (REC): Enhanced Explanation and Attribution in Automatic Evaluation by Large Language Models
Aliyah R. Hsu
James Zhu
Zhichao Wang
Bin Bi
Shubham Mehrotra
...
Sougata Chaudhuri
Regunathan Radhakrishnan
S. Asur
Claire Na Cheng
Bin Yu
ALM
LRM
69
0
0
20 Feb 2025
CiteCheck: Towards Accurate Citation Faithfulness Detection
Ziyao Xu
Shaohang Wei
Zhuoheng Han
Jing Jin
Z. Yang
Xiaoguang Li
Haochen Tan
Zhijiang Guo
Houfeng Wang
34
0
0
15 Feb 2025
Fostering Appropriate Reliance on Large Language Models: The Role of Explanations, Sources, and Inconsistencies
Sunnie S. Y. Kim
J. Vaughan
Q. V. Liao
Tania Lombrozo
Olga Russakovsky
96
5
0
12 Feb 2025
FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data
Deren Lei
Yaxi Li
Siyao Li
Mengya Hu
Rui Xu
Ken Archer
Mingyu Wang
Emily Ching
Alex Deng
SyDa
HILM
LRM
73
1
0
28 Jan 2025
Think&Cite: Improving Attributed Text Generation with Self-Guided Tree Search and Progress Reward Modeling
Junyi Li
Hwee Tou Ng
LRM
90
1
0
19 Dec 2024
Do Automatic Factuality Metrics Measure Factuality? A Critical Evaluation
S. Ramprasad
Byron C. Wallace
LLMAG
HILM
87
2
0
25 Nov 2024
Eliciting Critical Reasoning in Retrieval-Augmented Language Models via Contrastive Explanations
Leonardo Ranaldi
Marco Valentino
André Freitas
RALM
LRM
32
4
0
30 Oct 2024
Enhancing Answer Attribution for Faithful Text Generation with Large Language Models
Juraj Vladika
Luca Mülln
Florian Matthes
25
0
0
22 Oct 2024
Do RAG Systems Cover What Matters? Evaluating and Optimizing Responses with Sub-Question Coverage
Kaige Xie
Philippe Laban
Prafulla Kumar Choubey
Caiming Xiong
C. Wu
29
1
0
20 Oct 2024
Generative AI Agents in Autonomous Machines: A Safety Perspective
Jason J. Jabbour
Vijay Janapa Reddi
AI4CE
43
4
0
20 Oct 2024
On the Capacity of Citation Generation by Large Language Models
Haosheng Qian
Yixing Fan
Ruqing Zhang
J. Guo
HILM
23
1
0
15 Oct 2024
Search Engines in an AI Era: The False Promise of Factual and Verifiable Source-Cited Responses
Pranav Narayanan Venkit
Philippe Laban
Yilun Zhou
Yixin Mao
C. Wu
ELM
32
7
0
15 Oct 2024
Yesterday's News: Benchmarking Multi-Dimensional Out-of-Distribution Generalisation of Misinformation Detection Models
Ivo Verhoeven
Pushkar Mishra
Ekaterina Shutova
25
0
0
12 Oct 2024
ALR
2
^2
2
: A Retrieve-then-Reason Framework for Long-context Question Answering
Huayang Li
Pat Verga
Priyanka Sen
Bowen Yang
Vijay Viswanathan
Patrick Lewis
Taro Watanabe
Yixuan Su
RALM
LRM
46
7
0
04 Oct 2024
L-CiteEval: Do Long-Context Models Truly Leverage Context for Responding?
Zecheng Tang
Keyan Zhou
Juntao Li
Baibei Ji
Jianye Hou
Min Zhang
39
2
0
03 Oct 2024
Neurosymbolic AI approach to Attribution in Large Language Models
Deepa Tilwani
R. Venkataramanan
Amit P. Sheth
32
1
0
30 Sep 2024
Enhancing Post-Hoc Attributions in Long Document Comprehension via Coarse Grained Answer Decomposition
Pritika Ramu
Koustava Goswami
Apoorv Saxena
Balaji Vasan Srinivavsan
33
1
0
25 Sep 2024
Learning When to Retrieve, What to Rewrite, and How to Respond in Conversational QA
Nirmal Roy
Leonardo F. R. Ribeiro
Rexhina Blloshmi
Kevin Small
RALM
22
2
0
23 Sep 2024
Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse
Maojia Song
Shang Hong Sim
Rishabh Bhardwaj
Hai Leong Chieu
Navonil Majumder
Soujanya Poria
34
6
0
17 Sep 2024
ContextCite: Attributing Model Generation to Context
Benjamin Cohen-Wang
Harshay Shah
Kristian Georgiev
Aleksander Madry
LRM
30
18
0
01 Sep 2024
Into the Unknown Unknowns: Engaged Human Learning through Participation in Language Model Agent Conversations
Yucheng Jiang
Yijia Shao
Dekun Ma
Sina J. Semnani
Monica S. Lam
LLMAG
32
14
0
27 Aug 2024
A Comparative Analysis of Faithfulness Metrics and Humans in Citation Evaluation
Weijia Zhang
Mohammad Aliannejadi
Jiahuan Pei
Yifei Yuan
Jia-Hong Huang
Evangelos Kanoulas
HILM
37
4
0
22 Aug 2024
Analysis of Plan-based Retrieval for Grounded Text Generation
Ameya Godbole
Nicholas Monath
Seungyeon Kim
A. S. Rawat
Andrew McCallum
Manzil Zaheer
RALM
38
2
0
20 Aug 2024
RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation
Dongyu Ru
Lin Qiu
Xiangkun Hu
Tianhang Zhang
Peng Shi
...
Tong He
Zhiguo Wang
Pengfei Liu
Yue Zhang
Zheng Zhang
49
12
0
15 Aug 2024
Learning Fine-Grained Grounded Citations for Attributed Large Language Models
Lei Huang
Xiaocheng Feng
Weitao Ma
Yuxuan Gu
Weihong Zhong
...
Weijiang Yu
Weihua Peng
Duyu Tang
Dandan Tu
Bing Qin
HILM
24
4
0
08 Aug 2024
Zero-shot Factual Consistency Evaluation Across Domains
Raunak Agarwal
HILM
39
0
0
07 Aug 2024
RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework
Kunlun Zhu
Yifan Luo
Dingling Xu
Ruobing Wang
Shi Yu
...
Yishan Li
Zhiyuan Liu
Xu Han
Zhiyuan Liu
Maosong Sun
29
17
0
02 Aug 2024
Improving Retrieval Augmented Language Model with Self-Reasoning
Yuan Xia
Jingbo Zhou
Zhenhui Shi
Jun Chen
Hai-ting Huang
AIFin
LRM
ReLM
KELM
45
8
0
29 Jul 2024
Grounding and Evaluation for Large Language Models: Practical Challenges and Lessons Learned (Survey)
K. Kenthapadi
M. Sameki
Ankur Taly
HILM
ELM
AILaw
36
12
0
10 Jul 2024
How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions
Bojana Bašaragin
Adela Ljajić
Darija Medvecki
Lorenzo Cassano
Milos Kosprdic
Nikola Milosevic
LM&MA
32
2
0
06 Jul 2024
Face4RAG: Factual Consistency Evaluation for Retrieval Augmented Generation in Chinese
Yunqi Xu
Tianchi Cai
Jiyan Jiang
Xierui Song
35
2
0
01 Jul 2024
Molecular Facts: Desiderata for Decontextualization in LLM Fact Verification
Anisha Gunjal
Greg Durrett
HILM
46
13
0
28 Jun 2024
CLERC: A Dataset for Legal Case Retrieval and Retrieval-Augmented Analysis Generation
Abe Bohan Hou
Orion Weller
Guanghui Qin
Eugene Yang
Dawn J Lawrie
Nils Holzenberger
Andrew Blair-Stanek
Benjamin Van Durme
AILaw
ELM
81
5
0
24 Jun 2024
FastMem: Fast Memorization of Prompt Improves Context Awareness of Large Language Models
Junyi Zhu
Shuochen Liu
Yu Yu
Bo Tang
Yibo Yan
Zhiyu Li
Feiyu Xiong
Tong Xu
Matthew B. Blaschko
44
3
0
23 Jun 2024
Towards Fine-Grained Citation Evaluation in Generated Text: A Comparative Analysis of Faithfulness Metrics
Weijia Zhang
Mohammad Aliannejadi
Yifei Yuan
Jiahuan Pei
Jia-Hong Huang
Evangelos Kanoulas
HILM
29
12
0
21 Jun 2024
FoRAG: Factuality-optimized Retrieval Augmented Generation for Web-enhanced Long-form Question Answering
Tianchi Cai
Zhiwen Tan
Xierui Song
Tao Sun
Jiyan Jiang
Yunqi Xu
Yinger Zhang
Jinjie Gu
27
5
0
19 Jun 2024
Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation
Jirui Qi
Gabriele Sarti
Raquel Fernández
Arianna Bisazza
RALM
39
5
0
19 Jun 2024
ALiiCE: Evaluating Positional Fine-grained Citation Generation
Yilong Xu
Jinhua Gao
Xiaoming Yu
Baolong Bi
Huawei Shen
Xueqi Cheng
HILM
29
5
0
19 Jun 2024
Learning to Generate Answers with Citations via Factual Consistency Models
Rami Aly
Zhiqiang Tang
Samson Tan
George Karypis
HILM
34
4
0
19 Jun 2024
Know the Unknown: An Uncertainty-Sensitive Method for LLM Instruction Tuning
Jiaqi Li
Yixuan Tang
Yi Yang
46
5
0
14 Jun 2024
Post-Hoc Answer Attribution for Grounded and Trustworthy Long Document Comprehension: Task, Insights, and Challenges
Abhilasha Sancheti
Koustava Goswami
Balaji Vasan Srinivasan
RALM
30
1
0
11 Jun 2024
Verifiable Generation with Subsentence-Level Fine-Grained Citations
Shuyang Cao
Lu Wang
31
6
0
10 Jun 2024
A Reality check of the benefits of LLM in business
Ming Cheung
27
3
0
09 Jun 2024
CaLM: Contrasting Large and Small Language Models to Verify Grounded Generation
I-Hung Hsu
Zifeng Wang
Long T. Le
Lesly Miculicich
Nanyun Peng
Chen-Yu Lee
Tomas Pfister
HILM
29
4
0
08 Jun 2024
1
2
3
4
Next