Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.09848
Cited By
Evaluating Verifiability in Generative Search Engines
19 April 2023
Nelson F. Liu
Tianyi Zhang
Percy Liang
HILM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Evaluating Verifiability in Generative Search Engines"
50 / 158 papers shown
Title
Peering into the Mind of Language Models: An Approach for Attribution in Contextual Question Answering
Anirudh Phukan
Shwetha Somasundaram
Apoorv Saxena
Koustava Goswami
Balaji Vasan Srinivasan
32
8
0
28 May 2024
AGRaME: Any-Granularity Ranking with Multi-Vector Embeddings
R. Reddy
Omar Attia
Yunyao Li
Heng Ji
Saloni Potdar
32
1
0
23 May 2024
Generative AI Search Engines as Arbiters of Public Knowledge: An Audit of Bias and Authority
Alice Li
Luanne Sinnamon
31
3
0
22 May 2024
DETAIL: Task DEmonsTration Attribution for Interpretable In-context Learning
Zijian Zhou
Xiaoqiang Lin
Xinyi Xu
Alok Prakash
Daniela Rus
K. H. Low
36
2
0
22 May 2024
Atomic Self-Consistency for Better Long Form Generations
Raghuveer Thirukovalluru
Yukun Huang
Bhuwan Dhingra
30
5
0
21 May 2024
Explainability for Transparent Conversational Information-Seeking
Weronika Lajewska
Damiano Spina
Johanne Trippas
K. Balog
34
7
0
06 May 2024
Efficient Data Generation for Source-grounded Information-seeking Dialogs: A Use Case for Meeting Transcripts
Lotem Golany
Filippo Galgani
Maya Mamo
Nimrod Parasol
Omer Vandsburger
Nadav Bar
Ido Dagan
27
2
0
02 May 2024
"I'm Not Sure, But...": Examining the Impact of Large Language Models' Uncertainty Expression on User Reliance and Trust
Sunnie S. Y. Kim
Q. V. Liao
Mihaela Vorvoreanu
Steph Ballard
Jennifer Wortman Vaughan
32
51
0
01 May 2024
Domain-Specific Improvement on Psychotherapy Chatbot Using Assistant
Cheng Kang
Daniel Novak
Kateřina Urbanová
Yuqing Cheng
Yong Hu
AI4MH
LM&MA
30
3
0
24 Apr 2024
From Matching to Generation: A Survey on Generative Information Retrieval
Xiaoxi Li
Jiajie Jin
Yujia Zhou
Yuyao Zhang
Peitian Zhang
Yutao Zhu
Zhicheng Dou
3DV
69
46
0
23 Apr 2024
ChatRetriever: Adapting Large Language Models for Generalized and Robust Conversational Dense Retrieval
Kelong Mao
Chenlong Deng
Haonan Chen
Fengran Mo
Zheng Liu
Tetsuya Sakai
Zhicheng Dou
KELM
45
11
0
21 Apr 2024
MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents
Liyan Tang
Philippe Laban
Greg Durrett
HILM
SyDa
37
74
0
16 Apr 2024
CoTAR: Chain-of-Thought Attribution Reasoning with Multi-level Granularity
Moshe Berchansky
Daniel Fleischer
Moshe Wasserblat
Peter Izsak
LRM
49
5
0
16 Apr 2024
Groundedness in Retrieval-augmented Long-form Generation: An Empirical Study
Alessandro Stolfo
RALM
HILM
26
6
0
10 Apr 2024
How much reliable is ChatGPT's prediction on Information Extraction under Input Perturbations?
Ishani Mondal
Abhilasha Sancheti
17
1
0
07 Apr 2024
Verifiable by Design: Aligning Language Models to Quote from Pre-Training Data
Jingyu Zhang
Marc Marone
Tianjian Li
Benjamin Van Durme
Daniel Khashabi
90
9
0
05 Apr 2024
Learning to Plan and Generate Text with Citations
Constanza Fierro
Reinald Kim Amplayo
Fantine Huot
Nicola De Cao
Joshua Maynez
Shashi Narayan
Mirella Lapata
24
17
0
04 Apr 2024
Source-Aware Training Enables Knowledge Attribution in Language Models
Muhammad Khalifa
David Wadden
Emma Strubell
Honglak Lee
Lu Wang
Iz Beltagy
Hao Peng
HILM
36
14
0
01 Apr 2024
Improving Attributed Text Generation of Large Language Models via Preference Learning
Dongfang Li
Zetian Sun
Baotian Hu
Zhenyu Liu
Xinshuo Hu
Xuebo Liu
Min Zhang
42
13
0
27 Mar 2024
Attribute First, then Generate: Locally-attributable Grounded Text Generation
Aviv Slobodkin
Eran Hirsch
Arie Cattan
Tal Schuster
Ido Dagan
71
20
0
25 Mar 2024
WebCiteS: Attributed Query-Focused Summarization on Chinese Web Search Results with Citations
Haolin Deng
Chang Wang
Xin Li
Dezhang Yuan
Junlang Zhan
Tianhua Zhou
Jin Ma
Jun Gao
Ruifeng Xu
HILM
58
2
0
04 Mar 2024
Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents
Corby Rosset
Ho-Lam Chung
Guanghui Qin
Ethan C. Chau
Zhuo Feng
Ahmed Hassan Awadallah
Jennifer Neville
Nikhil Rao
37
10
0
27 Feb 2024
Evaluating Very Long-Term Conversational Memory of LLM Agents
A. Maharana
Dong-Ho Lee
Sergey Tulyakov
Mohit Bansal
Francesco Barbieri
Yuwei Fang
LLMAG
22
66
0
27 Feb 2024
Evaluating Robustness of Generative Search Engine on Adversarial Factual Questions
Xuming Hu
Xiaochuan Li
Junzhe Chen
Yinghui Li
Yangning Li
...
Yasheng Wang
Qun Liu
Lijie Wen
Philip S. Yu
Zhijiang Guo
AAML
ELM
24
5
0
25 Feb 2024
Fine-Grained Self-Endorsement Improves Factuality and Reasoning
Ante Wang
Linfeng Song
Baolin Peng
Ye Tian
Lifeng Jin
Haitao Mi
Jinsong Su
Dong Yu
HILM
LRM
23
6
0
23 Feb 2024
Fast Adversarial Attacks on Language Models In One GPU Minute
Vinu Sankar Sadasivan
Shoumik Saha
Gaurang Sriramanan
Priyatham Kattakinda
Atoosa Malemir Chegini
S. Feizi
MIALM
30
34
0
23 Feb 2024
AttributionBench: How Hard is Automatic Attribution Evaluation?
Yifei Li
Xiang Yue
Zeyi Liao
Huan Sun
HILM
27
13
0
23 Feb 2024
Search Engines Post-ChatGPT: How Generative Artificial Intelligence Could Make Search Less Reliable
Shahan Ali Memon
Jevin D. West
31
6
0
18 Feb 2024
Towards Faithful and Robust LLM Specialists for Evidence-Based Question-Answering
Tobias Schimanski
Jingwei Ni
Mathias Kraus
Elliott Ash
Markus Leippold
21
4
0
13 Feb 2024
Generative Echo Chamber? Effects of LLM-Powered Search Systems on Diverse Information Seeking
Nikhil Sharma
Q. V. Liao
Ziang Xiao
30
19
0
08 Feb 2024
Merging Facts, Crafting Fallacies: Evaluating the Contradictory Nature of Aggregated Factual Claims in Long-Form Generations
Cheng-Han Chiang
Hung-yi Lee
HILM
67
8
0
08 Feb 2024
Training Language Models to Generate Text with Citations via Fine-grained Rewards
Chengyu Huang
Zeqiu Wu
Yushi Hu
Wenya Wang
HILM
LRM
79
25
0
06 Feb 2024
How well do LLMs cite relevant medical references? An evaluation framework and analyses
Kevin Wu
Eric Wu
Ally Cassasola
Angela Zhang
Kevin Wei
Teresa Nguyen
Sith Riantawan
Patricia Shi Riantawan
Daniel E. Ho
James Y. Zou
LM&MA
ELM
AI4MH
23
26
0
03 Feb 2024
(A)I Am Not a Lawyer, But...: Engaging Legal Experts towards Responsible LLM Policies for Legal Advice
Inyoung Cheong
King Xia
K. J. Kevin Feng
Quan Ze Chen
Amy X. Zhang
AILaw
ELM
33
58
0
02 Feb 2024
CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models
Yuanjie Lyu
Zhiyu Li
Simin Niu
Feiyu Xiong
Bo Tang
Wenjin Wang
Hao Wu
Huan Liu
Tong Bill Xu
Enhong Chen
RALM
34
32
0
30 Jan 2024
Benchmarking Large Language Models in Complex Question Answering Attribution using Knowledge Graphs
Nan Hu
Jiaoyan Chen
Yike Wu
Guilin Qi
Sheng Bi
Tongtong Wu
Jeff Z. Pan
HILM
37
8
0
26 Jan 2024
Fine-grained Hallucination Detection and Editing for Language Models
Abhika Mishra
Akari Asai
Vidhisha Balachandran
Yizhong Wang
Graham Neubig
Yulia Tsvetkov
Hannaneh Hajishirzi
HILM
29
78
0
12 Jan 2024
LatestEval: Addressing Data Contamination in Language Model Evaluation through Dynamic and Time-Sensitive Test Construction
Yucheng Li
Frank Geurin
Chenghua Lin
10
26
0
19 Dec 2023
Towards Verifiable Text Generation with Evolving Memory and Self-Reflection
Hao-Lun Sun
Hengyi Cai
Bo Wang
Yingyan Hou
Xiaochi Wei
Shuaiqiang Wang
Yan Zhang
Dawei Yin
39
8
0
14 Dec 2023
Evaluating Large Language Models for Health-related Queries with Presuppositions
Navreet Kaur
Monojit Choudhury
Danish Pruthi
HILM
ELM
25
2
0
14 Dec 2023
Unlocking Anticipatory Text Generation: A Constrained Approach for Large Language Models Decoding
Lifu Tu
Semih Yavuz
Jin Qu
Jiacheng Xu
Rui Meng
Caiming Xiong
Yingbo Zhou
24
1
0
11 Dec 2023
Axiomatic Preference Modeling for Longform Question Answering
Corby Rosset
Guoqing Zheng
Victor C. Dibia
Ahmed Hassan Awadallah
Paul Bennett
SyDa
19
3
0
02 Dec 2023
A Survey of the Evolution of Language Model-Based Dialogue Systems
Hongru Wang
Lingzhi Wang
Yiming Du
Liang Chen
Jing Zhou
Yufei Wang
Kam-Fai Wong
LRM
53
20
0
28 Nov 2023
Unifying Corroborative and Contributive Attributions in Large Language Models
Theodora Worledge
Judy Hanwen Shen
Nicole Meister
Caleb Winston
Carlos Guestrin
TDI
24
10
0
20 Nov 2023
GEO: Generative Engine Optimization
Pranjal Aggarwal
Vishvak Murahari
Tanmay Rajpurohit
A. Kalyan
Karthik Narasimhan
A. Deshpande
38
2
0
16 Nov 2023
DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation
Yiqing Xie
Sheng Zhang
Hao Cheng
Pengfei Liu
Zelalem Gero
Cliff Wong
Tristan Naumann
Hoifung Poon
Carolyn Rose
MedIm
16
4
0
16 Nov 2023
Effective Large Language Model Adaptation for Improved Grounding and Citation Generation
Xi Ye
Ruoxi Sun
Sercan Ö. Arik
Tomas Pfister
HILM
31
25
0
16 Nov 2023
Towards Verifiable Text Generation with Symbolic References
Lucas Torroba Hennigen
Zejiang Shen
Aniruddha Nrusimha
Bernhard Gapp
David Sontag
Yoon Kim
20
10
0
15 Nov 2023
How Well Do Large Language Models Truly Ground?
Hyunji Lee
Se June Joo
Chaeeun Kim
Joel Jang
Doyoung Kim
Kyoung-Woon On
Minjoon Seo
HILM
25
6
0
15 Nov 2023
LLatrieval: LLM-Verified Retrieval for Verifiable Generation
Xiaonan Li
Changtai Zhu
Linyang Li
Zhangyue Yin
Tianxiang Sun
Xipeng Qiu
RALM
29
24
0
14 Nov 2023
Previous
1
2
3
4
Next