2305.14251
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
23 May 2023
Sewon Min
Kalpesh Krishna
Xinxi Lyu
M. Lewis
Anuj Kumar
Pang Wei Koh
Mohit Iyyer
Luke Zettlemoyer
Hannaneh Hajishirzi
HILM
ALM
Papers citing
"FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"
50 / 615 papers shown
BookWorm: A Dataset for Character Description and Analysis
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Argyrios Papoudakis
Mirella Lapata
Frank Keller
195
2
0
14 Oct 2024
Can Knowledge Graphs Make Large Language Models More Trustworthy? An Empirical Study Over Open-ended Question Answering
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Yuan Sui
Yufei He
Zifeng Ding
Bryan Hooi
HILM
RALM
ELM
497
22
0
10 Oct 2024
ReIFE: Re-evaluating Instruction-Following Evaluation
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Yixin Liu
Kejian Shi
Alexander R. Fabbri
Yilun Zhao
Peifeng Wang
Chien-Sheng Wu
Shafiq Joty
Arman Cohan
215
13
0
09 Oct 2024
LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple Constraints
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Thomas Palmeira Ferraz
Kartik Mehta
Yu-Hsiang Lin
Haw-Shiuan Chang
Shereen Oraby
Sijia Liu
Vivek Subramanian
Tagyoung Chung
Mohit Bansal
Nanyun Peng
295
26
0
09 Oct 2024
Uncovering Factor Level Preferences to Improve Human-Model Alignment
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Juhyun Oh
Eunsu Kim
Jiseon Kim
Wenda Xu
Inha Cha
William Yang Wang
Alice Oh
374
1
0
09 Oct 2024
ReFIR: Grounding Large Restoration Models with Retrieval Augmentation
Neural Information Processing Systems (NeurIPS), 2024
Hang Guo
Tao Dai
Zhihao Ouyang
Taolin Zhang
Yaohua Zha
Bin Chen
Shu-Tao Xia
DiffM
208
10
0
08 Oct 2024
Why am I seeing this: Democratizing End User Auditing for Online Content Recommendations
ACM Symposium on User Interface Software and Technology (UIST), 2024
Chaoran Chen
Leyang Li
Luke Cao
Yanfang Ye
Tianshi Li
Yaxing Yao
Toby Jia-jun Li
MLAU
249
6
0
07 Oct 2024
Realizing Video Summarization from the Path of Language-based Semantic Understanding
Kuan-Chen Mu
Zhi-Yi Chin
Wei-Chen Chiu
169
0
0
06 Oct 2024
Locating Information Gaps and Narrative Inconsistencies Across Languages: A Case Study of LGBT People Portrayals on Wikipedia
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Farhan Samir
Chan Young Park
Anjalie Field
Vered Shwartz
Yulia Tsvetkov
172
8
0
05 Oct 2024
CS4: Measuring the Creativity of Large Language Models Automatically by Controlling the Number of Story-Writing Constraints
Anirudh Atmakuru
Jatin Nainani
Rohith Siddhartha Reddy Bheemreddy
Anirudh Lakkaraju
Zonghai Yao
Hamed Zamani
Haw-Shiuan Chang
367
12
0
05 Oct 2024
ECon: On the Detection and Resolution of Evidence Conflicts
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Cheng Jiayang
Chunkit Chan
Qianqian Zhuang
Lin Qiu
Tianhang Zhang
Tengxiao Liu
Yangqiu Song
Yue Zhang
Pengfei Liu
Zheng Zhang
260
13
0
05 Oct 2024
FactCheckmate: Preemptively Detecting and Mitigating Hallucinations in LMs
Deema Alnuhait
Neeraja Kirtane
Muhammad Khalifa
Hao Peng
LRM
HILM
367
7
0
03 Oct 2024
Loki: An Open-Source Tool for Fact Verification
International Conference on Computational Linguistics (COLING), 2024
Haonan Li
Xudong Han
Hao Wang
Yuxia Wang
Minghan Wang
Daniil Vasilev
Yilin Geng
Zenan Zhai
Preslav Nakov
Timothy Baldwin
SyDa
HILM
619
16
0
02 Oct 2024
Open-RAG: Enhanced Retrieval-Augmented Reasoning with Open-Source Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Shayekh Bin Islam
Md Asib Rahman
K S M Tozammel Hossain
Enamul Hoque
Shafiq Joty
Md. Rizwan Parvez
RALM
AIFin
LRM
VLM
207
36
0
02 Oct 2024
FactAlign: Long-form Factuality Alignment of Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Chao-Wei Huang
Yun-Nung Chen
HILM
142
11
0
02 Oct 2024
FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"
International Conference on Learning Representations (ICLR), 2024
Yifei Ming
Senthil Purushwalkam
Shrey Pandit
Zixuan Ke
Xuan-Phi Nguyen
Caiming Xiong
Shafiq Joty
HILM
619
44
0
30 Sep 2024
CoTKR: Chain-of-Thought Enhanced Knowledge Rewriting for Complex Knowledge Graph Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Yike Wu
Yi Huang
Nan Hu
Yuncheng Hua
Guilin Qi
Jiaoyan Chen
Jeff Z. Pan
388
21
0
29 Sep 2024
Model-based Preference Optimization in Abstractive Summarization without Human Feedback
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Jaepill Choi
Kyubyung Chae
Jiwoo Song
Yohan Jo
Taesup Kim
426
7
0
27 Sep 2024
HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection
Neural Information Processing Systems (NeurIPS), 2024
Xuefeng Du
Chaowei Xiao
Yixuan Li
HILM
254
60
0
26 Sep 2024
Enhancing Post-Hoc Attributions in Long Document Comprehension via Coarse Grained Answer Decomposition
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Pritika Ramu
Koustava Goswami
Apoorv Saxena
Balaji Vasan Srinivasan
289
10
0
25 Sep 2024
LINKAGE: Listwise Ranking among Varied-Quality References for Non-Factoid QA Evaluation via LLMs
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Sihui Yang
Keping Bi
Wanqing Cui
Jiafeng Guo
Xueqi Cheng
261
5
0
23 Sep 2024
The Ability of Large Language Models to Evaluate Constraint-satisfaction in Agent Responses to Open-ended Requests
Lior Madmoni
Amir Zait
Ilia Labzovsky
Danny Karmon
ELM
157
1
0
22 Sep 2024
The Factuality of Large Language Models in the Legal Domain
International Conference on Information and Knowledge Management (CIKM), 2024
Rajaa El Hamdani
Thomas Bonald
Fragkiskos D. Malliaros
Nils Holzenberger
Fabian M. Suchanek
AILaw
HILM
260
11
0
18 Sep 2024
LLM-as-a-Judge & Reward Model: What They Can and Cannot Do
Guijin Son
Hyunwoo Ko
Hoyoung Lee
Yewon Kim
Seunghyeok Hong
ALM
ELM
336
16
0
17 Sep 2024
HALO: Hallucination Analysis and Learning Optimization to Empower LLMs with Retrieval-Augmented Context for Guided Clinical Decision Making
IEEE/ACM International Conference on Connected Health: Applications, Systems and Engineering Technologies (CHASE), 2024
Sumera Anjum
Hanzhi Zhang
Wenjun Zhou
Eun Jin Paek
Xiaopeng Zhao
Yunhe Feng
451
7
0
16 Sep 2024
Gaps or Hallucinations? Gazing into Machine-Generated Legal Analysis for Fine-grained Text Evaluations
Abe Bohan Hou
William Jurayj
Nils Holzenberger
Andrew Blair-Stanek
Benjamin Van Durme
ELM
244
2
0
16 Sep 2024
NovAScore: A New Automated Metric for Evaluating Document Level Novelty
International Conference on Computational Linguistics (COLING), 2024
Lin Ai
Ziwei Gong
Harshsaiprasad Deshpande
Alexander Johnson
Emmy Phung
Ahmad Emami
Julia Hirschberg
154
3
0
14 Sep 2024
When Context Leads but Parametric Memory Follows in Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Yufei Tao
Adam Hiatt
Erik Haake
Antonie J. Jetter
Ameeta Agrawal
KELM
355
6
0
13 Sep 2024
AI-LieDar: Examine the Trade-off Between Utility and Truthfulness in LLM Agents
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Zhe Su
Xuhui Zhou
Sanketh Rangreji
Anubha Kabra
Julia Mendelsohn
Faeze Brahman
Maarten Sap
LLMAG
393
20
0
13 Sep 2024
Synthetic continued pretraining
International Conference on Learning Representations (ICLR), 2024
Zitong Yang
Neil Band
Shuangping Li
Emmanuel Candès
Tatsunori Hashimoto
CLL
SyDa
348
35
0
11 Sep 2024
GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering
International Conference on Computational Linguistics (COLING), 2024
Sacha Muller
António Loison
Bilel Omrani
Gautier Viaud
RALM
ELM
423
4
0
10 Sep 2024
Enhancing Temporal Understanding in Audio Question Answering for Large Audio Language Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
A. Sridhar
Yinyi Guo
Erik M. Visser
AuLLM
300
4
0
10 Sep 2024
What is the Role of Small Models in the LLM Era: A Survey
Lihu Chen
Gaël Varoquaux
ALM
777
55
0
10 Sep 2024
Hallucination Detection in LLMs: Fast and Memory-Efficient Finetuned Models
Gabriel Y. Arteaga
Thomas B. Schön
Nicolas Pielawski
331
20
0
04 Sep 2024
Generating Media Background Checks for Automated Source Critical Reasoning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Michael Schlichtkrull
240
7
0
01 Sep 2024
ContextCite: Attributing Model Generation to Context
Neural Information Processing Systems (NeurIPS), 2024
Benjamin Cohen-Wang
Harshay Shah
Kristian Georgiev
Aleksander Madry
LRM
356
59
0
01 Sep 2024
LoraMap: Harnessing the Power of LoRA Connections
Hyeryun Park
Jeongwon Kwak
Dongsuk Jang
Sumin Park
Jinwook Choi
MoMe
204
0
0
29 Aug 2024
Measuring text summarization factuality using atomic facts entailment metrics in the context of retrieval augmented generation
N. E. Kriman
HILM
202
3
0
27 Aug 2024
What Makes a Good Story and How Can We Measure It? A Comprehensive Survey of Story Evaluation
Dingyi Yang
Qin Jin
407
15
0
26 Aug 2024
Claim Verification in the Age of Large Language Models: A Survey
A. Dmonte
Roland Oruche
Marcos Zampieri
Prasad Calyam
Isabelle Augenstein
593
19
0
26 Aug 2024
SLM Meets LLM: Balancing Latency, Interpretability and Consistency in Hallucination Detection
Mengya Hu
Rui Xu
Deren Lei
Yaxi Li
Mingyu Wang
Emily Ching
Eslam Kamal
Alex Deng
170
6
0
22 Aug 2024
RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Xuanwang Zhang
Yunze Song
Yidong Wang
Shuyun Tang
Xinfeng Li
...
Wei Dong
Yue Zhang
Xinyu Dai
Shikun Zhang
Qingsong Wen
325
8
0
21 Aug 2024
Analysis of Plan-based Retrieval for Grounded Text Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Ameya Godbole
Nicholas Monath
Seungyeon Kim
A. S. Rawat
Andrew McCallum
Manzil Zaheer
RALM
264
4
0
20 Aug 2024
Unconditional Truthfulness: Learning Unconditional Uncertainty of Large Language Models
Artem Vazhentsev
Ekaterina Fadeeva
Daniil Vasilev
Sergey Petrakov
Ivan Lazichny
Ilseyar Alimova
Preslav Nakov
Timothy Baldwin
Maxim Panov
Artem Shelmanov
HILM
245
3
0
20 Aug 2024
Web Retrieval Agents for Evidence-Based Misinformation Detection
Jacob-Junqi Tian
Hao Yu
Yury Orlovskiy
Tyler Vergho
Mauricio Rivera
Mayank Goel
Zachary Yang
Jean-Francois Godbout
Reihaneh Rabbany
Kellin Pelrine
LLMAG
OffRL
262
14
0
15 Aug 2024
Zero-shot Factual Consistency Evaluation Across Domains
Raunak Agarwal
HILM
322
1
0
07 Aug 2024
DebateQA: Evaluating Question Answering on Debatable Knowledge
Rongwu Xu
Xuan Qi
Zehan Qi
Wei Xu
Zhijiang Guo
ELM
223
11
0
02 Aug 2024
Misinforming LLMs: vulnerabilities, challenges and opportunities
Jaroslaw Kornowicz
Daniel Geissler
Kirsten Thommes
138
6
0
02 Aug 2024
A Course Shared Task on Evaluating LLM Output for Clinical Questions
Yufang Hou
Thy Thy Tran
Doan Nam Long Vu
Yiwen Cao
Kai Li
Lukas Rohde
Iryna Gurevych
LM&MA
ELM
164
0
0
31 Jul 2024
CLR-Fact: Evaluating the Complex Logical Reasoning Capability of Large Language Models over Factual Knowledge
Tianshi Zheng
Jiaxin Bai
Yicheng Wang
Tianqing Fang
Yue Guo
Yauwai Yim
Yangqiu Song
ELM
LRM
263
6
0
30 Jul 2024