Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2305.14251
Cited By
v1
v2 (latest)
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
23 May 2023
Sewon Min
Kalpesh Krishna
Xinxi Lyu
M. Lewis
Anuj Kumar
Pang Wei Koh
Mohit Iyyer
Luke Zettlemoyer
Hannaneh Hajishirzi
HILM
ALM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (2 upvotes)
Papers citing
"FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"
50 / 608 papers shown
Title
FACTS&EVIDENCE: An Interactive Tool for Transparent Fine-Grained Factual Verification of Machine-Generated Text
Varich Boonsanong
Vidhisha Balachandran
Xiaochuang Han
Shangbin Feng
Lucy Lu Wang
Yulia Tsvetkov
296
3
0
19 Mar 2025
Optimizing Decomposition for Optimal Claim Verification
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yining Lu
Noah Ziems
Hy Dang
Meng Jiang
319
2
0
19 Mar 2025
GraphEval: A Lightweight Graph-Based LLM Framework for Idea Evaluation
International Conference on Learning Representations (ICLR), 2025
Tao Feng
Yihang Sun
Jiaxuan You
347
14
0
16 Mar 2025
AIstorian lets AI be a historian: A KG-powered multi-agent system for accurate biography generation
Fengyu Li
Yilin Li
Junhao Zhu
Lu Chen
Yanfei Zhang
Jia Zhou
Hui Zu
Jingwen Zhao
Yunjun Gao
LLMAG
180
0
0
14 Mar 2025
Odysseus Navigates the Sirens' Song: Dynamic Focus Decoding for Factual and Diverse Open-Ended Text Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Wen Luo
Feifan Song
Wei Li
Guangyue Peng
Shaohang Wei
Houfeng Wang
AI4CE
184
1
0
11 Mar 2025
Evaluating open-source Large Language Models for automated fact-checking
Nicoló Fontana
Francesco Corso
Enrico Zuccolotto
Francesco Pierri
HILM
208
5
0
07 Mar 2025
Benchmarking Large Language Models on Multiple Tasks in Bioinformatics NLP with Prompting
Jiyue Jiang
Pengan Chen
Jinqiao Wang
Dongchen He
Ziqin Wei
...
Yimin Fan
Xiangyu Shi
Jimeng Sun
Chuan Wu
Yuan Li
LM&MA
185
6
0
06 Mar 2025
Uncovering Gaps in How Humans and LLMs Interpret Subjective Language
International Conference on Learning Representations (ICLR), 2025
Erik Jones
Arjun Patrawala
Jacob Steinhardt
176
2
0
06 Mar 2025
DSVD: Dynamic Self-Verify Decoding for Faithful Generation in Large Language Models
Y. Guo
Yuchen Yang
Zhe Chen
Pingjie Wang
Yusheng Liao
Yujiao Shi
Yanfeng Wang
Yu Wang
HILM
229
2
0
05 Mar 2025
AILS-NTUA at SemEval-2025 Task 3: Leveraging Large Language Models and Translation Strategies for Multilingual Hallucination Detection
Dimitra Karkani
Maria Lymperaiou
Giorgos Filandrianos
Nikolaos Spanos
Athanasios Voulodimos
Giorgos Stamou
HILM
LRM
218
0
0
04 Mar 2025
Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs
International Conference on Learning Representations (ICLR), 2025
Yuzhe Gu
Feiyu Xiong
Chengqi Lyu
Dahua Lin
Kai Chen
241
12
0
04 Mar 2025
LLM as a Broken Telephone: Iterative Generation Distorts Information
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Amr Mohamed
Mingmeng Geng
Michalis Vazirgiannis
Guokan Shang
350
3
0
27 Feb 2025
Winning Big with Small Models: Knowledge Distillation vs. Self-Training for Reducing Hallucination in Product QA Agents
A. Lewis
Michael White
Jing Liu
T. Koike-Akino
K. Parsons
Yanjie Wang
HILM
265
1
0
26 Feb 2025
Conformal Linguistic Calibration: Trading-off between Factuality and Specificity
Zhengping Jiang
Anqi Liu
Benjamin Van Durme
452
7
0
26 Feb 2025
Know You First and Be You Better: Modeling Human-Like User Simulators via Implicit Profiles
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Kuang Wang
Xianrui Li
Steve Yang
Li Zhou
Feng Jiang
Haoyang Li
308
6
0
26 Feb 2025
Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Hao Peng
Yunjia Qi
Xiaozhi Wang
Zijun Yao
Bin Xu
Lei Hou
Juanzi Li
ALM
LRM
186
25
0
26 Feb 2025
FactReasoner: A Probabilistic Approach to Long-Form Factuality Assessment for Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
Radu Marinescu
D. Bhattacharjya
Junkyu Lee
T. Tchrakian
Javier Carnerero-Cano
Yufang Hou
Elizabeth M. Daly
Alessandra Pascale
HILM
LRM
242
2
0
25 Feb 2025
Uncertainty Quantification in Retrieval Augmented Question Answering
Laura Perez-Beltrachini
Mirella Lapata
RALM
459
4
0
25 Feb 2025
Faster, Cheaper, Better: Multi-Objective Hyperparameter Optimization for LLM and RAG Systems
Matthew Barker
Andrew Bell
Evan Thomas
James Carr
Thomas Andrews
Umang Bhatt
337
4
0
25 Feb 2025
Is Free Self-Alignment Possible?
Dyah Adila
Changho Shin
Yijing Zhang
Frederic Sala
MoMe
370
2
0
24 Feb 2025
AI Realtor: Towards Grounded Persuasive Language Generation for Automated Copywriting
Jibang Wu
Chenghao Yang
Simon Mahns
Simon Mahns
Hao Zhu
Hao Zhu
Haifeng Xu
Haifeng Xu
328
5
0
24 Feb 2025
Beyond Translation: LLM-Based Data Generation for Multilingual Fact-Checking
Yi-Ling Chung
Aurora Cobo
Pablo Serna
SyDa
HILM
194
6
0
24 Feb 2025
PosterSum: A Multimodal Benchmark for Scientific Poster Summarization
Rohit Saxena
Pasquale Minervini
Frank Keller
VLM
192
8
0
24 Feb 2025
Is Relevance Propagated from Retriever to Generator in RAG?
European Conference on Information Retrieval (ECIR), 2025
Fangzheng Tian
Debasis Ganguly
Craig Macdonald
RALM
202
6
0
24 Feb 2025
GraphCheck: Breaking Long-Term Text Barriers with Extracted Knowledge Graph-Powered Fact-Checking
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yingjian Chen
Haoran Liu
Yinhong Liu
Rui Yang
Han Yuan
...
Pengyuan Zhou
Peng Yuan Zhou
Qingyu Chen
James Caverlee
Irene Li
HILM
527
5
0
23 Feb 2025
Think Together and Work Better: Combining Humans' and LLMs' Think-Aloud Outcomes for Effective Text Evaluation
International Conference on Human Factors in Computing Systems (CHI), 2024
SeongYeub Chu
JongWoo Kim
MunYong Yi
306
12
0
21 Feb 2025
Hallucination Detection in Large Language Models with Metamorphic Relations
Borui Yang
Md Afif Al Mamun
Jie M. Zhang
Gias Uddin
HILM
396
17
0
20 Feb 2025
Rare Disease Differential Diagnosis with Large Language Models at Scale: From Abdominal Actinomycosis to Wilson's Disease
Elliot Schumacher
Dhruv Naik
Anitha Kannan
LM&MA
160
4
0
20 Feb 2025
How Much Do LLMs Hallucinate across Languages? On Multilingual Estimation of LLM Hallucination in the Wild
Saad Obaid ul Islam
Anne Lauscher
Goran Glavaš
HILM
LRM
406
8
0
18 Feb 2025
STRIVE: Structured Reasoning for Self-Improvement in Claim Verification
Haisong Gong
Jing Li
Junfei Wu
Sihan Yang
Shu Wu
Shu Wu
LRM
233
2
0
17 Feb 2025
HalluEntity: Benchmarking and Understanding Entity-Level Hallucination Detection
Min-Hsuan Yeh
Max Kamachee
Seongheon Park
Yixuan Li
HILM
357
3
0
17 Feb 2025
Injecting Domain-Specific Knowledge into Large Language Models: A Comprehensive Survey
Zirui Song
Bin Yan
Yuhan Liu
Miao Fang
Mingzhe Li
Rui Yan
Preslav Nakov
KELM
LM&MA
356
40
0
15 Feb 2025
Optimizing Knowledge Integration in Retrieval-Augmented Generation with Self-Selection
Yan Weng
Fengbin Zhu
Tong Ye
Haoyan Liu
Fuli Feng
Tat-Seng Chua
RALM
337
3
0
10 Feb 2025
OverThink: Slowdown Attacks on Reasoning LLMs
A. Kumar
Jaechul Roh
A. Naseh
Marzena Karpinska
Mohit Iyyer
Amir Houmansadr
Eugene Bagdasarian
LRM
520
50
0
04 Feb 2025
Context-Aware Hierarchical Merging for Long Document Summarization
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Litu Ou
Mirella Lapata
MoMe
975
2
0
03 Feb 2025
Learning to Explore and Select for Coverage-Conditioned Retrieval-Augmented Generation
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Takyoung Kim
Kyungjae Lee
Y. Jang
Ji Yong Cho
Gangwoo Kim
Minseok Cho
Moontae Lee
382
1
0
28 Jan 2025
OnionEval: An Unified Evaluation of Fact-conflicting Hallucination for Small-Large Language Models
Chongren Sun
Jian Wang
Di Wu
Benoit Boulet
HILM
LRM
217
5
0
22 Jan 2025
Iterative Tree Analysis for Medical Critics
Zenan Huang
Mingwei Li
Zheng Zhou
Youxin Jiang
759
0
0
18 Jan 2025
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking
Zekun Xi
Wenbiao Yin
Jizhan Fang
Jialong Wu
Runnan Fang
Ningyu Zhang
Jiang Yong
Pengjun Xie
Fei Huang
Ningyu Zhang
SyDa
LRM
386
12
0
16 Jan 2025
Enhancing Retrieval-Augmented Generation: A Study of Best Practices
International Conference on Computational Linguistics (COLING), 2025
Siran Li
Linus Stenzel
Carsten Eickhoff
Seyed Ali Bahrainian
RALM
3DV
193
22
0
13 Jan 2025
Lived Experience Not Found: LLMs Struggle to Align with Experts on Addressing Adverse Drug Reactions from Psychiatric Medication Use
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Mohit Chandra
Siddharth Sriraman
Gaurav Verma
Harneet Singh Khanuja
Jose Suarez Campayo
Zihang Li
Michael L. Birnbaum
M. D. Choudhury
AI4MH
314
12
0
08 Jan 2025
The FACTS Grounding Leaderboard: Benchmarking LLMs' Ability to Ground Responses to Long-Form Input
Alon Jacovi
Andrew Wang
Chris Alberti
Connie Tao
Jon Lipovetz
...
Rachana Fellinger
Rui Wang
Zizhao Zhang
Sasha Goldshtein
Dipanjan Das
HILM
ALM
419
30
0
06 Jan 2025
LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language Texts
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Helia Hashemi
J. Eisner
Corby Rosset
Benjamin Van Durme
Chris Kedzie
365
33
0
03 Jan 2025
A review of faithfulness metrics for hallucination assessment in Large Language Models
IEEE Journal on Selected Topics in Signal Processing (JSTSP), 2024
Ben Malin
Tatiana Kalganova
Nikoloas Boulgouris
HILM
363
7
0
03 Jan 2025
Evaluate Summarization in Fine-Granularity: Auto Evaluation with LLM
Dong Yuan
Eti Rastogi
Fen Zhao
Sagar Goyal
Gautam Naik
Sree Prasanna Rajagopal
154
2
0
31 Dec 2024
ComparisonQA: Evaluating Factuality Robustness of LLMs Through Knowledge Frequency Control and Uncertainty
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Qing Zong
Zhaoxiang Wang
Tianshi Zheng
Xiyu Ren
Yangqiu Song
345
11
0
28 Dec 2024
A Survey of Calibration Process for Black-Box LLMs
Liangru Xie
Hui Liu
Jingying Zeng
Xianfeng Tang
Yan Han
Chen Luo
Jing Huang
Zhen Li
Suhang Wang
Qi He
266
8
0
17 Dec 2024
Attention with Dependency Parsing Augmentation for Fine-Grained Attribution
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Qiang Ding
Lvzhou Luo
Yixuan Cao
Ping Luo
240
4
0
16 Dec 2024
Coverage-based Fairness in Multi-document Summarization
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Haoyuan Li
Yusen Zhang
Rui Zhang
Snigdha Chaturvedi
319
2
0
11 Dec 2024
HalluCana: Fixing LLM Hallucination with A Canary Lookahead
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Tianyi Li
Erenay Dayanik
Shubhi Tyagi
Andrea Pierleoni
HILM
257
1
0
10 Dec 2024
Previous
1
2
3
4
5
6
...
11
12
13
Next