FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
23 May 2023
Sewon Min
Kalpesh Krishna
Xinxi Lyu
M. Lewis
Anuj Kumar
Pang Wei Koh
Mohit Iyyer
Luke Zettlemoyer
Hannaneh Hajishirzi
HILM, ALM
arXiv: 2305.14251

Papers citing "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"

50 / 615 papers shown
BookWorm: A Dataset for Character Description and Analysis
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Argyrios Papoudakis
Mirella Lapata
Frank Keller
195
2
0
14 Oct 2024
Can Knowledge Graphs Make Large Language Models More Trustworthy? An Empirical Study Over Open-ended Question Answering
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Yuan Sui
Yufei He
Zifeng Ding
Bryan Hooi
HILM, RALM, ELM
497
22
0
10 Oct 2024
ReIFE: Re-evaluating Instruction-Following Evaluation
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Yixin Liu
Kejian Shi
Alexander R. Fabbri
Yilun Zhao
Peifeng Wang
Chien-Sheng Wu
Shafiq Joty
Arman Cohan
215
13
0
09 Oct 2024
LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple Constraints
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Thomas Palmeira Ferraz
Kartik Mehta
Yu-Hsiang Lin
Haw-Shiuan Chang
Shereen Oraby
Sijia Liu
Vivek Subramanian
Tagyoung Chung
Mohit Bansal
Nanyun Peng
295
26
0
09 Oct 2024
Uncovering Factor Level Preferences to Improve Human-Model Alignment
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Juhyun Oh
Eunsu Kim
Jiseon Kim
Wenda Xu
Inha Cha
William Yang Wang
Alice Oh
374
1
0
09 Oct 2024
ReFIR: Grounding Large Restoration Models with Retrieval Augmentation
Neural Information Processing Systems (NeurIPS), 2024
Hang Guo
Tao Dai
Zhihao Ouyang
Taolin Zhang
Yaohua Zha
Bin Chen
Shu-Tao Xia
DiffM
208
10
0
08 Oct 2024
Why am I seeing this: Democratizing End User Auditing for Online Content Recommendations
ACM Symposium on User Interface Software and Technology (UIST), 2024
Chaoran Chen
Leyang Li
Luke Cao
Yanfang Ye
Tianshi Li
Yaxing Yao
Toby Jia-jun Li
MLAU
249
6
0
07 Oct 2024
Realizing Video Summarization from the Path of Language-based Semantic Understanding
Kuan-Chen Mu
Zhi-Yi Chin
Wei-Chen Chiu
169
0
0
06 Oct 2024
Locating Information Gaps and Narrative Inconsistencies Across Languages: A Case Study of LGBT People Portrayals on Wikipedia
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Farhan Samir
Chan Young Park
Anjalie Field
Vered Shwartz
Yulia Tsvetkov
172
8
0
05 Oct 2024
CS4: Measuring the Creativity of Large Language Models Automatically by Controlling the Number of Story-Writing Constraints
Anirudh Atmakuru
Jatin Nainani
Rohith Siddhartha Reddy Bheemreddy
Anirudh Lakkaraju
Zonghai Yao
Hamed Zamani
Haw-Shiuan Chang
367
12
0
05 Oct 2024
ECon: On the Detection and Resolution of Evidence Conflicts
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Cheng Jiayang
Chunkit Chan
Qianqian Zhuang
Lin Qiu
Tianhang Zhang
Tengxiao Liu
Yangqiu Song
Yue Zhang
Pengfei Liu
Zheng Zhang
260
13
0
05 Oct 2024
FactCheckmate: Preemptively Detecting and Mitigating Hallucinations in LMs
Deema Alnuhait
Neeraja Kirtane
Muhammad Khalifa
Hao Peng
LRM, HILM
367
7
0
03 Oct 2024
Loki: An Open-Source Tool for Fact Verification
International Conference on Computational Linguistics (COLING), 2024
Haonan Li
Xudong Han
Hao Wang
Yuxia Wang
Minghan Wang
Daniil Vasilev
Yilin Geng
Zenan Zhai
Preslav Nakov
Timothy Baldwin
SyDa, HILM
619
16
0
02 Oct 2024
Open-RAG: Enhanced Retrieval-Augmented Reasoning with Open-Source Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Shayekh Bin Islam
Md Asib Rahman
K S M Tozammel Hossain
Enamul Hoque
Shafiq Joty
Md. Rizwan Parvez
RALM, AIFin, LRM, VLM
207
36
0
02 Oct 2024
FactAlign: Long-form Factuality Alignment of Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Chao-Wei Huang
Yun-Nung Chen
HILM
142
11
0
02 Oct 2024
FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"
International Conference on Learning Representations (ICLR), 2024
Yifei Ming
Senthil Purushwalkam
Shrey Pandit
Zixuan Ke
Xuan-Phi Nguyen
Caiming Xiong
Shafiq Joty
HILM
619
44
0
30 Sep 2024
CoTKR: Chain-of-Thought Enhanced Knowledge Rewriting for Complex Knowledge Graph Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Yike Wu
Yi Huang
Nan Hu
Yuncheng Hua
Guilin Qi
Jiaoyan Chen
Jeff Z. Pan
388
21
0
29 Sep 2024
Model-based Preference Optimization in Abstractive Summarization without Human Feedback
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Jaepill Choi
Kyubyung Chae
Jiwoo Song
Yohan Jo
Taesup Kim
426
7
0
27 Sep 2024
HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection
Neural Information Processing Systems (NeurIPS), 2024
Xuefeng Du
Chaowei Xiao
Yixuan Li
HILM
254
60
0
26 Sep 2024
Enhancing Post-Hoc Attributions in Long Document Comprehension via Coarse Grained Answer Decomposition
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Pritika Ramu
Koustava Goswami
Apoorv Saxena
Balaji Vasan Srinivasan
289
10
0
25 Sep 2024
LINKAGE: Listwise Ranking among Varied-Quality References for Non-Factoid QA Evaluation via LLMs
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Sihui Yang
Keping Bi
Wanqing Cui
Jiafeng Guo
Xueqi Cheng
261
5
0
23 Sep 2024
The Ability of Large Language Models to Evaluate Constraint-satisfaction in Agent Responses to Open-ended Requests
Lior Madmoni
Amir Zait
Ilia Labzovsky
Danny Karmon
ELM
157
1
0
22 Sep 2024
The Factuality of Large Language Models in the Legal Domain
International Conference on Information and Knowledge Management (CIKM), 2024
Rajaa El Hamdani
Thomas Bonald
Fragkiskos D. Malliaros
Nils Holzenberger
Fabian M. Suchanek
AILaw, HILM
260
11
0
18 Sep 2024
LLM-as-a-Judge & Reward Model: What They Can and Cannot Do
Guijin Son
Hyunwoo Ko
Hoyoung Lee
Yewon Kim
Seunghyeok Hong
ALM, ELM
336
16
0
17 Sep 2024
HALO: Hallucination Analysis and Learning Optimization to Empower LLMs with Retrieval-Augmented Context for Guided Clinical Decision Making
IEEE/ACM International Conference on Connected Health: Applications, Systems and Engineering Technologies (CHASE), 2024
Sumera Anjum
Hanzhi Zhang
Wenjun Zhou
Eun Jin Paek
Xiaopeng Zhao
Yunhe Feng
451
7
0
16 Sep 2024
Gaps or Hallucinations? Gazing into Machine-Generated Legal Analysis for Fine-grained Text Evaluations
Abe Bohan Hou
William Jurayj
Nils Holzenberger
Andrew Blair-Stanek
Benjamin Van Durme
ELM
244
2
0
16 Sep 2024
NovAScore: A New Automated Metric for Evaluating Document Level Novelty
International Conference on Computational Linguistics (COLING), 2024
Lin Ai
Ziwei Gong
Harshsaiprasad Deshpande
Alexander Johnson
Emmy Phung
Ahmad Emami
Julia Hirschberg
154
3
0
14 Sep 2024
When Context Leads but Parametric Memory Follows in Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Yufei Tao
Adam Hiatt
Erik Haake
Antonie J. Jetter
Ameeta Agrawal
KELM
355
6
0
13 Sep 2024
AI-LieDar: Examine the Trade-off Between Utility and Truthfulness in LLM Agents
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Zhe Su
Xuhui Zhou
Sanketh Rangreji
Anubha Kabra
Julia Mendelsohn
Faeze Brahman
Maarten Sap
LLMAG
393
20
0
13 Sep 2024
Synthetic continued pretraining
International Conference on Learning Representations (ICLR), 2024
Zitong Yang
Neil Band
Shuangping Li
Emmanuel Candès
Tatsunori Hashimoto
CLL, SyDa
348
35
0
11 Sep 2024
GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering
International Conference on Computational Linguistics (COLING), 2024
Sacha Muller
António Loison
Bilel Omrani
Gautier Viaud
RALM, ELM
423
4
0
10 Sep 2024
Enhancing Temporal Understanding in Audio Question Answering for Large Audio Language Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
A. Sridhar
Yinyi Guo
Erik M. Visser
AuLLM
300
4
0
10 Sep 2024
What is the Role of Small Models in the LLM Era: A Survey
Lihu Chen
Gaël Varoquaux
ALM
777
55
0
10 Sep 2024
Hallucination Detection in LLMs: Fast and Memory-Efficient Finetuned Models
Gabriel Y. Arteaga
Thomas B. Schon
Nicolas Pielawski
331
20
0
04 Sep 2024
Generating Media Background Checks for Automated Source Critical Reasoning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Michael Schlichtkrull
240
7
0
01 Sep 2024
ContextCite: Attributing Model Generation to Context
Neural Information Processing Systems (NeurIPS), 2024
Benjamin Cohen-Wang
Harshay Shah
Kristian Georgiev
Aleksander Madry
LRM
356
59
0
01 Sep 2024
LoraMap: Harnessing the Power of LoRA Connections
Hyeryun Park
Jeongwon Kwak
Dongsuk Jang
Sumin Park
Jinwook Choi
MoMe
204
0
0
29 Aug 2024
Measuring text summarization factuality using atomic facts entailment metrics in the context of retrieval augmented generation
N. E. Kriman
HILM
202
3
0
27 Aug 2024
What Makes a Good Story and How Can We Measure It? A Comprehensive Survey of Story Evaluation
Dingyi Yang
Qin Jin
407
15
0
26 Aug 2024
Claim Verification in the Age of Large Language Models: A Survey
A. Dmonte
Roland Oruche
Marcos Zampieri
Prasad Calyam
Isabelle Augenstein
593
19
0
26 Aug 2024
SLM Meets LLM: Balancing Latency, Interpretability and Consistency in Hallucination Detection
Mengya Hu
Rui Xu
Deren Lei
Yaxi Li
Mingyu Wang
Emily Ching
Eslam Kamal
Alex Deng
170
6
0
22 Aug 2024
RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Xuanwang Zhang
Yunze Song
Yidong Wang
Shuyun Tang
Xinfeng Li
...
Wei Dong
Yue Zhang
Xinyu Dai
Shikun Zhang
Qingsong Wen
325
8
0
21 Aug 2024
Analysis of Plan-based Retrieval for Grounded Text Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Ameya Godbole
Nicholas Monath
Seungyeon Kim
A. S. Rawat
Andrew McCallum
Manzil Zaheer
RALM
264
4
0
20 Aug 2024
Unconditional Truthfulness: Learning Unconditional Uncertainty of Large Language Models
Artem Vazhentsev
Ekaterina Fadeeva
Daniil Vasilev
Sergey Petrakov
Ivan Lazichny
Ilseyar Alimova
Preslav Nakov
Timothy Baldwin
Maxim Panov
Artem Shelmanov
HILM
245
3
0
20 Aug 2024
Web Retrieval Agents for Evidence-Based Misinformation Detection
Jacob-Junqi Tian
Hao Yu
Yury Orlovskiy
Tyler Vergho
Mauricio Rivera
Mayank Goel
Zachary Yang
Jean-Francois Godbout
Reihaneh Rabbany
Kellin Pelrine
LLMAG, OffRL
262
14
0
15 Aug 2024
Zero-shot Factual Consistency Evaluation Across Domains
Raunak Agarwal
HILM
322
1
0
07 Aug 2024
DebateQA: Evaluating Question Answering on Debatable Knowledge
Rongwu Xu
Xuan Qi
Zehan Qi
Wei Xu
Zhijiang Guo
ELM
223
11
0
02 Aug 2024
Misinforming LLMs: vulnerabilities, challenges and opportunities
Jaroslaw Kornowicz
Daniel Geissler
Kirsten Thommes
138
6
0
02 Aug 2024
A Course Shared Task on Evaluating LLM Output for Clinical Questions
Yufang Hou
Thy Thy Tran
Doan Nam Long Vu
Yiwen Cao
Kai Li
Lukas Rohde
Iryna Gurevych
LM&MA, ELM
164
0
0
31 Jul 2024
CLR-Fact: Evaluating the Complex Logical Reasoning Capability of Large Language Models over Factual Knowledge
Tianshi Zheng
Jiaxin Bai
Yicheng Wang
Tianqing Fang
Yue Guo
Yauwai Yim
Yangqiu Song
ELM, LRM
263
6
0
30 Jul 2024