ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2104.04302
  4. Cited By
Annotating and Modeling Fine-grained Factuality in Summarization

Annotating and Modeling Fine-grained Factuality in Summarization

North American Chapter of the Association for Computational Linguistics (NAACL), 2021
9 April 2021
Tanya Goyal
Greg Durrett
    HILM
ArXiv (abs)PDFHTML

Papers citing "Annotating and Modeling Fine-grained Factuality in Summarization"

50 / 119 papers shown
Title
Stress Testing Factual Consistency Metrics for Long-Document Summarization
Stress Testing Factual Consistency Metrics for Long-Document Summarization
Zain Muhammad Mujahid
Dustin Wright
Isabelle Augenstein
HILM
89
0
0
10 Nov 2025
Gaze-VLM:Bridging Gaze and VLMs through Attention Regularization for Egocentric Understanding
Gaze-VLM:Bridging Gaze and VLMs through Attention Regularization for Egocentric Understanding
Anupam Pani
Yanchao Yang
60
0
0
24 Oct 2025
Enhancing Faithfulness in Abstractive Summarization via Span-Level Fine-Tuning
Enhancing Faithfulness in Abstractive Summarization via Span-Level Fine-Tuning
Sicong Huang
Qianqi Yan
Shengze Wang
Ian Lane
HILM
121
0
0
10 Oct 2025
Does Context Matter? ContextualJudgeBench for Evaluating LLM-based Judges in Contextual Settings
Does Context Matter? ContextualJudgeBench for Evaluating LLM-based Judges in Contextual SettingsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Austin Xu
Srijan Bansal
Yifei Ming
Semih Yavuz
Shafiq Joty
ELM
312
13
0
19 Mar 2025
SCOPE: A Self-supervised Framework for Improving Faithfulness in Conditional Text Generation
SCOPE: A Self-supervised Framework for Improving Faithfulness in Conditional Text GenerationInternational Conference on Learning Representations (ICLR), 2025
Song Duong
Florian Le Bronnec
Alexandre Allauzen
Vincent Guigue
Alberto Lumbreras
Laure Soulier
Patrick Gallinari
HILM
225
3
0
20 Feb 2025
Bridging Context Gaps: Enhancing Comprehension in Long-Form Social Conversations Through Contextualized Excerpts
Bridging Context Gaps: Enhancing Comprehension in Long-Form Social Conversations Through Contextualized ExcerptsInternational Conference on Computational Linguistics (COLING), 2024
Shrestha Mohanty
Sarah Xuan
Jacob Jobraeel
Anurag Kumar
Deb Roy
Jad Kabbara
250
0
0
31 Dec 2024
Learning to Verify Summary Facts with Fine-Grained LLM Feedback
Learning to Verify Summary Facts with Fine-Grained LLM FeedbackInternational Conference on Computational Linguistics (COLING), 2024
Jihwan Oh
J. Choi
Nicole Hee-Yeon Kim
Taewon Yun
Hwanjun Song
SyDaALMHILM
269
2
0
14 Dec 2024
Do Automatic Factuality Metrics Measure Factuality? A Critical Evaluation
Do Automatic Factuality Metrics Measure Factuality? A Critical Evaluation
S. Ramprasad
Byron C. Wallace
LLMAGHILM
547
7
0
25 Nov 2024
From Single to Multi: How LLMs Hallucinate in Multi-Document Summarization
From Single to Multi: How LLMs Hallucinate in Multi-Document SummarizationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Catarina G. Belem
Pouya Pezeskhpour
Hayate Iso
Seiji Maekawa
Nikita Bhutani
Estevam R. Hruschka
HILM
273
10
0
17 Oct 2024
Using Similarity to Evaluate Factual Consistency in Summaries
Using Similarity to Evaluate Factual Consistency in Summaries
Yuxuan Ye
Edwin Simpson
Raul Santos Rodriguez
HILM
136
4
0
23 Sep 2024
Leveraging Entailment Judgements in Cross-Lingual Summarisation
Leveraging Entailment Judgements in Cross-Lingual SummarisationAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Huajian Zhang
Laura Perez-Beltrachini
HILM
162
2
0
01 Aug 2024
Multilingual Fine-Grained News Headline Hallucination Detection
Multilingual Fine-Grained News Headline Hallucination Detection
Jiaming Shen
Tianqi Liu
Jialu Liu
Zhen Qin
Jay Pavagadhi
Simon Baumgartner
Michael Bendersky
161
0
0
22 Jul 2024
Molecular Facts: Desiderata for Decontextualization in LLM Fact
  Verification
Molecular Facts: Desiderata for Decontextualization in LLM Fact Verification
Anisha Gunjal
Greg Durrett
HILM
235
34
0
28 Jun 2024
FastMem: Fast Memorization of Prompt Improves Context Awareness of Large
  Language Models
FastMem: Fast Memorization of Prompt Improves Context Awareness of Large Language Models
Junyi Zhu
Shuochen Liu
Yu Yu
Bo Tang
Yibo Yan
Zhiyu Li
Feiyu Xiong
Tong Xu
Matthew B. Blaschko
180
6
0
23 Jun 2024
Factual Dialogue Summarization via Learning from Large Language Models
Factual Dialogue Summarization via Learning from Large Language Models
Rongxin Zhu
Jey Han Lau
Jianzhong Qi
HILM
223
6
0
20 Jun 2024
Detecting Errors through Ensembling Prompts (DEEP): An End-to-End LLM
  Framework for Detecting Factual Errors
Detecting Errors through Ensembling Prompts (DEEP): An End-to-End LLM Framework for Detecting Factual ErrorsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Alex Chandler
Devesh Surve
Hui Su
HILMUQCV
121
3
0
18 Jun 2024
Analyzing LLM Behavior in Dialogue Summarization: Unveiling
  Circumstantial Hallucination Trends
Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends
S. Ramprasad
Elisa Ferracane
Zachary Chase Lipton
HILM
160
19
0
05 Jun 2024
Towards Detecting LLMs Hallucination via Markov Chain-based Multi-agent
  Debate Framework
Towards Detecting LLMs Hallucination via Markov Chain-based Multi-agent Debate Framework
Xiaoxi Sun
Jinpeng Li
Yan Zhong
Dongyan Zhao
Rui Yan
LLMAGHILM
192
17
0
05 Jun 2024
A Survey of Automatic Hallucination Evaluation on Natural Language Generation
A Survey of Automatic Hallucination Evaluation on Natural Language Generation
Siya Qi
Petr Slovak
Yulan He
Zheng Yuan
LRMHILM
301
1
0
18 Apr 2024
FIZZ: Factual Inconsistency Detection by Zoom-in Summary and Zoom-out
  Document
FIZZ: Factual Inconsistency Detection by Zoom-in Summary and Zoom-out Document
Joonho Yang
Seunghyun Yoon
Byeongjeong Kim
Hwanhee Lee
HILM
273
13
0
17 Apr 2024
MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents
MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents
Liyan Tang
Philippe Laban
Greg Durrett
HILMSyDa
259
160
0
16 Apr 2024
On the Benefits of Fine-Grained Loss Truncation: A Case Study on
  Factuality in Summarization
On the Benefits of Fine-Grained Loss Truncation: A Case Study on Factuality in SummarizationConference of the European Chapter of the Association for Computational Linguistics (EACL), 2024
Lorenzo Jaime Yu Flores
Arman Cohan
HILM
211
2
0
09 Mar 2024
Know Your Audience: The benefits and pitfalls of generating plain
  language summaries beyond the "general" audience
Know Your Audience: The benefits and pitfalls of generating plain language summaries beyond the "general" audienceInternational Conference on Human Factors in Computing Systems (CHI), 2024
Tal August
Kyle Lo
Noah A. Smith
Katharina Reinecke
249
21
0
08 Mar 2024
German also Hallucinates! Inconsistency Detection in News Summaries with
  the Absinth Dataset
German also Hallucinates! Inconsistency Detection in News Summaries with the Absinth Dataset
Laura Mascarell
Ribin Chalumattu
Annette Rios
HILM
140
1
0
06 Mar 2024
A Comprehensive Survey on Process-Oriented Automatic Text Summarization with Exploration of LLM-Based Methods
A Comprehensive Survey on Process-Oriented Automatic Text Summarization with Exploration of LLM-Based Methods
Hanlei Jin
Yang Zhang
Dan Meng
Jun Wang
Jinghua Tan
531
162
0
05 Mar 2024
FENICE: Factuality Evaluation of summarization based on Natural language
  Inference and Claim Extraction
FENICE: Factuality Evaluation of summarization based on Natural language Inference and Claim Extraction
Alessandro Sciré
Karim Ghonim
Roberto Navigli
HILM
202
20
0
04 Mar 2024
Fine-Grained Natural Language Inference Based Faithfulness Evaluation
  for Diverse Summarisation Tasks
Fine-Grained Natural Language Inference Based Faithfulness Evaluation for Diverse Summarisation Tasks
Huajian Zhang
Yumo Xu
Laura Perez-Beltrachini
HILM
161
23
0
27 Feb 2024
TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue
  Summarization
TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization
Liyan Tang
Igor Shalyminov
Amy Wing-mei Wong
Jon Burnsky
Jake W. Vincent
...
Hang Su
Lijia Sun
Yi Zhang
Saab Mansour
Kathleen McKeown
HILM
165
70
0
20 Feb 2024
Identifying Factual Inconsistencies in Summaries: Grounding Model
  Inference via Task Taxonomy
Identifying Factual Inconsistencies in Summaries: Grounding Model Inference via Task Taxonomy
Liyan Xu
Zhenlin Su
Mo Yu
Jin Xu
Jinho D. Choi
Jie Zhou
Fei Liu
HILM
258
5
0
20 Feb 2024
GenAudit: Fixing Factual Errors in Language Model Outputs with Evidence
GenAudit: Fixing Factual Errors in Language Model Outputs with Evidence
Kundan Krishna
S. Ramprasad
Prakhar Gupta
Byron C. Wallace
Zachary Chase Lipton
Jeffrey P. Bigham
HILMKELMSyDa
304
15
0
19 Feb 2024
FactPICO: Factuality Evaluation for Plain Language Summarization of
  Medical Evidence
FactPICO: Factuality Evaluation for Plain Language Summarization of Medical Evidence
Sebastian Antony Joseph
Lily Chen
Jan Trienes
Hannah Louisa Göke
Monika Coers
Wei Xu
Byron C. Wallace
Junyi Jessy Li
LM&MAHILM
153
21
0
18 Feb 2024
Fine-grained and Explainable Factuality Evaluation for Multimodal Summarization
Fine-grained and Explainable Factuality Evaluation for Multimodal Summarization
Liqiang Jing
Jingxuan Zuo
Yue Zhang
Liqiang Jing
267
13
0
18 Feb 2024
Can LLMs Produce Faithful Explanations For Fact-checking? Towards
  Faithful Explainable Fact-Checking via Multi-Agent Debate
Can LLMs Produce Faithful Explanations For Fact-checking? Towards Faithful Explainable Fact-Checking via Multi-Agent Debate
Kyungha Kim
Sangyun Lee
Kung-Hsiang Huang
Hou Pong Chan
Pengfei Yu
Chenhui Xu
LRM
280
54
0
12 Feb 2024
Evaluating the Factuality of Zero-shot Summarizers Across Varied Domains
Evaluating the Factuality of Zero-shot Summarizers Across Varied DomainsConference of the European Chapter of the Association for Computational Linguistics (EACL), 2024
S. Ramprasad
Kundan Krishna
Zachary Chase Lipton
Byron C. Wallace
HILM
136
8
0
05 Feb 2024
BatchEval: Towards Human-like Text Evaluation
BatchEval: Towards Human-like Text EvaluationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Peiwen Yuan
Shaoxiong Feng
Yiwei Li
Xinglin Wang
Boyuan Pan
Heda Wang
Kan Li
ALM
191
16
0
31 Dec 2023
Do Androids Know They're Only Dreaming of Electric Sheep?
Do Androids Know They're Only Dreaming of Electric Sheep?Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Sky CH-Wang
Benjamin Van Durme
Jason Eisner
Chris Kedzie
HILM
226
52
0
28 Dec 2023
P^3SUM: Preserving Author's Perspective in News Summarization with
  Diffusion Language Models
P^3SUM: Preserving Author's Perspective in News Summarization with Diffusion Language Models
Yuhan Liu
Shangbin Feng
Xiaochuang Han
Vidhisha Balachandran
Chan Young Park
Sachin Kumar
Yulia Tsvetkov
DiffM
230
6
0
16 Nov 2023
AMRFact: Enhancing Summarization Factuality Evaluation with AMR-Driven
  Negative Samples Generation
AMRFact: Enhancing Summarization Factuality Evaluation with AMR-Driven Negative Samples Generation
Haoyi Qiu
Kung-Hsiang Huang
Jingnong Qu
Nanyun Peng
HILM
247
12
0
16 Nov 2023
A Survey on Hallucination in Large Language Models: Principles,
  Taxonomy, Challenges, and Open Questions
A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions
Lei Huang
Weijiang Yu
Weitao Ma
Weihong Zhong
Zhangyin Feng
...
Qianglong Chen
Weihua Peng
Xiaocheng Feng
Bing Qin
Ting Liu
LRMHILM
322
1,755
0
09 Nov 2023
'Don't Get Too Technical with Me': A Discourse Structure-Based Framework
  for Science Journalism
'Don't Get Too Technical with Me': A Discourse Structure-Based Framework for Science Journalism
Ronald Cardenas
Bingsheng Yao
Dakuo Wang
Yufang Hou
226
0
0
23 Oct 2023
Calibrating Likelihoods towards Consistency in Summarization Models
Calibrating Likelihoods towards Consistency in Summarization Models
Polina Zablotskaia
Misha Khalman
Rishabh Joshi
Livio Baldini Soares
Shoshana Jakobovits
Joshua Maynez
Shashi Narayan
119
5
0
12 Oct 2023
Quantifying the Plausibility of Context Reliance in Neural Machine
  Translation
Quantifying the Plausibility of Context Reliance in Neural Machine TranslationInternational Conference on Learning Representations (ICLR), 2023
Gabriele Sarti
Grzegorz Chrupala
Malvina Nissim
Arianna Bisazza
242
5
0
02 Oct 2023
BooookScore: A systematic exploration of book-length summarization in
  the era of LLMs
BooookScore: A systematic exploration of book-length summarization in the era of LLMsInternational Conference on Learning Representations (ICLR), 2023
Yapei Chang
Kyle Lo
Tanya Goyal
Mohit Iyyer
ALM
325
150
0
01 Oct 2023
Beyond the Chat: Executable and Verifiable Text-Editing with LLMs
Beyond the Chat: Executable and Verifiable Text-Editing with LLMsACM Symposium on User Interface Software and Technology (UIST), 2023
Philippe Laban
Jesse Vig
Marti A. Hearst
Caiming Xiong
Chien-Sheng Wu
KELM
228
48
0
27 Sep 2023
Evaluation of Faithfulness Using the Longest Supported Subsequence
Evaluation of Faithfulness Using the Longest Supported Subsequence
Anirudh Mittal
Timo Schick
Mikel Artetxe
Jane Dwivedi-Yu
ALM
134
2
0
23 Aug 2023
PromptSum: Parameter-Efficient Controllable Abstractive Summarization
PromptSum: Parameter-Efficient Controllable Abstractive Summarization
Mathieu Ravaut
Hailin Chen
Ruochen Zhao
Chengwei Qin
Shafiq Joty
Nancy Chen
129
3
0
06 Aug 2023
Improving Factuality of Abstractive Summarization via Contrastive Reward
  Learning
Improving Factuality of Abstractive Summarization via Contrastive Reward Learning
Ethan Chern
Zhiruo Wang
Sanjan Das
Bhavuk Sharma
Pengfei Liu
Graham Neubig
HILM
175
14
0
10 Jul 2023
Opportunities and Risks of LLMs for Scalable Deliberation with Polis
Opportunities and Risks of LLMs for Scalable Deliberation with Polis
Christopher T. Small
Ivan Vendrov
Esin Durmus
Hadjar Homaei
Elizabeth Barry
Julien Cornebise
Ted Suzman
Deep Ganguli
Colin Megill
164
48
0
20 Jun 2023
Reference Matters: Benchmarking Factual Error Correction for Dialogue
  Summarization with Fine-grained Evaluation Framework
Reference Matters: Benchmarking Factual Error Correction for Dialogue Summarization with Fine-grained Evaluation FrameworkAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Mingqi Gao
Xiaojun Wan
Jia Su
Zhefeng Wang
Baoxing Huai
HILM
143
10
0
08 Jun 2023
Multi-Dimensional Evaluation of Text Summarization with In-Context
  Learning
Multi-Dimensional Evaluation of Text Summarization with In-Context LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Sameer Jain
Vaishakh Keshava
Swarnashree Mysore Sathyendra
Patrick Fernandes
Pengfei Liu
Graham Neubig
Chunting Zhou
ELM
178
51
0
01 Jun 2023
123
Next