Annotating and Modeling Fine-grained Factuality in Summarization

North American Chapter of the Association for Computational Linguistics (NAACL), 2021

9 April 2021

Papers citing "Annotating and Modeling Fine-grained Factuality in Summarization"

50 / 119 papers shown

Title
Stress Testing Factual Consistency Metrics for Long-Document Summarization Zain Muhammad Mujahid Dustin Wright Isabelle Augenstein HILM 89 0 0 10 Nov 2025
Gaze-VLM:Bridging Gaze and VLMs through Attention Regularization for Egocentric Understanding Anupam Pani Yanchao Yang 60 0 0 24 Oct 2025
Enhancing Faithfulness in Abstractive Summarization via Span-Level Fine-Tuning Sicong Huang Qianqi Yan Shengze Wang Ian Lane HILM 121 0 0 10 Oct 2025
Does Context Matter? ContextualJudgeBench for Evaluating LLM-based Judges in Contextual SettingsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 Austin Xu Srijan Bansal Yifei Ming Semih Yavuz Shafiq Joty ELM 312 13 0 19 Mar 2025
SCOPE: A Self-supervised Framework for Improving Faithfulness in Conditional Text GenerationInternational Conference on Learning Representations (ICLR), 2025 Song Duong Florian Le Bronnec Alexandre Allauzen Vincent Guigue Alberto Lumbreras Laure Soulier Patrick Gallinari HILM 225 3 0 20 Feb 2025
Bridging Context Gaps: Enhancing Comprehension in Long-Form Social Conversations Through Contextualized ExcerptsInternational Conference on Computational Linguistics (COLING), 2024 Shrestha Mohanty Sarah Xuan Jacob Jobraeel Anurag Kumar Deb Roy Jad Kabbara 250 0 0 31 Dec 2024
Learning to Verify Summary Facts with Fine-Grained LLM FeedbackInternational Conference on Computational Linguistics (COLING), 2024 Jihwan Oh J. Choi Nicole Hee-Yeon Kim Taewon Yun Hwanjun Song SyDa ALM HILM 269 2 0 14 Dec 2024
Do Automatic Factuality Metrics Measure Factuality? A Critical Evaluation S. Ramprasad Byron C. Wallace LLMAG HILM 547 7 0 25 Nov 2024
From Single to Multi: How LLMs Hallucinate in Multi-Document SummarizationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024 Catarina G. Belem Pouya Pezeskhpour Hayate Iso Seiji Maekawa Nikita Bhutani Estevam R. Hruschka HILM 273 10 0 17 Oct 2024
Using Similarity to Evaluate Factual Consistency in Summaries Yuxuan Ye Edwin Simpson Raul Santos Rodriguez HILM 136 4 0 23 Sep 2024
Leveraging Entailment Judgements in Cross-Lingual SummarisationAnnual Meeting of the Association for Computational Linguistics (ACL), 2024 Huajian Zhang Laura Perez-Beltrachini HILM 162 2 0 01 Aug 2024
Multilingual Fine-Grained News Headline Hallucination Detection Jiaming Shen Tianqi Liu Jialu Liu Zhen Qin Jay Pavagadhi Simon Baumgartner Michael Bendersky 161 0 0 22 Jul 2024
Molecular Facts: Desiderata for Decontextualization in LLM Fact Verification Anisha Gunjal Greg Durrett HILM 235 34 0 28 Jun 2024
FastMem: Fast Memorization of Prompt Improves Context Awareness of Large Language Models Junyi Zhu Shuochen Liu Yu Yu Bo Tang Yibo Yan Zhiyu Li Feiyu Xiong Tong Xu Matthew B. Blaschko 180 6 0 23 Jun 2024
Factual Dialogue Summarization via Learning from Large Language Models Rongxin Zhu Jey Han Lau Jianzhong Qi HILM 223 6 0 20 Jun 2024
Detecting Errors through Ensembling Prompts (DEEP): An End-to-End LLM Framework for Detecting Factual ErrorsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 Alex Chandler Devesh Surve Hui Su HILM UQCV 121 3 0 18 Jun 2024
Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends S. Ramprasad Elisa Ferracane Zachary Chase Lipton HILM 160 19 0 05 Jun 2024
Towards Detecting LLMs Hallucination via Markov Chain-based Multi-agent Debate Framework Xiaoxi Sun Jinpeng Li Yan Zhong Dongyan Zhao Rui Yan LLMAG HILM 192 17 0 05 Jun 2024
A Survey of Automatic Hallucination Evaluation on Natural Language Generation Siya Qi Petr Slovak Yulan He Zheng Yuan LRM HILM 301 1 0 18 Apr 2024
FIZZ: Factual Inconsistency Detection by Zoom-in Summary and Zoom-out Document Joonho Yang Seunghyun Yoon Byeongjeong Kim Hwanhee Lee HILM 273 13 0 17 Apr 2024
MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents Liyan Tang Philippe Laban Greg Durrett HILM SyDa 259 160 0 16 Apr 2024
On the Benefits of Fine-Grained Loss Truncation: A Case Study on Factuality in SummarizationConference of the European Chapter of the Association for Computational Linguistics (EACL), 2024 Lorenzo Jaime Yu Flores Arman Cohan HILM 211 2 0 09 Mar 2024
Know Your Audience: The benefits and pitfalls of generating plain language summaries beyond the "general" audienceInternational Conference on Human Factors in Computing Systems (CHI), 2024 Tal August Kyle Lo Noah A. Smith Katharina Reinecke 249 21 0 08 Mar 2024
German also Hallucinates! Inconsistency Detection in News Summaries with the Absinth Dataset Laura Mascarell Ribin Chalumattu Annette Rios HILM 140 1 0 06 Mar 2024
A Comprehensive Survey on Process-Oriented Automatic Text Summarization with Exploration of LLM-Based Methods Hanlei Jin Yang Zhang Dan Meng Jun Wang Jinghua Tan 531 162 0 05 Mar 2024
FENICE: Factuality Evaluation of summarization based on Natural language Inference and Claim Extraction Alessandro Sciré Karim Ghonim Roberto Navigli HILM 202 20 0 04 Mar 2024
Fine-Grained Natural Language Inference Based Faithfulness Evaluation for Diverse Summarisation Tasks Huajian Zhang Yumo Xu Laura Perez-Beltrachini HILM 161 23 0 27 Feb 2024
TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization Liyan Tang Igor Shalyminov Amy Wing-mei Wong Jon Burnsky Jake W. Vincent ... Hang Su Lijia Sun Yi Zhang Saab Mansour Kathleen McKeown HILM 165 70 0 20 Feb 2024
Identifying Factual Inconsistencies in Summaries: Grounding Model Inference via Task Taxonomy Liyan Xu Zhenlin Su Mo Yu Jin Xu Jinho D. Choi Jie Zhou Fei Liu HILM 258 5 0 20 Feb 2024
GenAudit: Fixing Factual Errors in Language Model Outputs with Evidence Kundan Krishna S. Ramprasad Prakhar Gupta Byron C. Wallace Zachary Chase Lipton Jeffrey P. Bigham HILM KELM SyDa 304 15 0 19 Feb 2024
FactPICO: Factuality Evaluation for Plain Language Summarization of Medical Evidence Sebastian Antony Joseph Lily Chen Jan Trienes Hannah Louisa Göke Monika Coers Wei Xu Byron C. Wallace Junyi Jessy Li LM&MA HILM 153 21 0 18 Feb 2024
Fine-grained and Explainable Factuality Evaluation for Multimodal Summarization Liqiang Jing Jingxuan Zuo Yue Zhang Liqiang Jing 267 13 0 18 Feb 2024
Can LLMs Produce Faithful Explanations For Fact-checking? Towards Faithful Explainable Fact-Checking via Multi-Agent Debate Kyungha Kim Sangyun Lee Kung-Hsiang Huang Hou Pong Chan Pengfei Yu Chenhui Xu LRM 280 54 0 12 Feb 2024
Evaluating the Factuality of Zero-shot Summarizers Across Varied DomainsConference of the European Chapter of the Association for Computational Linguistics (EACL), 2024 S. Ramprasad Kundan Krishna Zachary Chase Lipton Byron C. Wallace HILM 136 8 0 05 Feb 2024
BatchEval: Towards Human-like Text EvaluationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023 Peiwen Yuan Shaoxiong Feng Yiwei Li Xinglin Wang Boyuan Pan Heda Wang Kan Li ALM 191 16 0 31 Dec 2023
Do Androids Know They're Only Dreaming of Electric Sheep?Annual Meeting of the Association for Computational Linguistics (ACL), 2023 Sky CH-Wang Benjamin Van Durme Jason Eisner Chris Kedzie HILM 226 52 0 28 Dec 2023
P^3SUM: Preserving Author's Perspective in News Summarization with Diffusion Language Models Yuhan Liu Shangbin Feng Xiaochuang Han Vidhisha Balachandran Chan Young Park Sachin Kumar Yulia Tsvetkov DiffM 230 6 0 16 Nov 2023
AMRFact: Enhancing Summarization Factuality Evaluation with AMR-Driven Negative Samples Generation Haoyi Qiu Kung-Hsiang Huang Jingnong Qu Nanyun Peng HILM 247 12 0 16 Nov 2023
A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions Lei Huang Weijiang Yu Weitao Ma Weihong Zhong Zhangyin Feng ... Qianglong Chen Weihua Peng Xiaocheng Feng Bing Qin Ting Liu LRM HILM 322 1,755 0 09 Nov 2023
'Don't Get Too Technical with Me': A Discourse Structure-Based Framework for Science Journalism Ronald Cardenas Bingsheng Yao Dakuo Wang Yufang Hou 226 0 0 23 Oct 2023
Calibrating Likelihoods towards Consistency in Summarization Models Polina Zablotskaia Misha Khalman Rishabh Joshi Livio Baldini Soares Shoshana Jakobovits Joshua Maynez Shashi Narayan 119 5 0 12 Oct 2023
Quantifying the Plausibility of Context Reliance in Neural Machine TranslationInternational Conference on Learning Representations (ICLR), 2023 Gabriele Sarti Grzegorz Chrupala Malvina Nissim Arianna Bisazza 242 5 0 02 Oct 2023
BooookScore: A systematic exploration of book-length summarization in the era of LLMsInternational Conference on Learning Representations (ICLR), 2023 Yapei Chang Kyle Lo Tanya Goyal Mohit Iyyer ALM 325 150 0 01 Oct 2023
Beyond the Chat: Executable and Verifiable Text-Editing with LLMsACM Symposium on User Interface Software and Technology (UIST), 2023 Philippe Laban Jesse Vig Marti A. Hearst Caiming Xiong Chien-Sheng Wu KELM 228 48 0 27 Sep 2023
Evaluation of Faithfulness Using the Longest Supported Subsequence Anirudh Mittal Timo Schick Mikel Artetxe Jane Dwivedi-Yu ALM 134 2 0 23 Aug 2023
PromptSum: Parameter-Efficient Controllable Abstractive Summarization Mathieu Ravaut Hailin Chen Ruochen Zhao Chengwei Qin Shafiq Joty Nancy Chen 129 3 0 06 Aug 2023
Improving Factuality of Abstractive Summarization via Contrastive Reward Learning Ethan Chern Zhiruo Wang Sanjan Das Bhavuk Sharma Pengfei Liu Graham Neubig HILM 175 14 0 10 Jul 2023
Opportunities and Risks of LLMs for Scalable Deliberation with Polis Christopher T. Small Ivan Vendrov Esin Durmus Hadjar Homaei Elizabeth Barry Julien Cornebise Ted Suzman Deep Ganguli Colin Megill 164 48 0 20 Jun 2023
Reference Matters: Benchmarking Factual Error Correction for Dialogue Summarization with Fine-grained Evaluation FrameworkAnnual Meeting of the Association for Computational Linguistics (ACL), 2023 Mingqi Gao Xiaojun Wan Jia Su Zhefeng Wang Baoxing Huai HILM 143 10 0 08 Jun 2023
Multi-Dimensional Evaluation of Text Summarization with In-Context LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2023 Sameer Jain Vaishakh Keshava Swarnashree Mysore Sathyendra Patrick Fernandes Pengfei Liu Graham Neubig Chunting Zhou ELM 178 51 0 01 Jun 2023