Evaluating the Factual Consistency of Abstractive Text Summarization

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019

28 October 2019

Papers citing "Evaluating the Factual Consistency of Abstractive Text Summarization"

50 / 491 papers shown

Title
Rating Roulette: Self-Inconsistency in LLM-As-A-Judge Frameworks Rajarshi Haldar Julia Hockenmaier 16 0 0 31 Oct 2025
DRAMA: Unifying Data Retrieval and Analysis for Open-Domain Analytic Queries Chuxuan Hu Maxwell Yang James Weiland Yeji Lim Suhas Palawala Daniel Kang 12 0 0 31 Oct 2025
VISTA Score: Verification In Sequential Turn-based Assessment A. Lewis Andrew Perrault Eric Fosler-Lussier Michael White HILM 117 0 0 30 Oct 2025
Confabulations from ACL Publications (CAP): A Dataset for Scientific Hallucination Detection Federica Gamba Aman Sinha Timothee Mickus Raul Vazquez Patanjali Bhamidipati ... Aryan Chandramania Rohit Agarwal Chuyuan Li Ioana Buhnila Radhika Mamidi HILM 60 0 0 25 Oct 2025
Enhancing Faithfulness in Abstractive Summarization via Span-Level Fine-Tuning Sicong Huang Qianqi Yan Shengze Wang Ian Lane HILM 77 0 0 10 Oct 2025
Text2Stories: Evaluating the Alignment Between Stakeholder Interviews and Generated User Stories Francesco Dente Fabiano Dalpiaz Paolo Papotti 20 0 0 08 Oct 2025
Exposing Citation Vulnerabilities in Generative Engines Riku Mochizuki Shusuke Komatsu Souta Noguchi Kazuto Ataka ELM 36 0 0 08 Oct 2025
InforME: Improving Informativeness of Abstractive Text Summarization With Informative Attention Guided by Named Entity Salience Jianbin Shen Christy Jie Liang Junyu Xuan 16 0 0 07 Oct 2025
Large Language Models Hallucination: A Comprehensive Survey Aisha Alansari Hamzah Luqman HILM LRM 186 0 0 05 Oct 2025
ACT: Agentic Classification Tree Vincent Grari Tim Arni Thibault Laugel Sylvain Lamprier James Zou Marcin Detyniecki 36 0 0 30 Sep 2025
PerHalluEval: Persian Hallucination Evaluation Benchmark for Large Language Models Mohammad Hosseini Kimia Hosseini Shayan Bali Zahra Zanjani Saeedeh Momtazi HILM VLM 44 0 0 25 Sep 2025
Document Summarization with Conformal Importance Guarantees Bruce Kuwahara Chen-Yuan Lin Xiao Shi Huang Kin Kwan Leung Jullian Arta Yapeter Ilya Stanevich Felipe Perez Jesse C. Cresswell AI4TS 68 0 0 24 Sep 2025
Memory in Large Language Models: Mechanisms, Evaluation and Evolution D. Zhang Wendong Li Kani Song Jiaye Lu Gang Li Liuchun Yang Sheng Li KELM 105 0 0 23 Sep 2025
Efficient Extractive Text Summarization for Online News Articles Using Machine Learning Sajib Biswas Milon Biswas Arunima Mandal Fatema Tabassum Liza Joy Sarker 32 0 0 19 Sep 2025
MetaRAG: Metamorphic Testing for Hallucination Detection in RAG Systems Channdeth Sok David Luz Yacine Haddam HILM 128 0 0 11 Sep 2025
HALT-RAG: A Task-Adaptable Framework for Hallucination Detection with Calibrated NLI Ensembles and Abstention Saumya Goswami Siddharth Kurra VLM 12 0 0 09 Sep 2025
AraHalluEval: A Fine-grained Hallucination Evaluation Framework for Arabic LLMs Aisha Alansari Hamzah Luqman HILM LRM 24 2 0 04 Sep 2025
AllSummedUp: un framework open-source pour comparer les metriques dévaluation de resume Tanguy Herserant Vincent Guigue 36 0 0 29 Aug 2025
Coarse-to-Fine Personalized LLM Impressions for Streamlined Radiology Reports Chengbo Sun Hui Yi Leong Lei Li LM&MA 104 3 0 19 Aug 2025
Hallucination Detection and Mitigation in Scientific Text Simplification using Ensemble Approaches: DS@GT at CLEF 2025 SimpleText Krishna Chaitanya Marturi Heba H. Elwazzan 20 2 0 15 Aug 2025
Highlight All the Phrases: Enhancing LLM Transparency through Visual Factuality Indicators Hyo Jin Do Rachel Ostrand Werner Geyer K. Murugesan Dennis L. Wei Justin D. Weisz HILM 80 0 0 09 Aug 2025
ChartCap: Mitigating Hallucination of Dense Chart Captioning Junyoung Lim Jaewoo Ahn Gunhee Kim 48 1 0 05 Aug 2025
Harnessing RLHF for Robust Unanswerability Recognition and Trustworthy Response Generation in LLMs Shuyuan Lin Lei Duan Philip Hughes Yuxuan Sheng HILM 75 0 0 22 Jul 2025
Theoretical Foundations and Mitigation of Hallucination in Large Language Models Esmail Gumaan HILM 65 1 0 20 Jul 2025
Reranking-based Generation for Unbiased Perspective Summarization Annual Meeting of the Association for Computational Linguistics (ACL), 2025 Narutatsu Ri Nicholas Deas Kathleen McKeown OffRL 94 0 0 19 Jun 2025
DiscoSum: Discourse-aware News Summarization Alexander Spangher Tenghao Huang Jialiang Gu Jiatong Shi Muhao Chen 116 0 0 07 Jun 2025
Contextual Candor: Enhancing LLM Trustworthiness Through Hierarchical Unanswerability Detection Steven Robinson Antonio Carlos Rivera HILM 84 0 0 01 Jun 2025
LegalEval-Q: A New Benchmark for The Quality Evaluation of LLM-Generated Legal Text Li yunhan Wu gengshen AILaw ELM ALM 208 0 0 30 May 2025
StrucSum: Graph-Structured Reasoning for Long Document Extractive Summarization with LLMs Haohan Yuan Sukhwa Hong Haopeng Zhang RALM ReLM LRM 131 0 0 29 May 2025
Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning Shuzheng Si Haozhe Zhao Cheng Gao Yuzhuo Bai Zhitong Wang ... Gang Chen Fanchao Qi Minjia Zhang Baobao Chang Maosong Sun SyDa HILM 142 2 0 22 May 2025
Resource for Error Analysis in Text Simplification: New Taxonomy and Test Collection Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025 Benjamin Vendeville Liana Ermakova Pierre De Loor 85 3 0 22 May 2025
Long-Form Information Alignment Evaluation Beyond Atomic Facts Danna Zheng Mirella Lapata Jeff Z. Pan HILM 146 0 0 21 May 2025
Integrating Video and Text: A Balanced Approach to Multimodal Summary Generation and Evaluation Galann Pennec Zhengyuan Liu Nicholas Asher Philippe Muller Nancy F. Chen VGen 227 0 0 10 May 2025
SEval-Ex: A Statement-Level Framework for Explainable Summarization Evaluation Tanguy Herserant Vincent Guigue ELM 103 1 0 04 May 2025
Combining LLMs with Logic-Based Framework to Explain MCTS Ziyan An Xia Wang Hendrik Baier Zirong Chen A. Dubey Taylor T. Johnson Jonathan Sprinkle Ayan Mukhopadhyay Meiyi Ma 172 2 0 01 May 2025
Can LLMs Detect Intrinsic Hallucinations in Paraphrasing and Machine Translation? Evangelia Gogoulou Shorouq Zahra Liane Guillou Luise Dürlich Joakim Nivre HILM LRM 698 2 0 29 Apr 2025
Towards Long Context Hallucination Detection North American Chapter of the Association for Computational Linguistics (NAACL), 2025 Siyi Liu Kishaloy Halder Zheng Qi Wei Xiao Nikolaos Pappas Phu Mon Htut Neha Anna John Yassine Benajiba Dan Roth HILM 183 9 0 28 Apr 2025
Conflicts in Texts: Data, Implications and Challenges Siyi Liu Dan Roth 762 0 0 28 Apr 2025
ScholarMate: A Mixed-Initiative Tool for Qualitative Knowledge Work and Information Sensemaking Symposium on Human-Computer Interaction for Work (CHIWORK), 2025 Runlong Ye Patrick Yung Kang Lee Matthew Varona Oliver Huang Carolina Nobre 216 1 0 19 Apr 2025
Large Language Models as Span Annotators Zdeněk Kasner Vilém Zouhar Patrícia Schmidtová Ivan Kartáč Kristýna Onderková Ondřej Plátek Dimitra Gkatzia Saad Mahamood Ondrej Dusek Simone Balloccu ALM 243 4 0 11 Apr 2025
Summarizing Speech: A Comprehensive Survey Fabian Retkowski Maike Züfle Andreas Sudmann Dinah Pfau Jan Niehues Alexander H. Waibel 241 2 0 10 Apr 2025
CASCADE Your Datasets for Cross-Mode Knowledge Retrieval of Language Models Runlong Zhou Yi Zhang RALM 167 1 0 02 Apr 2025
Summarization Metrics for Spanish and Basque: Do Automatic Scores and LLM-Judges Correlate with Humans? Jeremy Barnes Naiara Perez Alba Bonet-Jover Begoña Altuna 186 3 0 21 Mar 2025
Does Context Matter? ContextualJudgeBench for Evaluating LLM-based Judges in Contextual Settings Annual Meeting of the Association for Computational Linguistics (ACL), 2025 Austin Xu Srijan Bansal Yifei Ming Semih Yavuz Shafiq Joty ELM 240 11 0 19 Mar 2025
OpeNLGauge: An Explainable Metric for NLG Evaluation with Open-Weights LLMs Ivan Kartáč Mateusz Lango Ondrej Dusek ELM 204 4 0 14 Mar 2025
Uncertainty-Aware Decoding with Minimum Bayes Risk International Conference on Learning Representations (ICLR), 2025 Nico Daheim Clara Meister Thomas Möllenhoff Iryna Gurevych 180 6 0 07 Mar 2025
Evaluating LLMs' Assessment of Mixed-Context Hallucination Through the Lens of Summarization Annual Meeting of the Association for Computational Linguistics (ACL), 2025 Siya Qi Rui Cao Petr Slovak Zheng Yuan HILM 199 2 0 03 Mar 2025
HalCECE: A Framework for Explainable Hallucination Detection through Conceptual Counterfactuals in Image Captioning Maria Lymperaiou Giorgos Filandrianos Angeliki Dimitriou Athanasios Voulodimos Giorgos Stamou MLLM 102 0 0 01 Mar 2025
Semantic Integrity Constraints: Declarative Guardrails for AI-Augmented Data Processing Systems Proceedings of the VLDB Endowment (PVLDB), 2025 Alexander W. Lee Justin Chan Michael Fu Nicolas Kim Akshay Mehta Deepti Raghavan Ugur Cetintemel 168 1 0 01 Mar 2025
Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix Factorization Ryan Barron Maksim E. Eren Olga M. Serafimova Cynthia Matuszek Boian S. Alexandrov AILaw 245 6 0 27 Feb 2025