Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1910.12840
Cited By
Evaluating the Factual Consistency of Abstractive Text Summarization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
28 October 2019
Wojciech Kry'sciñski
Bryan McCann
Caiming Xiong
R. Socher
HILM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Evaluating the Factual Consistency of Abstractive Text Summarization"
50 / 491 papers shown
Title
Rating Roulette: Self-Inconsistency in LLM-As-A-Judge Frameworks
Rajarshi Haldar
Julia Hockenmaier
16
0
0
31 Oct 2025
DRAMA: Unifying Data Retrieval and Analysis for Open-Domain Analytic Queries
Chuxuan Hu
Maxwell Yang
James Weiland
Yeji Lim
Suhas Palawala
Daniel Kang
12
0
0
31 Oct 2025
VISTA Score: Verification In Sequential Turn-based Assessment
A. Lewis
Andrew Perrault
Eric Fosler-Lussier
Michael White
HILM
117
0
0
30 Oct 2025
Confabulations from ACL Publications (CAP): A Dataset for Scientific Hallucination Detection
Federica Gamba
Aman Sinha
Timothee Mickus
Raul Vazquez
Patanjali Bhamidipati
...
Aryan Chandramania
Rohit Agarwal
Chuyuan Li
Ioana Buhnila
Radhika Mamidi
HILM
60
0
0
25 Oct 2025
Enhancing Faithfulness in Abstractive Summarization via Span-Level Fine-Tuning
Sicong Huang
Qianqi Yan
Shengze Wang
Ian Lane
HILM
77
0
0
10 Oct 2025
Text2Stories: Evaluating the Alignment Between Stakeholder Interviews and Generated User Stories
Francesco Dente
Fabiano Dalpiaz
Paolo Papotti
20
0
0
08 Oct 2025
Exposing Citation Vulnerabilities in Generative Engines
Riku Mochizuki
Shusuke Komatsu
Souta Noguchi
Kazuto Ataka
ELM
36
0
0
08 Oct 2025
InforME: Improving Informativeness of Abstractive Text Summarization With Informative Attention Guided by Named Entity Salience
Jianbin Shen
Christy Jie Liang
Junyu Xuan
16
0
0
07 Oct 2025
Large Language Models Hallucination: A Comprehensive Survey
Aisha Alansari
Hamzah Luqman
HILM
LRM
186
0
0
05 Oct 2025
ACT: Agentic Classification Tree
Vincent Grari
Tim Arni
Thibault Laugel
Sylvain Lamprier
James Zou
Marcin Detyniecki
36
0
0
30 Sep 2025
PerHalluEval: Persian Hallucination Evaluation Benchmark for Large Language Models
Mohammad Hosseini
Kimia Hosseini
Shayan Bali
Zahra Zanjani
Saeedeh Momtazi
HILM
VLM
44
0
0
25 Sep 2025
Document Summarization with Conformal Importance Guarantees
Bruce Kuwahara
Chen-Yuan Lin
Xiao Shi Huang
Kin Kwan Leung
Jullian Arta Yapeter
Ilya Stanevich
Felipe Perez
Jesse C. Cresswell
AI4TS
68
0
0
24 Sep 2025
Memory in Large Language Models: Mechanisms, Evaluation and Evolution
D. Zhang
Wendong Li
Kani Song
Jiaye Lu
Gang Li
Liuchun Yang
Sheng Li
KELM
105
0
0
23 Sep 2025
Efficient Extractive Text Summarization for Online News Articles Using Machine Learning
Sajib Biswas
Milon Biswas
Arunima Mandal
Fatema Tabassum Liza
Joy Sarker
32
0
0
19 Sep 2025
MetaRAG: Metamorphic Testing for Hallucination Detection in RAG Systems
Channdeth Sok
David Luz
Yacine Haddam
HILM
128
0
0
11 Sep 2025
HALT-RAG: A Task-Adaptable Framework for Hallucination Detection with Calibrated NLI Ensembles and Abstention
Saumya Goswami
Siddharth Kurra
VLM
12
0
0
09 Sep 2025
AraHalluEval: A Fine-grained Hallucination Evaluation Framework for Arabic LLMs
Aisha Alansari
Hamzah Luqman
HILM
LRM
24
2
0
04 Sep 2025
AllSummedUp: un framework open-source pour comparer les metriques dévaluation de resume
Tanguy Herserant
Vincent Guigue
36
0
0
29 Aug 2025
Coarse-to-Fine Personalized LLM Impressions for Streamlined Radiology Reports
Chengbo Sun
Hui Yi Leong
Lei Li
LM&MA
104
3
0
19 Aug 2025
Hallucination Detection and Mitigation in Scientific Text Simplification using Ensemble Approaches: DS@GT at CLEF 2025 SimpleText
Krishna Chaitanya Marturi
Heba H. Elwazzan
20
2
0
15 Aug 2025
Highlight All the Phrases: Enhancing LLM Transparency through Visual Factuality Indicators
Hyo Jin Do
Rachel Ostrand
Werner Geyer
K. Murugesan
Dennis L. Wei
Justin D. Weisz
HILM
80
0
0
09 Aug 2025
ChartCap: Mitigating Hallucination of Dense Chart Captioning
Junyoung Lim
Jaewoo Ahn
Gunhee Kim
48
1
0
05 Aug 2025
Harnessing RLHF for Robust Unanswerability Recognition and Trustworthy Response Generation in LLMs
Shuyuan Lin
Lei Duan
Philip Hughes
Yuxuan Sheng
HILM
75
0
0
22 Jul 2025
Theoretical Foundations and Mitigation of Hallucination in Large Language Models
Esmail Gumaan
HILM
65
1
0
20 Jul 2025
Reranking-based Generation for Unbiased Perspective Summarization
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Narutatsu Ri
Nicholas Deas
Kathleen McKeown
OffRL
94
0
0
19 Jun 2025
DiscoSum: Discourse-aware News Summarization
Alexander Spangher
Tenghao Huang
Jialiang Gu
Jiatong Shi
Muhao Chen
116
0
0
07 Jun 2025
Contextual Candor: Enhancing LLM Trustworthiness Through Hierarchical Unanswerability Detection
Steven Robinson
Antonio Carlos Rivera
HILM
84
0
0
01 Jun 2025
LegalEval-Q: A New Benchmark for The Quality Evaluation of LLM-Generated Legal Text
Li yunhan
Wu gengshen
AILaw
ELM
ALM
208
0
0
30 May 2025
StrucSum: Graph-Structured Reasoning for Long Document Extractive Summarization with LLMs
Haohan Yuan
Sukhwa Hong
Haopeng Zhang
RALM
ReLM
LRM
131
0
0
29 May 2025
Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning
Shuzheng Si
Haozhe Zhao
Cheng Gao
Yuzhuo Bai
Zhitong Wang
...
Gang Chen
Fanchao Qi
Minjia Zhang
Baobao Chang
Maosong Sun
SyDa
HILM
142
2
0
22 May 2025
Resource for Error Analysis in Text Simplification: New Taxonomy and Test Collection
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Benjamin Vendeville
Liana Ermakova
Pierre De Loor
85
3
0
22 May 2025
Long-Form Information Alignment Evaluation Beyond Atomic Facts
Danna Zheng
Mirella Lapata
Jeff Z. Pan
HILM
146
0
0
21 May 2025
Integrating Video and Text: A Balanced Approach to Multimodal Summary Generation and Evaluation
Galann Pennec
Zhengyuan Liu
Nicholas Asher
Philippe Muller
Nancy F. Chen
VGen
227
0
0
10 May 2025
SEval-Ex: A Statement-Level Framework for Explainable Summarization Evaluation
Tanguy Herserant
Vincent Guigue
ELM
103
1
0
04 May 2025
Combining LLMs with Logic-Based Framework to Explain MCTS
Ziyan An
Xia Wang
Hendrik Baier
Zirong Chen
A. Dubey
Taylor T. Johnson
Jonathan Sprinkle
Ayan Mukhopadhyay
Meiyi Ma
172
2
0
01 May 2025
Can LLMs Detect Intrinsic Hallucinations in Paraphrasing and Machine Translation?
Evangelia Gogoulou
Shorouq Zahra
Liane Guillou
Luise Dürlich
Joakim Nivre
HILM
LRM
698
2
0
29 Apr 2025
Towards Long Context Hallucination Detection
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Siyi Liu
Kishaloy Halder
Zheng Qi
Wei Xiao
Nikolaos Pappas
Phu Mon Htut
Neha Anna John
Yassine Benajiba
Dan Roth
HILM
183
9
0
28 Apr 2025
Conflicts in Texts: Data, Implications and Challenges
Siyi Liu
Dan Roth
762
0
0
28 Apr 2025
ScholarMate: A Mixed-Initiative Tool for Qualitative Knowledge Work and Information Sensemaking
Symposium on Human-Computer Interaction for Work (CHIWORK), 2025
Runlong Ye
Patrick Yung Kang Lee
Matthew Varona
Oliver Huang
Carolina Nobre
216
1
0
19 Apr 2025
Large Language Models as Span Annotators
Zdeněk Kasner
Vilém Zouhar
Patrícia Schmidtová
Ivan Kartáč
Kristýna Onderková
Ondřej Plátek
Dimitra Gkatzia
Saad Mahamood
Ondrej Dusek
Simone Balloccu
ALM
243
4
0
11 Apr 2025
Summarizing Speech: A Comprehensive Survey
Fabian Retkowski
Maike Züfle
Andreas Sudmann
Dinah Pfau
Jan Niehues
Jan Niehues
Alexander H. Waibel
241
2
0
10 Apr 2025
CASCADE Your Datasets for Cross-Mode Knowledge Retrieval of Language Models
Runlong Zhou
Yi Zhang
RALM
167
1
0
02 Apr 2025
Summarization Metrics for Spanish and Basque: Do Automatic Scores and LLM-Judges Correlate with Humans?
Jeremy Barnes
Naiara Perez
Alba Bonet-Jover
Begoña Altuna
186
3
0
21 Mar 2025
Does Context Matter? ContextualJudgeBench for Evaluating LLM-based Judges in Contextual Settings
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Austin Xu
Srijan Bansal
Yifei Ming
Semih Yavuz
Shafiq Joty
ELM
240
11
0
19 Mar 2025
OpeNLGauge: An Explainable Metric for NLG Evaluation with Open-Weights LLMs
Ivan Kartáč
Mateusz Lango
Ondrej Dusek
ELM
204
4
0
14 Mar 2025
Uncertainty-Aware Decoding with Minimum Bayes Risk
International Conference on Learning Representations (ICLR), 2025
Nico Daheim
Clara Meister
Thomas Möllenhoff
Iryna Gurevych
180
6
0
07 Mar 2025
Evaluating LLMs' Assessment of Mixed-Context Hallucination Through the Lens of Summarization
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Siya Qi
Rui Cao
Petr Slovak
Zheng Yuan
HILM
199
2
0
03 Mar 2025
HalCECE: A Framework for Explainable Hallucination Detection through Conceptual Counterfactuals in Image Captioning
Maria Lymperaiou
Giorgos Filandrianos
Angeliki Dimitriou
Athanasios Voulodimos
Giorgos Stamou
MLLM
102
0
0
01 Mar 2025
Semantic Integrity Constraints: Declarative Guardrails for AI-Augmented Data Processing Systems
Proceedings of the VLDB Endowment (PVLDB), 2025
Alexander W. Lee
Justin Chan
Michael Fu
Nicolas Kim
Akshay Mehta
Deepti Raghavan
Ugur Cetintemel
168
1
0
01 Mar 2025
Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix Factorization
Ryan Barron
Maksim E. Eren
Olga M. Serafimova
Cynthia Matuszek
Boian S. Alexandrov
AILaw
245
6
0
27 Feb 2025
1
2
3
4
...
8
9
10
Next