Evaluating the Factual Consistency of Abstractive Text Summarization


Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
28 October 2019
Wojciech Kryściński
Bryan McCann
Caiming Xiong
R. Socher
    HILM

Papers citing "Evaluating the Factual Consistency of Abstractive Text Summarization"

50 / 491 papers shown
Rating Roulette: Self-Inconsistency in LLM-As-A-Judge Frameworks
Rajarshi Haldar
Julia Hockenmaier
16
0
0
31 Oct 2025
DRAMA: Unifying Data Retrieval and Analysis for Open-Domain Analytic Queries
Chuxuan Hu
Maxwell Yang
James Weiland
Yeji Lim
Suhas Palawala
Daniel Kang
12
0
0
31 Oct 2025
VISTA Score: Verification In Sequential Turn-based Assessment
A. Lewis
Andrew Perrault
Eric Fosler-Lussier
Michael White
HILM
117
0
0
30 Oct 2025
Confabulations from ACL Publications (CAP): A Dataset for Scientific Hallucination Detection
Federica Gamba
Aman Sinha
Timothee Mickus
Raul Vazquez
Patanjali Bhamidipati
...
Aryan Chandramania
Rohit Agarwal
Chuyuan Li
Ioana Buhnila
Radhika Mamidi
HILM
60
0
0
25 Oct 2025
Enhancing Faithfulness in Abstractive Summarization via Span-Level Fine-Tuning
Sicong Huang
Qianqi Yan
Shengze Wang
Ian Lane
HILM
77
0
0
10 Oct 2025
Text2Stories: Evaluating the Alignment Between Stakeholder Interviews and Generated User Stories
Francesco Dente
Fabiano Dalpiaz
Paolo Papotti
20
0
0
08 Oct 2025
Exposing Citation Vulnerabilities in Generative Engines
Riku Mochizuki
Shusuke Komatsu
Souta Noguchi
Kazuto Ataka
ELM
36
0
0
08 Oct 2025
InforME: Improving Informativeness of Abstractive Text Summarization With Informative Attention Guided by Named Entity Salience
Jianbin Shen
Christy Jie Liang
Junyu Xuan
16
0
0
07 Oct 2025
Large Language Models Hallucination: A Comprehensive Survey
Aisha Alansari
Hamzah Luqman
HILM
LRM
186
0
0
05 Oct 2025
ACT: Agentic Classification Tree
Vincent Grari
Tim Arni
Thibault Laugel
Sylvain Lamprier
James Zou
Marcin Detyniecki
36
0
0
30 Sep 2025
PerHalluEval: Persian Hallucination Evaluation Benchmark for Large Language Models
Mohammad Hosseini
Kimia Hosseini
Shayan Bali
Zahra Zanjani
Saeedeh Momtazi
HILM
VLM
44
0
0
25 Sep 2025
Document Summarization with Conformal Importance Guarantees
Bruce Kuwahara
Chen-Yuan Lin
Xiao Shi Huang
Kin Kwan Leung
Jullian Arta Yapeter
Ilya Stanevich
Felipe Perez
Jesse C. Cresswell
AI4TS
68
0
0
24 Sep 2025
Memory in Large Language Models: Mechanisms, Evaluation and Evolution
D. Zhang
Wendong Li
Kani Song
Jiaye Lu
Gang Li
Liuchun Yang
Sheng Li
KELM
105
0
0
23 Sep 2025
Efficient Extractive Text Summarization for Online News Articles Using Machine Learning
Sajib Biswas
Milon Biswas
Arunima Mandal
Fatema Tabassum Liza
Joy Sarker
32
0
0
19 Sep 2025
MetaRAG: Metamorphic Testing for Hallucination Detection in RAG Systems
Channdeth Sok
David Luz
Yacine Haddam
HILM
128
0
0
11 Sep 2025
HALT-RAG: A Task-Adaptable Framework for Hallucination Detection with Calibrated NLI Ensembles and Abstention
Saumya Goswami
Siddharth Kurra
VLM
12
0
0
09 Sep 2025
AraHalluEval: A Fine-grained Hallucination Evaluation Framework for Arabic LLMs
Aisha Alansari
Hamzah Luqman
HILM
LRM
24
2
0
04 Sep 2025
AllSummedUp: un framework open-source pour comparer les métriques d'évaluation de résumé
Tanguy Herserant
Vincent Guigue
36
0
0
29 Aug 2025
Coarse-to-Fine Personalized LLM Impressions for Streamlined Radiology Reports
Chengbo Sun
Hui Yi Leong
Lei Li
LM&MA
104
3
0
19 Aug 2025
Hallucination Detection and Mitigation in Scientific Text Simplification using Ensemble Approaches: DS@GT at CLEF 2025 SimpleText
Krishna Chaitanya Marturi
Heba H. Elwazzan
20
2
0
15 Aug 2025
Highlight All the Phrases: Enhancing LLM Transparency through Visual Factuality Indicators
Hyo Jin Do
Rachel Ostrand
Werner Geyer
K. Murugesan
Dennis L. Wei
Justin D. Weisz
HILM
80
0
0
09 Aug 2025
ChartCap: Mitigating Hallucination of Dense Chart Captioning
Junyoung Lim
Jaewoo Ahn
Gunhee Kim
48
1
0
05 Aug 2025
Harnessing RLHF for Robust Unanswerability Recognition and Trustworthy Response Generation in LLMs
Shuyuan Lin
Lei Duan
Philip Hughes
Yuxuan Sheng
HILM
75
0
0
22 Jul 2025
Theoretical Foundations and Mitigation of Hallucination in Large Language Models
Esmail Gumaan
HILM
65
1
0
20 Jul 2025
Reranking-based Generation for Unbiased Perspective Summarization
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Narutatsu Ri
Nicholas Deas
Kathleen McKeown
OffRL
94
0
0
19 Jun 2025
DiscoSum: Discourse-aware News Summarization
Alexander Spangher
Tenghao Huang
Jialiang Gu
Jiatong Shi
Muhao Chen
116
0
0
07 Jun 2025
Contextual Candor: Enhancing LLM Trustworthiness Through Hierarchical Unanswerability Detection
Steven Robinson
Antonio Carlos Rivera
HILM
84
0
0
01 Jun 2025
LegalEval-Q: A New Benchmark for The Quality Evaluation of LLM-Generated Legal Text
Li yunhan
Wu gengshen
AILaw
ELM
ALM
208
0
0
30 May 2025
StrucSum: Graph-Structured Reasoning for Long Document Extractive Summarization with LLMs
Haohan Yuan
Sukhwa Hong
Haopeng Zhang
RALM
ReLM
LRM
131
0
0
29 May 2025
Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning
Shuzheng Si
Haozhe Zhao
Cheng Gao
Yuzhuo Bai
Zhitong Wang
...
Gang Chen
Fanchao Qi
Minjia Zhang
Baobao Chang
Maosong Sun
SyDa
HILM
142
2
0
22 May 2025
Resource for Error Analysis in Text Simplification: New Taxonomy and Test Collection
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Benjamin Vendeville
Liana Ermakova
Pierre De Loor
85
3
0
22 May 2025
Long-Form Information Alignment Evaluation Beyond Atomic Facts
Danna Zheng
Mirella Lapata
Jeff Z. Pan
HILM
146
0
0
21 May 2025
Integrating Video and Text: A Balanced Approach to Multimodal Summary Generation and Evaluation
Galann Pennec
Zhengyuan Liu
Nicholas Asher
Philippe Muller
Nancy F. Chen
VGen
227
0
0
10 May 2025
SEval-Ex: A Statement-Level Framework for Explainable Summarization Evaluation
Tanguy Herserant
Vincent Guigue
ELM
103
1
0
04 May 2025
Combining LLMs with Logic-Based Framework to Explain MCTS
Ziyan An
Xia Wang
Hendrik Baier
Zirong Chen
A. Dubey
Taylor T. Johnson
Jonathan Sprinkle
Ayan Mukhopadhyay
Meiyi Ma
172
2
0
01 May 2025
Can LLMs Detect Intrinsic Hallucinations in Paraphrasing and Machine Translation?
Evangelia Gogoulou
Shorouq Zahra
Liane Guillou
Luise Dürlich
Joakim Nivre
HILM
LRM
698
2
0
29 Apr 2025
Towards Long Context Hallucination Detection
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Siyi Liu
Kishaloy Halder
Zheng Qi
Wei Xiao
Nikolaos Pappas
Phu Mon Htut
Neha Anna John
Yassine Benajiba
Dan Roth
HILM
183
9
0
28 Apr 2025
Conflicts in Texts: Data, Implications and Challenges
Siyi Liu
Dan Roth
762
0
0
28 Apr 2025
ScholarMate: A Mixed-Initiative Tool for Qualitative Knowledge Work and Information Sensemaking
Symposium on Human-Computer Interaction for Work (CHIWORK), 2025
Runlong Ye
Patrick Yung Kang Lee
Matthew Varona
Oliver Huang
Carolina Nobre
216
1
0
19 Apr 2025
Large Language Models as Span Annotators
Zdeněk Kasner
Vilém Zouhar
Patrícia Schmidtová
Ivan Kartáč
Kristýna Onderková
Ondřej Plátek
Dimitra Gkatzia
Saad Mahamood
Ondrej Dusek
Simone Balloccu
ALM
243
4
0
11 Apr 2025
Summarizing Speech: A Comprehensive Survey
Fabian Retkowski
Maike Züfle
Andreas Sudmann
Dinah Pfau
Jan Niehues
Alexander H. Waibel
241
2
0
10 Apr 2025
CASCADE Your Datasets for Cross-Mode Knowledge Retrieval of Language Models
Runlong Zhou
Yi Zhang
RALM
167
1
0
02 Apr 2025
Summarization Metrics for Spanish and Basque: Do Automatic Scores and LLM-Judges Correlate with Humans?
Jeremy Barnes
Naiara Perez
Alba Bonet-Jover
Begoña Altuna
186
3
0
21 Mar 2025
Does Context Matter? ContextualJudgeBench for Evaluating LLM-based Judges in Contextual Settings
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Austin Xu
Srijan Bansal
Yifei Ming
Semih Yavuz
Shafiq Joty
ELM
240
11
0
19 Mar 2025
OpeNLGauge: An Explainable Metric for NLG Evaluation with Open-Weights LLMs
Ivan Kartáč
Mateusz Lango
Ondrej Dusek
ELM
204
4
0
14 Mar 2025
Uncertainty-Aware Decoding with Minimum Bayes Risk
International Conference on Learning Representations (ICLR), 2025
Nico Daheim
Clara Meister
Thomas Möllenhoff
Iryna Gurevych
180
6
0
07 Mar 2025
Evaluating LLMs' Assessment of Mixed-Context Hallucination Through the Lens of Summarization
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Siya Qi
Rui Cao
Petr Slovak
Zheng Yuan
HILM
199
2
0
03 Mar 2025
HalCECE: A Framework for Explainable Hallucination Detection through Conceptual Counterfactuals in Image Captioning
Maria Lymperaiou
Giorgos Filandrianos
Angeliki Dimitriou
Athanasios Voulodimos
Giorgos Stamou
MLLM
102
0
0
01 Mar 2025
Semantic Integrity Constraints: Declarative Guardrails for AI-Augmented Data Processing Systems
Proceedings of the VLDB Endowment (PVLDB), 2025
Alexander W. Lee
Justin Chan
Michael Fu
Nicolas Kim
Akshay Mehta
Deepti Raghavan
Ugur Cetintemel
168
1
0
01 Mar 2025
Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix Factorization
Ryan Barron
Maksim E. Eren
Olga M. Serafimova
Cynthia Matuszek
Boian S. Alexandrov
AILaw
245
6
0
27 Feb 2025