Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
7 March 2023
Yixin Liu
Alexander R. Fabbri
Yilun Zhao
Pengfei Liu
Shafiq Joty
Chien-Sheng Wu
Caiming Xiong
Dragomir R. Radev

Papers citing "Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation"

21 / 21 papers shown
LeMAJ (Legal LLM-as-a-Judge): Bridging Legal Reasoning and LLM Evaluation
Joseph Enguehard
Morgane Van Ermengem
Kate Atkinson
Sujeong Cha
Arijit Ghosh Chowdhury
...
Jeremy Roghair
Hannah R Marlowe
Carina Suzana Negreanu
Kitty Boxall
Diana Mincu
AILaw, ELM
211
3
0
08 Oct 2025
Evaluating the Evaluators: Are readability metrics good measures of readability?
Isabel Cachola
Daniel Khashabi
Mark Dredze
ELM
114
2
0
26 Aug 2025
Can Large Language Models be Effective Online Opinion Miners?
Ryang Heo
Yongsik Seo
Junseong Lee
Dongha Lee
303
2
0
21 May 2025
Estimating Optimal Context Length for Hybrid Retrieval-augmented Multi-document Summarization
Adithya Pratapa
Teruko Mitamura
RALM
283
0
0
17 Apr 2025
Multi2: Multi-Agent Test-Time Scalable Framework for Multi-Document Processing
Juntai Cao
Xiang Zhang
Raymond Li
Chuyuan Li
Shafiq Joty
Giuseppe Carenini
554
13
0
27 Feb 2025
GraphCheck: Breaking Long-Term Text Barriers with Extracted Knowledge Graph-Powered Fact-Checking
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yingjian Chen
Haoran Liu
Yinhong Liu
Rui Yang
Han Yuan
...
Pengyuan Zhou
Qingyu Chen
James Caverlee
Irene Li
HILM
727
9
0
23 Feb 2025
Scaling Multi-Document Event Summarization: Evaluating Compression vs. Full-Text Approaches
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Adithya Pratapa
Teruko Mitamura
344
1
0
10 Feb 2025
QAPyramid: Fine-grained Evaluation of Content Selection for Text Summarization
Shiyue Zhang
David Wan
Arie Cattan
Ayal Klein
Ido Dagan
Joey Tianyi Zhou
405
5
0
10 Dec 2024
M3SciQA: A Multi-Modal Multi-Document Scientific QA Benchmark for Evaluating Foundation Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Chuhan Li
Ziyao Shangguan
Yilun Zhao
Deyuan Li
Yongxu Liu
Arman Cohan
354
18
0
06 Nov 2024
4-LEGS: 4D Language Embedded Gaussian Splatting
Gal Fiebelman
Tamir Cohen
Ayellet Morgenstern
Peter Hedman
Hadar Averbuch-Elor
3DGS
517
4
0
14 Oct 2024
ReIFE: Re-evaluating Instruction-Following Evaluation
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Yixin Liu
Kejian Shi
Alexander R. Fabbri
Yilun Zhao
Peifeng Wang
Chien-Sheng Wu
Shafiq Joty
Arman Cohan
288
15
0
09 Oct 2024
NovAScore: A New Automated Metric for Evaluating Document Level Novelty
International Conference on Computational Linguistics (COLING), 2024
Lin Ai
Ziwei Gong
Harshsaiprasad Deshpande
Alexander Johnson
Emmy Phung
Ahmad Emami
Julia Hirschberg
197
3
0
14 Sep 2024
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems
Philippe Laban
Alexander R. Fabbri
Caiming Xiong
Chien-Sheng Wu
RALM
390
93
0
01 Jul 2024
Unveiling Implicit Table Knowledge with Question-Then-Pinpoint Reasoner for Insightful Table Summarization
Kwangwook Seo
Jinyoung Yeo
Dongha Lee
ReLM, LMTD, LRM
235
4
0
18 Jun 2024
Rethinking Scientific Summarization Evaluation: Grounding Explainable Metrics on Facet-aware Benchmark
Preslav Nakov
Tairan Wang
Qingqing Zhu
Taicheng Guo
Shen Gao
Zhiyong Lu
Xin Gao
Xiangliang Zhang
553
3
0
22 Feb 2024
Fair Abstractive Summarization of Diverse Perspectives
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Yusen Zhang
Nan Zhang
Yixin Liu
Alexander R. Fabbri
Junru Liu
...
Caiming Xiong
Jieyu Zhao
Dragomir R. Radev
Kathleen McKeown
Rui Zhang
214
25
0
14 Nov 2023
On Context Utilization in Summarization with Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Mathieu Ravaut
Aixin Sun
Nancy F. Chen
Shafiq Joty
654
37
0
16 Oct 2023
The Extractive-Abstractive Axis: Measuring Content "Borrowing" in Generative Language Models
Nedelina Teneva
200
1
0
20 Jul 2023
DecipherPref: Analyzing Influential Factors in Human Preference Judgments via GPT-4
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Ye Hu
Kaiqiang Song
Sangwoo Cho
Xiaoyang Wang
H. Foroosh
Fei Liu
398
17
0
24 May 2023
QTSumm: Query-Focused Summarization over Tabular Data
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yilun Zhao
Zhenting Qi
Linyong Nan
Boyu Mi
Yixin Liu
...
Ruizhe Chen
Xiangru Tang
Yumo Xu
Dragomir R. Radev
Arman Cohan
RALM, LMTD
305
3
0
23 May 2023
On Learning to Summarize with Large Language Models as References
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Yixin Liu
Kejian Shi
Katherine S He
Longtian Ye
Alexander R. Fabbri
Pengfei Liu
Dragomir R. Radev
Arman Cohan
ELM
553
126
0
23 May 2023