Papers
Communities
Organizations
Events
Blog
Pricing
Feedback
Contact Sales
Search
Open menu
Home
Papers
2212.07981
Cited By
v1
v2 (latest)
Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation
15 December 2022
Yixin Liu
Alexander R. Fabbri
Pengfei Liu
Yilun Zhao
Linyong Nan
Ruilin Han
Simeng Han
Shafiq Joty
Chien-Sheng Wu
Caiming Xiong
Dragomir R. Radev
ALM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation"
14 / 114 papers shown
Title
Element-aware Summarization with Large Language Models: Expert-aligned Evaluation and Chain-of-Thought Method
Yiming Wang
Zhuosheng Zhang
Rui Wang
138
97
0
22 May 2023
SEAHORSE: A Multilingual, Multifaceted Dataset for Summarization Evaluation
Elizabeth Clark
Shruti Rijhwani
Sebastian Gehrmann
Joshua Maynez
Roee Aharoni
Vitaly Nikolaev
Thibault Sellam
Aditya Siddhant
Dipanjan Das
Ankur P. Parikh
126
43
0
22 May 2023
Complex Claim Verification with Evidence Retrieved in the Wild
Jifan Chen
Grace Kim
Aniruddh Sriram
Greg Durrett
Eunsol Choi
HILM
185
92
0
19 May 2023
FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge
Shangbin Feng
Vidhisha Balachandran
Yuyang Bai
Yulia Tsvetkov
KELM
HILM
111
60
0
14 May 2023
The Current State of Summarization
Fabian Retkowski
124
8
0
08 May 2023
Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation
Yixin Liu
Alexander R. Fabbri
Yilun Zhao
Pengfei Liu
Shafiq Joty
Chien-Sheng Wu
Caiming Xiong
Dragomir R. Radev
88
29
0
07 Mar 2023
WiCE: Real-World Entailment for Claims in Wikipedia
Ryo Kamoi
Tanya Goyal
Juan Diego Rodriguez
Greg Durrett
153
100
0
02 Mar 2023
GPTScore: Evaluate as You Desire
Jinlan Fu
See-Kiong Ng
Zhengbao Jiang
Pengfei Liu
LM&MA
ALM
ELM
248
329
0
08 Feb 2023
LoFT: Enhancing Faithfulness and Diversity for Table-to-Text Generation via Logic Form Control
Yilun Zhao
Zhenting Qi
Linyong Nan
Lorenzo Jaime Yu Flores
Dragomir R. Radev
LMTD
103
22
0
06 Feb 2023
Benchmarking Large Language Models for News Summarization
Tianyi Zhang
Faisal Ladhak
Esin Durmus
Percy Liang
Kathleen McKeown
Tatsunori B. Hashimoto
ELM
167
587
0
31 Jan 2023
LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization
Kalpesh Krishna
Erin Bransom
Bailey Kuehl
Mohit Iyyer
Pradeep Dasigi
Arman Cohan
Kyle Lo
100
103
0
30 Jan 2023
The Next Chapter: A Study of Large Language Models in Storytelling
Zhuohan Xie
Trevor Cohn
Jey Han Lau
151
52
0
24 Jan 2023
Socratic Pretraining: Question-Driven Pretraining for Controllable Summarization
Artidoro Pagnoni
Alexander R. Fabbri
Wojciech Kry'sciñski
Chien-Sheng Wu
RALM
162
18
0
20 Dec 2022
Marvista: Exploring the Design of a Human-AI Collaborative News Reading Tool
Xiang Ánthony' Chen
Chien-Sheng Wu
Lidiya Murakhovs'ka
Philippe Laban
Tong Niu
Wenhao Liu
Caiming Xiong
SyDa
134
13
0
18 Jul 2022
Previous
1
2
3