News Summarization and Evaluation in the Era of GPT-3
26 September 2022
Tanya Goyal
Junyi Jessy Li
Greg Durrett
    ELM
arXiv: 2209.12356

Papers citing "News Summarization and Evaluation in the Era of GPT-3"

50 / 289 papers shown
Benchmarking Large Language Model Capabilities for Conditional Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Joshua Maynez
Priyanka Agrawal
Sebastian Gehrmann
ELM, LM&MA
271
34
0
29 Jun 2023
Leveraging GPT-4 for Food Effect Summarization to Enhance Product-Specific Guidance Development via Iterative Prompting
Journal of Biomedical Informatics (JBI), 2023
Yiwen Shi
Ping Ren
Jing Wang
Biao Han
Taha ValizadehAslani
Felix Agbavor
Yi Zhang
Meng Hu
Bo Pan
Hualou Liang
161
23
0
28 Jun 2023
Are aligned neural networks adversarially aligned?
Neural Information Processing Systems (NeurIPS), 2023
Nicholas Carlini
Milad Nasr
Christopher A. Choquette-Choo
Matthew Jagielski
Irena Gao
...
Pang Wei Koh
Daphne Ippolito
Katherine Lee
Florian Tramèr
Ludwig Schmidt
AAML
287
312
0
26 Jun 2023
Cross-lingual Cross-temporal Summarization: Dataset, Models, Evaluation
Computational Linguistics (CL), 2023
Ran Zhang
Jihed Ouni
Steffen Eger
399
12
0
22 Jun 2023
Learning to Generate Better Than Your LLM
Jonathan D. Chang
Kianté Brantley
Rajkumar Ramamurthy
Dipendra Kumar Misra
Wen Sun
272
54
0
20 Jun 2023
GUMSum: Multi-Genre Data and Evaluation for English Abstractive Summarization
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yang Liu
Amir Zeldes
ELM
161
3
0
20 Jun 2023
Bloated Disclosures: Can ChatGPT Help Investors Process Information?
Social Science Research Network (SSRN), 2023
Alex G. Kim
Maximilian Muhn
Valeri V. Nikolaev
498
49
0
17 Jun 2023
CMMLU: Measuring massive multitask language understanding in Chinese
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Jinyan Su
Yixuan Zhang
Fajri Koto
Yifei Yang
Hai Zhao
Yeyun Gong
Nan Duan
Tim Baldwin
ALM, ELM
437
413
0
15 Jun 2023
Revisiting Out-of-distribution Robustness in NLP: Benchmark, Analysis, and LLMs Evaluations
Neural Information Processing Systems (NeurIPS), 2023
Lifan Yuan
Yangyi Chen
Ganqu Cui
Hongcheng Gao
Fangyuan Zou
Xingyi Cheng
Heng Ji
Zhiyuan Liu
Maosong Sun
532
133
0
07 Jun 2023
Multi-Dimensional Evaluation of Text Summarization with In-Context Learning
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Sameer Jain
Vaishakh Keshava
Swarnashree Mysore Sathyendra
Patrick Fernandes
Pengfei Liu
Graham Neubig
Chunting Zhou
ELM
222
53
0
01 Jun 2023
Concise Answers to Complex Questions: Summarization of Long-form Answers
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Abhilash Potluri
Fangyuan Xu
Eunsol Choi
ELM
164
12
0
30 May 2023
A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Md Tahmid Rahman Laskar
M Saiful Bari
Mizanur Rahman
Md Amran Hossen Bhuiyan
Shafiq Joty
J. Huang
LM&MA, ELM, ALM
505
215
0
29 May 2023
Generating EDU Extracts for Plan-Guided Summary Re-Ranking
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Griffin Adams
Alexander R. Fabbri
Faisal Ladhak
Kathleen McKeown
Noémie Elhadad
199
13
0
28 May 2023
MeetingBank: A Benchmark Dataset for Meeting Summarization
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yebowen Hu
Timothy Jeewun Ganter
Hanieh Deilamsalehy
Franck Dernoncourt
H. Foroosh
Fei Liu
AI4TS
230
67
0
27 May 2023
Do GPTs Produce Less Literal Translations?
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Vikas Raunak
Arul Menezes
Matt Post
H. Awadallah
327
44
0
26 May 2023
Is Summary Useful or Not? An Extrinsic Human Evaluation of Text Summaries on Downstream Tasks
International Conference on Language Resources and Evaluation (LREC), 2023
Xiao Pu
Mingqi Gao
Xiaojun Wan
ELM
254
9
0
24 May 2023
Investigating Table-to-Text Generation Capabilities of LLMs in Real-World Information Seeking Scenarios
Yilun Zhao
Haowei Zhang
Shengyun Si
Linyong Nan
Xiangru Tang
Arman Cohan
LMTD
332
17
0
24 May 2023
BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual Transfer
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Akari Asai
Sneha Kudugunta
Xinyan Velocity Yu
Terra Blevins
Hila Gonen
Machel Reid
Yulia Tsvetkov
Sebastian Ruder
Hannaneh Hajishirzi
319
82
0
24 May 2023
SummIt: Iterative Text Summarization via ChatGPT
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Haopeng Zhang
Xiao Liu
Jiawei Zhang
329
91
0
24 May 2023
Using Natural Language Explanations to Rescale Human Judgments
Manya Wadhwa
Jifan Chen
Junyi Jessy Li
Greg Durrett
324
12
0
24 May 2023
UniChart: A Universal Vision-language Pretrained Model for Chart Comprehension and Reasoning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Ahmed Masry
P. Kavehzadeh
Do Xuan Long
Enamul Hoque
Shafiq Joty
LRM
346
160
0
24 May 2023
DecipherPref: Analyzing Influential Factors in Human Preference Judgments via GPT-4
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Ye Hu
Kaiqiang Song
Sangwoo Cho
Xiaoyang Wang
H. Foroosh
Fei Liu
332
17
0
24 May 2023
Scientific Opinion Summarization: Paper Meta-review Generation Dataset, Methods, and Evaluation
Qi Zeng
Mankeerat Sidhu
Ansel Blume
Hou Pong Chan
Lu Wang
Heng Ji
245
12
0
24 May 2023
LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond
Philippe Laban
Wojciech Kryściński
Divyansh Agarwal
Alexander R. Fabbri
Caiming Xiong
Shafiq Joty
Chien-Sheng Wu
ALM, HILM
153
45
0
23 May 2023
Dancing Between Success and Failure: Edit-level Simplification Evaluation using SALSA
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
David Heineman
Yao Dou
Mounica Maddela
Wei Xu
310
24
0
23 May 2023
USB: A Unified Summarization Benchmark Across Tasks and Domains
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Kundan Krishna
Prakhar Gupta
S. Ramprasad
Byron C. Wallace
Jeffrey P. Bigham
Zachary Chase Lipton
HILM
277
9
0
23 May 2023
On Learning to Summarize with Large Language Models as References
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Yixin Liu
Kejian Shi
Katherine S He
Longtian Ye
Alexander R. Fabbri
Pengfei Liu
Dragomir R. Radev
Arman Cohan
ELM
407
112
0
23 May 2023
CTQScorer: Combining Multiple Features for In-context Example Selection for Machine Translation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Aswanth Kumar
Ratish Puduppully
Mary Dabre
Anoop Kunchukuttan
254
16
0
23 May 2023
Automated Metrics for Medical Multi-Document Summarization Disagree with Human Evaluations
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Lucy Lu Wang
Yulia Otmakhova
Jay DeYoung
Thinh Hung Truong
Bailey Kuehl
Erin Bransom
Byron C. Wallace
319
30
0
23 May 2023
APPLS: Evaluating Evaluation Metrics for Plain Language Summarization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yue Guo
Tal August
Gondy Leroy
T. Cohen
Lucy Lu Wang
456
18
0
23 May 2023
Element-aware Summarization with Large Language Models: Expert-aligned Evaluation and Chain-of-Thought Method
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yiming Wang
Zhuosheng Zhang
Rui Wang
270
117
0
22 May 2023
InheritSumm: A General, Versatile and Compact Summarizer by Distilling from GPT
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yichong Xu
Ruochen Xu
Dan Iter
Yang Liu
Shuohang Wang
Chenguang Zhu
Michael Zeng
115
13
0
22 May 2023
Enhancing Coherence of Extractive Summarization with Multitask Learning
Renlong Jie
Xiaojun Meng
Lifeng Shang
Xin Jiang
Qun Liu
201
1
0
22 May 2023
Revisiting the Architectures like Pointer Networks to Efficiently Improve the Next Word Distribution, Summarization Factuality, and Beyond
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Haw-Shiuan Chang
Zonghai Yao
Alolika Gon
Hong-ye Yu
Andrew McCallum
260
11
0
20 May 2023
Complex Claim Verification with Evidence Retrieved in the Wild
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Jifan Chen
Grace Kim
Aniruddh Sriram
Greg Durrett
Eunsol Choi
HILM
324
114
0
19 May 2023
Appraising the Potential Uses and Harms of LLMs for Medical Systematic Reviews
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Hye Sun Yun
Iain J. Marshall
T. Trikalinos
Byron C. Wallace
308
28
0
19 May 2023
TrueTeacher: Learning Factual Consistency Evaluation with Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Zorik Gekhman
Jonathan Herzig
Roee Aharoni
Chen Elkind
Idan Szpektor
HILM, ELM
402
97
0
18 May 2023
SLiC-HF: Sequence Likelihood Calibration with Human Feedback
Yao-Min Zhao
Rishabh Joshi
Tianqi Liu
Misha Khalman
Mohammad Saleh
Peter J. Liu
233
375
0
17 May 2023
C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models
Neural Information Processing Systems (NeurIPS), 2023
Yuzhen Huang
Yuzhuo Bai
Zhihao Zhu
Junlei Zhang
Jinghan Zhang
...
Yikai Zhang
Jiayi Lei
Yao Fu
Maosong Sun
Junxian He
ELM, LRM
425
741
0
15 May 2023
Summarizing, Simplifying, and Synthesizing Medical Evidence Using GPT-3 (with Varying Success)
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Chantal Shaib
Millicent Li
Sebastian Antony Joseph
Iain J. Marshall
Junyi Jessy Li
Byron C. Wallace
LM&MA, ELM
213
80
0
10 May 2023
Generating medically-accurate summaries of patient-provider dialogue: A multi-stage approach using large language models
Clinical Natural Language Processing Workshop (ClinicalNLP), 2023
Varun Nair
Elliot Schumacher
Anitha Kannan
LM&MA
324
10
0
10 May 2023
GersteinLab at MEDIQA-Chat 2023: Clinical Note Summarization from Doctor-Patient Conversations through Fine-tuning and In-context Learning
Clinical Natural Language Processing Workshop (ClinicalNLP), 2023
Xiangru Tang
Andrew Tran
Jeffrey Tan
Mark B. Gerstein
157
11
0
08 May 2023
The Current State of Summarization
Fabian Retkowski
272
10
0
08 May 2023
Entity-Based Evaluation of Political Bias in Automatic Summarization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Karen Zhou
Chenhao Tan
325
3
0
03 May 2023
Can LMs Generalize to Future Data? An Empirical Analysis on Text Summarization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
C. Cheang
Hou Pong Chan
Yang Li
Xuebo Liu
Zhao Li
Yanming Sun
Shudong Liu
Lidia S. Chao
528
10
0
03 May 2023
Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond
ACM Transactions on Knowledge Discovery from Data (TKDD), 2023
Jingfeng Yang
Hongye Jin
Ruixiang Tang
Xiaotian Han
Qizhang Feng
Haoming Jiang
Bing Yin
Helen Zhou
LM&MA
431
930
0
26 Apr 2023
Can Large Language Models Transform Computational Social Science?
International Conference on Computational Logic (ICCL), 2023
Caleb Ziems
William B. Held
Omar Shaikh
Jiaao Chen
Zhehao Zhang
Diyi Yang
LLMAG
486
432
0
12 Apr 2023
Extractive Summarization via ChatGPT for Faithful Summary Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Haopeng Zhang
Xiao Liu
Jiawei Zhang
257
111
0
09 Apr 2023
BloombergGPT: A Large Language Model for Finance
Shijie Wu
Ozan Irsoy
Steven Lu
Vadim Dabravolski
Mark Dredze
Sebastian Gehrmann
P. Kambadur
David S. Rosenberg
Gideon Mann
AIFin
681
1,143
0
30 Mar 2023
DERA: Enhancing Large Language Model Completions with Dialog-Enabled Resolving Agents
Clinical Natural Language Processing Workshop (ClinicalNLP), 2023
Varun Nair
Elliot Schumacher
Geoffrey Tso
Anitha Kannan
VLM
251
68
0
30 Mar 2023