v1v2v3 (latest)

Towards a Decomposable Metric for Explainable Evaluation of Text Generation from AMR

20 August 2020

Juri Opitz

Anette Frank

ArXiv (abs)PDF HTML

Papers citing "Towards a Decomposable Metric for Explainable Evaluation of Text Generation from AMR"

50 / 68 papers shown

Putnam-AXIOM: A Functional and Static Benchmark for Measuring Higher Level Mathematical Reasoning in LLMs

541

05 Aug 2025

PromptOptMe: Error-Aware Prompt Compression for LLM-based MT Evaluation MetricsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

Daniil Larionov

Steffen Eger

291

20 Dec 2024

Faithful Chart Summarization with ChaTS-Pi

Syrine Krichene

Francesco Piccinno

Fangyu Liu

Julian Martin Eisenschlos

338

29 May 2024

Natural Language Processing RELIES on LinguisticsComputational Linguistics (CL), 2024

807

09 May 2024

A Systematic Review of Data-to-Text NLG

Chinonso Osuji

Thiago Castro Ferreira

Brian Davis

412

13 Feb 2024

A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains

545

01 Feb 2024

AMR4NLI: Interpretable and robust NLI measures from semantic graphsInternational Conference on Computational Semantics (IWCS), 2023

297

01 Jun 2023

SMATCH++: Standardized and Extended Evaluation of Semantic GraphsFindings (Findings), 2023

Juri Opitz

202

11 May 2023

Counterfactual Edits for Generative Evaluation

Maria Lymperaiou

Giorgos Filandrianos

Konstantinos Thomas

Giorgos Stamou

EGVM

285

02 Mar 2023

Investigating the Effect of Relative Positional Embeddings on AMR-to-Text Generation with Structural AdaptersConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023

306

12 Feb 2023

MAUVE Scores for Generative Models: Theory and PracticeJournal of machine learning research (JMLR), 2022

Yejin Choi

351

30 Dec 2022

ROSCOE: A Suite of Metrics for Scoring Step-by-Step Reasoning

Luke Zettlemoyer

371

233

15 Dec 2022

Better Smatch = Better Parser? AMR evaluation is not so simple anymore

Juri Opitz

Anette Frank

191

12 Oct 2022

Graph-to-Text Generation with Dynamic Structure PruningInternational Conference on Computational Linguistics (COLING), 2022

239

15 Sep 2022

A Survey : Neural Networks for AMR-to-Text

Hongyu Hao

Guangtong Li

Zhiming Hu

Huafeng Wang

231

15 Jun 2022

SBERT studies Meaning Representations: Decomposing Sentence Embeddings into Explainable Semantic Features

Juri Opitz

Anette Frank

365

14 Jun 2022

A Dynamic, Interpreted CheckList for Meaning-oriented NLG Metric Evaluation -- through the Lens of Semantic Similarity Rating

Laura Zeidler

Juri Opitz

Anette Frank

224

24 May 2022

SMARAGD: Learning SMatch for Accurate and Rapid Approximate Graph DistanceInternational Conference on Computational Semantics (IWCS), 2022

Juri Opitz

Philipp Meier

Anette Frank

278

24 Mar 2022

Unsupervised Full Constituency Parsing with Neighboring Distribution Divergence

Letian Peng

Zuchao Li

Hai Zhao

190

29 Oct 2021

Contextualized Semantic Distance between Highly Overlapped Texts

Letian Peng

Z. Li

Hai Zhao

262

04 Oct 2021

Weisfeiler-Leman in the BAMBOO: Novel AMR Graph Metrics and a Benchmark for AMR Graph SimilarityTransactions of the Association for Computational Linguistics (TACL), 2021

Juri Opitz

Angel Daza

A. Frank

207

26 Aug 2021

Evaluating the Tradeoff Between Abstractiveness and Factuality in Abstractive SummarizationFindings (Findings), 2021

265

05 Aug 2021

Data-QuestEval: A Referenceless Metric for Data-to-Text Semantic EvaluationConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

310

15 Apr 2021

Structural Adapters in Pretrained Language Models for AMR-to-text GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Leonardo F. R. Ribeiro

Yue Zhang

Iryna Gurevych

261

16 Mar 2021

The GEM Benchmark: Natural Language Generation, its Evaluation and MetricsIEEE Games Entertainment Media Conference (IEEE GEM), 2021

Sebastian Gehrmann

Tosin Adewumi

Karmanya Aggarwal

Pawan Sasanka Ammanamanchi

...

Diyi Yang

974

316

02 Feb 2021

MAUVE: Measuring the Gap Between Neural Text and Human Text using Divergence FrontiersNeural Information Processing Systems (NeurIPS), 2021

Yejin Choi

496

482

02 Feb 2021

Promoting Graph Awareness in Linearized Graph-to-Text GenerationFindings (Findings), 2020

Alexander Miserlis Hoyle

Ana Marasović

Noah A. Smith

AI4CE

215

31 Dec 2020

GRUEN for Evaluating Linguistic Quality of Generated TextFindings (Findings), 2020

Wanzheng Zhu

S. Bhat

329

06 Oct 2020

A Survey of Evaluation Metrics Used for NLG SystemsACM Computing Surveys (ACM CSUR), 2020

Ananya B. Sai

Akash Kumar Mohankumar

Mitesh M. Khapra

ELM

526

314

27 Aug 2020

Neural Machine Translation with Error Correction

Kaitao Song

Xu Tan

Jianfeng Lu

340

21 Jul 2020

Tangled up in BLEU: Reevaluating the Evaluation of Automatic Machine Translation Evaluation MetricsAnnual Meeting of the Association for Computational Linguistics (ACL), 2020

Nitika Mathur

Tim Baldwin

Trevor Cohn

291

293

11 Jun 2020

AMR Quality Rating with a Lightweight CNN

Juri Opitz

254

25 May 2020

GPT-too: A language-model-first approach for AMR-to-text generation

Manuel Mager

Ramón Fernández Astudillo

346

104

18 May 2020

On Exposure Bias, Hallucination and Domain Shift in Neural Machine Translation

Chaojun Wang

Rico Sennrich

311

187

07 May 2020

USR: An Unsupervised and Reference Free Evaluation Metric for Dialog GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2020

Shikib Mehri

M. Eskénazi

278

266

01 May 2020

ToTTo: A Controlled Table-To-Text Generation DatasetConference on Empirical Methods in Natural Language Processing (EMNLP), 2020

Diyi Yang

532

432

29 Apr 2020

A Human Evaluation of AMR-to-English Generation SystemsInternational Conference on Computational Linguistics (COLING), 2020

Emma Manning

Shira Wein

Nathan Schneider

302

14 Apr 2020

AMR Parsing via Graph-Sequence Iterative InferenceAnnual Meeting of the Association for Computational Linguistics (ACL), 2020

Deng Cai

Wai Lam

GNN

281

120

12 Apr 2020

BLEURT: Learning Robust Metrics for Text GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2020

Thibault Sellam

Dipanjan Das

Ankur P. Parikh

762

1,838

09 Apr 2020

How Furiously Can Colourless Green Ideas Sleep? Sentence Acceptability in ContextTransactions of the Association for Computational Linguistics (TACL), 2020

205

02 Apr 2020

Learning to Compare for Better Training and Evaluation of Open Domain Natural Language Generation ModelsAAAI Conference on Artificial Intelligence (AAAI), 2020

Wangchunshu Zhou

Ke Xu

ELM ALM

246

12 Feb 2020

AMR Similarity Metrics from PrinciplesTransactions of the Association for Computational Linguistics (TACL), 2020

Juri Opitz

Letitia Parcalabescu

Anette Frank

225

29 Jan 2020

Graph Transformer for Graph-to-Sequence LearningAAAI Conference on Artificial Intelligence (AAAI), 2019

Deng Cai

W. Lam

395

247

18 Nov 2019

Semantic Noise Matters for Neural Natural Language GenerationInternational Conference on Natural Language Generation (INLG), 2019

Ondrej Dusek

David M. Howcroft

Verena Rieser

309

121

10 Nov 2019

DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter

3.4K

9,449

02 Oct 2019

Enhancing AMR-to-Text Generation with Dual Graph RepresentationsConference on Empirical Methods in Natural Language Processing (EMNLP), 2019

Leonardo F. R. Ribeiro

Claire Gardent

Iryna Gurevych

211

01 Sep 2019

Densely Connected Graph Convolutional Networks for Graph-to-Sequence LearningTransactions of the Association for Computational Linguistics (TACL), 2019

Wei Lu

307

144

16 Aug 2019

RoBERTa: A Robustly Optimized BERT Pretraining Approach

Luke Zettlemoyer

6.0K

29,143

26 Jul 2019

Barack's Wife Hillary: Using Knowledge-Graphs for Fact-Aware Language ModelingAnnual Meeting of the Association for Computational Linguistics (ACL), 2019

397

202

17 Jun 2019

SemBleu: A Robust Metric for AMR Parsing EvaluationAnnual Meeting of the Association for Computational Linguistics (ACL), 2019

Linfeng Song

D. Gildea

218

26 May 2019