ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2008.08896
  4. Cited By
Towards a Decomposable Metric for Explainable Evaluation of Text
  Generation from AMR
v1v2v3 (latest)

Towards a Decomposable Metric for Explainable Evaluation of Text Generation from AMR

20 August 2020
Juri Opitz
Anette Frank
ArXiv (abs)PDFHTML

Papers citing "Towards a Decomposable Metric for Explainable Evaluation of Text Generation from AMR"

50 / 68 papers shown
Putnam-AXIOM: A Functional and Static Benchmark for Measuring Higher Level Mathematical Reasoning in LLMs
Putnam-AXIOM: A Functional and Static Benchmark for Measuring Higher Level Mathematical Reasoning in LLMs
Aryan Gulati
Brando Miranda
Eric Chen
Emily Xia
Kai Fronsdal
Bruno Dumont
Elyas Obbad
Sanmi Koyejo
AIMatReLMLRM
541
9
0
05 Aug 2025
PromptOptMe: Error-Aware Prompt Compression for LLM-based MT Evaluation
  Metrics
PromptOptMe: Error-Aware Prompt Compression for LLM-based MT Evaluation MetricsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Daniil Larionov
Steffen Eger
291
7
0
20 Dec 2024
Faithful Chart Summarization with ChaTS-Pi
Faithful Chart Summarization with ChaTS-Pi
Syrine Krichene
Francesco Piccinno
Fangyu Liu
Julian Martin Eisenschlos
338
3
0
29 May 2024
Natural Language Processing RELIES on Linguistics
Natural Language Processing RELIES on LinguisticsComputational Linguistics (CL), 2024
Juri Opitz
Shira Wein
Nathan Schneider
AI4CE
807
13
0
09 May 2024
A Systematic Review of Data-to-Text NLG
A Systematic Review of Data-to-Text NLG
Chinonso Osuji
Thiago Castro Ferreira
Brian Davis
412
5
0
13 Feb 2024
A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for
  Verifiers of Reasoning Chains
A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains
Alon Jacovi
Yonatan Bitton
Bernd Bohnet
Jonathan Herzig
Or Honovich
Michael Tseng
Michael Collins
Roee Aharoni
Mor Geva
LRM
545
55
0
01 Feb 2024
AMR4NLI: Interpretable and robust NLI measures from semantic graphs
AMR4NLI: Interpretable and robust NLI measures from semantic graphsInternational Conference on Computational Semantics (IWCS), 2023
Juri Opitz
Shira Wein
Julius Steen
Anette Frank
Nathan Schneider
297
1
0
01 Jun 2023
SMATCH++: Standardized and Extended Evaluation of Semantic Graphs
SMATCH++: Standardized and Extended Evaluation of Semantic GraphsFindings (Findings), 2023
Juri Opitz
202
30
0
11 May 2023
Counterfactual Edits for Generative Evaluation
Counterfactual Edits for Generative Evaluation
Maria Lymperaiou
Giorgos Filandrianos
Konstantinos Thomas
Giorgos Stamou
EGVM
285
1
0
02 Mar 2023
Investigating the Effect of Relative Positional Embeddings on
  AMR-to-Text Generation with Structural Adapters
Investigating the Effect of Relative Positional Embeddings on AMR-to-Text Generation with Structural AdaptersConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Sébastien Montella
Alexis Nasr
Johannes Heinecke
Frédéric Béchet
L. Rojas-Barahona
306
3
0
12 Feb 2023
MAUVE Scores for Generative Models: Theory and Practice
MAUVE Scores for Generative Models: Theory and PracticeJournal of machine learning research (JMLR), 2022
Krishna Pillutla
Lang Liu
John Thickstun
Sean Welleck
Swabha Swayamdipta
Rowan Zellers
Sewoong Oh
Yejin Choi
Zaïd Harchaoui
EGVM
351
36
0
30 Dec 2022
ROSCOE: A Suite of Metrics for Scoring Step-by-Step Reasoning
ROSCOE: A Suite of Metrics for Scoring Step-by-Step Reasoning
O. Yu. Golovneva
Moya Chen
Spencer Poff
Martin Corredor
Luke Zettlemoyer
Maryam Fazel-Zarandi
Asli Celikyilmaz
ReLMLRM
371
233
0
15 Dec 2022
Better Smatch = Better Parser? AMR evaluation is not so simple anymore
Better Smatch = Better Parser? AMR evaluation is not so simple anymore
Juri Opitz
Anette Frank
191
20
0
12 Oct 2022
Graph-to-Text Generation with Dynamic Structure Pruning
Graph-to-Text Generation with Dynamic Structure PruningInternational Conference on Computational Linguistics (COLING), 2022
Liang Li
Ruiying Geng
Bowen Li
Can Ma
Yinliang Yue
Binhua Li
Yongbin Li
239
4
0
15 Sep 2022
A Survey : Neural Networks for AMR-to-Text
Hongyu Hao
Guangtong Li
Zhiming Hu
Huafeng Wang
231
1
0
15 Jun 2022
SBERT studies Meaning Representations: Decomposing Sentence Embeddings
  into Explainable Semantic Features
SBERT studies Meaning Representations: Decomposing Sentence Embeddings into Explainable Semantic Features
Juri Opitz
Anette Frank
365
49
0
14 Jun 2022
A Dynamic, Interpreted CheckList for Meaning-oriented NLG Metric
  Evaluation -- through the Lens of Semantic Similarity Rating
A Dynamic, Interpreted CheckList for Meaning-oriented NLG Metric Evaluation -- through the Lens of Semantic Similarity Rating
Laura Zeidler
Juri Opitz
Anette Frank
224
6
0
24 May 2022
SMARAGD: Learning SMatch for Accurate and Rapid Approximate Graph
  Distance
SMARAGD: Learning SMatch for Accurate and Rapid Approximate Graph DistanceInternational Conference on Computational Semantics (IWCS), 2022
Juri Opitz
Philipp Meier
Anette Frank
278
1
0
24 Mar 2022
Unsupervised Full Constituency Parsing with Neighboring Distribution
  Divergence
Unsupervised Full Constituency Parsing with Neighboring Distribution Divergence
Letian Peng
Zuchao Li
Hai Zhao
190
0
0
29 Oct 2021
Contextualized Semantic Distance between Highly Overlapped Texts
Contextualized Semantic Distance between Highly Overlapped Texts
Letian Peng
Z. Li
Hai Zhao
262
2
0
04 Oct 2021
Weisfeiler-Leman in the BAMBOO: Novel AMR Graph Metrics and a Benchmark
  for AMR Graph Similarity
Weisfeiler-Leman in the BAMBOO: Novel AMR Graph Metrics and a Benchmark for AMR Graph SimilarityTransactions of the Association for Computational Linguistics (TACL), 2021
Juri Opitz
Angel Daza
A. Frank
207
36
0
26 Aug 2021
Evaluating the Tradeoff Between Abstractiveness and Factuality in
  Abstractive Summarization
Evaluating the Tradeoff Between Abstractiveness and Factuality in Abstractive SummarizationFindings (Findings), 2021
Markus Dreyer
Mengwen Liu
Feng Nan
Sandeep Atluri
Sujith Ravi
HILM
265
20
0
05 Aug 2021
Data-QuestEval: A Referenceless Metric for Data-to-Text Semantic
  Evaluation
Data-QuestEval: A Referenceless Metric for Data-to-Text Semantic EvaluationConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Clément Rebuffel
Thomas Scialom
Laure Soulier
Benjamin Piwowarski
Sylvain Lamprier
Jacopo Staiano
Geoffrey Scoutheeten
Patrick Gallinari
310
36
0
15 Apr 2021
Structural Adapters in Pretrained Language Models for AMR-to-text
  Generation
Structural Adapters in Pretrained Language Models for AMR-to-text GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Leonardo F. R. Ribeiro
Yue Zhang
Iryna Gurevych
261
80
0
16 Mar 2021
The GEM Benchmark: Natural Language Generation, its Evaluation and
  Metrics
The GEM Benchmark: Natural Language Generation, its Evaluation and MetricsIEEE Games Entertainment Media Conference (IEEE GEM), 2021
Sebastian Gehrmann
Tosin Adewumi
Karmanya Aggarwal
Pawan Sasanka Ammanamanchi
Aremu Anuoluwapo
...
Nishant Subramani
Wei Xu
Diyi Yang
Akhila Yerukola
Jiawei Zhou
VLM
974
316
0
02 Feb 2021
MAUVE: Measuring the Gap Between Neural Text and Human Text using
  Divergence Frontiers
MAUVE: Measuring the Gap Between Neural Text and Human Text using Divergence FrontiersNeural Information Processing Systems (NeurIPS), 2021
Krishna Pillutla
Swabha Swayamdipta
Rowan Zellers
John Thickstun
Sean Welleck
Yejin Choi
Zaïd Harchaoui
496
482
0
02 Feb 2021
Promoting Graph Awareness in Linearized Graph-to-Text Generation
Promoting Graph Awareness in Linearized Graph-to-Text GenerationFindings (Findings), 2020
Alexander Miserlis Hoyle
Ana Marasović
Noah A. Smith
AI4CE
215
32
0
31 Dec 2020
GRUEN for Evaluating Linguistic Quality of Generated Text
GRUEN for Evaluating Linguistic Quality of Generated TextFindings (Findings), 2020
Wanzheng Zhu
S. Bhat
329
78
0
06 Oct 2020
A Survey of Evaluation Metrics Used for NLG Systems
A Survey of Evaluation Metrics Used for NLG SystemsACM Computing Surveys (ACM CSUR), 2020
Ananya B. Sai
Akash Kumar Mohankumar
Mitesh M. Khapra
ELM
526
314
0
27 Aug 2020
Neural Machine Translation with Error Correction
Neural Machine Translation with Error Correction
Kaitao Song
Xu Tan
Jianfeng Lu
340
61
0
21 Jul 2020
Tangled up in BLEU: Reevaluating the Evaluation of Automatic Machine
  Translation Evaluation Metrics
Tangled up in BLEU: Reevaluating the Evaluation of Automatic Machine Translation Evaluation MetricsAnnual Meeting of the Association for Computational Linguistics (ACL), 2020
Nitika Mathur
Tim Baldwin
Trevor Cohn
291
293
0
11 Jun 2020
AMR Quality Rating with a Lightweight CNN
AMR Quality Rating with a Lightweight CNN
Juri Opitz
254
7
0
25 May 2020
GPT-too: A language-model-first approach for AMR-to-text generation
GPT-too: A language-model-first approach for AMR-to-text generation
Manuel Mager
Ramón Fernández Astudillo
Tahira Naseem
Md Arafat Sultan
Young-Suk Lee
Radu Florian
Salim Roukos
346
104
0
18 May 2020
On Exposure Bias, Hallucination and Domain Shift in Neural Machine
  Translation
On Exposure Bias, Hallucination and Domain Shift in Neural Machine Translation
Chaojun Wang
Rico Sennrich
311
187
0
07 May 2020
USR: An Unsupervised and Reference Free Evaluation Metric for Dialog
  Generation
USR: An Unsupervised and Reference Free Evaluation Metric for Dialog GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2020
Shikib Mehri
M. Eskénazi
278
266
0
01 May 2020
ToTTo: A Controlled Table-To-Text Generation Dataset
ToTTo: A Controlled Table-To-Text Generation DatasetConference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Ankur P. Parikh
Xuezhi Wang
Sebastian Gehrmann
Manaal Faruqui
Bhuwan Dhingra
Diyi Yang
Dipanjan Das
LMTD
532
432
0
29 Apr 2020
A Human Evaluation of AMR-to-English Generation Systems
A Human Evaluation of AMR-to-English Generation SystemsInternational Conference on Computational Linguistics (COLING), 2020
Emma Manning
Shira Wein
Nathan Schneider
302
21
0
14 Apr 2020
AMR Parsing via Graph-Sequence Iterative Inference
AMR Parsing via Graph-Sequence Iterative InferenceAnnual Meeting of the Association for Computational Linguistics (ACL), 2020
Deng Cai
Wai Lam
GNN
281
120
0
12 Apr 2020
BLEURT: Learning Robust Metrics for Text Generation
BLEURT: Learning Robust Metrics for Text GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2020
Thibault Sellam
Dipanjan Das
Ankur P. Parikh
762
1,838
0
09 Apr 2020
How Furiously Can Colourless Green Ideas Sleep? Sentence Acceptability
  in Context
How Furiously Can Colourless Green Ideas Sleep? Sentence Acceptability in ContextTransactions of the Association for Computational Linguistics (TACL), 2020
Jey Han Lau
C. S. Armendariz
Shalom Lappin
Matthew Purver
Chang Shu
205
45
0
02 Apr 2020
Learning to Compare for Better Training and Evaluation of Open Domain
  Natural Language Generation Models
Learning to Compare for Better Training and Evaluation of Open Domain Natural Language Generation ModelsAAAI Conference on Artificial Intelligence (AAAI), 2020
Wangchunshu Zhou
Ke Xu
ELMALM
246
49
0
12 Feb 2020
AMR Similarity Metrics from Principles
AMR Similarity Metrics from PrinciplesTransactions of the Association for Computational Linguistics (TACL), 2020
Juri Opitz
Letitia Parcalabescu
Anette Frank
225
49
0
29 Jan 2020
Graph Transformer for Graph-to-Sequence Learning
Graph Transformer for Graph-to-Sequence LearningAAAI Conference on Artificial Intelligence (AAAI), 2019
Deng Cai
W. Lam
395
247
0
18 Nov 2019
Semantic Noise Matters for Neural Natural Language Generation
Semantic Noise Matters for Neural Natural Language GenerationInternational Conference on Natural Language Generation (INLG), 2019
Ondrej Dusek
David M. Howcroft
Verena Rieser
309
121
0
10 Nov 2019
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and
  lighter
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
Victor Sanh
Lysandre Debut
Julien Chaumond
Thomas Wolf
3.4K
9,449
0
02 Oct 2019
Enhancing AMR-to-Text Generation with Dual Graph Representations
Enhancing AMR-to-Text Generation with Dual Graph RepresentationsConference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Leonardo F. R. Ribeiro
Claire Gardent
Iryna Gurevych
211
65
0
01 Sep 2019
Densely Connected Graph Convolutional Networks for Graph-to-Sequence
  Learning
Densely Connected Graph Convolutional Networks for Graph-to-Sequence LearningTransactions of the Association for Computational Linguistics (TACL), 2019
Zhijiang Guo
Yan Zhang
Zhiyang Teng
Wei Lu
GNN
307
144
0
16 Aug 2019
RoBERTa: A Robustly Optimized BERT Pretraining Approach
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
6.0K
29,143
0
26 Jul 2019
Barack's Wife Hillary: Using Knowledge-Graphs for Fact-Aware Language
  Modeling
Barack's Wife Hillary: Using Knowledge-Graphs for Fact-Aware Language ModelingAnnual Meeting of the Association for Computational Linguistics (ACL), 2019
IV RobertL.Logan
Nelson F. Liu
Matthew E. Peters
Matt Gardner
Sameer Singh
RALM
397
202
0
17 Jun 2019
SemBleu: A Robust Metric for AMR Parsing Evaluation
SemBleu: A Robust Metric for AMR Parsing EvaluationAnnual Meeting of the Association for Computational Linguistics (ACL), 2019
Linfeng Song
D. Gildea
218
41
0
26 May 2019
12
Next
Page 1 of 2