ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1610.02124
  4. Cited By
There's No Comparison: Reference-less Evaluation Metrics in Grammatical
  Error Correction

There's No Comparison: Reference-less Evaluation Metrics in Grammatical Error Correction

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2016
7 October 2016
Courtney Napoles
Keisuke Sakaguchi
Joel R. Tetreault
ArXiv (abs)PDFHTML

Papers citing "There's No Comparison: Reference-less Evaluation Metrics in Grammatical Error Correction"

31 / 31 papers shown
Introducing OmniGEC: A Silver Multilingual Dataset for Grammatical Error Correction
Roman Kovalchuk
Mariana Romanyshyn
Petro Ivaniuk
SyDa
121
0
0
18 Sep 2025
Differentially-private text generation degrades output language quality
Differentially-private text generation degrades output language quality
Erion Cano
Ivan Habernal
SyDa
96
0
0
14 Sep 2025
Opportunities and Challenges of LLMs in Education: An NLP Perspective
Opportunities and Challenges of LLMs in Education: An NLP Perspective
Sowmya Vajjala
Bashar Alhafni
Stefano Banno
Kaushal Kumar Maurya
Ekaterina Kochmar
AI4Ed
249
2
0
30 Jul 2025
Advancements in Arabic Grammatical Error Detection and Correction: An
  Empirical Investigation
Advancements in Arabic Grammatical Error Detection and Correction: An Empirical InvestigationConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Bashar Alhafni
Go Inoue
Christian Khairallah
Farah E. Shamout
175
28
0
24 May 2023
CLEME: Debiasing Multi-reference Evaluation for Grammatical Error
  Correction
CLEME: Debiasing Multi-reference Evaluation for Grammatical Error CorrectionConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Jingheng Ye
Hai-Tao Zheng
Qingyu Zhou
Yongqian Li
Shirong Ma
Haitao Zheng
Ying Shen
309
8
0
18 May 2023
How to choose "Good" Samples for Text Data Augmentation
How to choose "Good" Samples for Text Data Augmentation
Xiaotian Lin
Hongyan Wu
Yingwen Fu
Ziyu Yang
Shengyi Jiang
194
2
0
02 Feb 2023
Grammatical Error Correction: A Survey of the State of the Art
Grammatical Error Correction: A Survey of the State of the ArtComputational Linguistics (CL), 2022
Christopher Bryant
Zheng Yuan
Muhammad Reza Qorib
Hannan Cao
Hwee Tou Ng
Ted Briscoe
3DV
247
113
0
09 Nov 2022
Revisiting Grammatical Error Correction Evaluation and Beyond
Revisiting Grammatical Error Correction Evaluation and BeyondConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Peiyuan Gong
Xuebo Liu
Heyan Huang
Min Zhang
174
22
0
03 Nov 2022
Universal Evasion Attacks on Summarization Scoring
Universal Evasion Attacks on Summarization ScoringBlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2022
Wenchuan Mu
Kwan Hui Lim
AAML
217
1
0
25 Oct 2022
Towards Automated Document Revision: Grammatical Error Correction,
  Fluency Edits, and Beyond
Towards Automated Document Revision: Grammatical Error Correction, Fluency Edits, and BeyondWorkshop on Innovative Use of NLP for Building Educational Applications (UNBEA), 2022
Masato Mita
Keisuke Sakaguchi
Masato Hagiwara
Tomoya Mizumoto
Jun Suzuki
Kentaro Inui
132
22
0
23 May 2022
Construction of a Quality Estimation Dataset for Automatic Evaluation of
  Japanese Grammatical Error Correction
Construction of a Quality Estimation Dataset for Automatic Evaluation of Japanese Grammatical Error CorrectionInternational Conference on Language Resources and Evaluation (LREC), 2022
Daisuke Suzuki
Yujin Takahashi
Ikumi Yamashita
Taichi Aida
Tosho Hirasawa
Michitaka Nakatsuji
Masato Mita
Mamoru Komachi
86
1
0
20 Jan 2022
LM-Critic: Language Models for Unsupervised Grammatical Error Correction
LM-Critic: Language Models for Unsupervised Grammatical Error Correction
Michihiro Yasunaga
J. Leskovec
Abigail Z. Jacobs
187
54
0
14 Sep 2021
SMURF: SeMantic and linguistic UndeRstanding Fusion for Caption
  Evaluation via Typicality Analysis
SMURF: SeMantic and linguistic UndeRstanding Fusion for Caption Evaluation via Typicality AnalysisAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Joshua Forster Feinglass
Yezhou Yang
81
24
0
02 Jun 2021
On the Use of Linguistic Features for the Evaluation of Generative
  Dialogue Systems
On the Use of Linguistic Features for the Evaluation of Generative Dialogue Systems
Ian Berlot-Attwell
Frank Rudzicz
98
2
0
13 Apr 2021
Assessing Reference-Free Peer Evaluation for Machine Translation
Assessing Reference-Free Peer Evaluation for Machine TranslationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021
Sweta Agrawal
George F. Foster
Markus Freitag
Colin Cherry
LRM
133
11
0
12 Apr 2021
Evaluating the Morphosyntactic Well-formedness of Generated Texts
Evaluating the Morphosyntactic Well-formedness of Generated TextsConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Adithya Pratapa
Antonios Anastasopoulos
Shruti Rijhwani
Aditi Chaudhary
David R. Mortensen
Graham Neubig
Yulia Tsvetkov
152
9
0
30 Mar 2021
A Comprehensive Survey of Grammar Error Correction
A Comprehensive Survey of Grammar Error Correction
Yu Wang
Yuelin Wang
Jie Liu
Zhuowei Liu
215
39
0
02 May 2020
BLEU Neighbors: A Reference-less Approach to Automatic Evaluation
BLEU Neighbors: A Reference-less Approach to Automatic Evaluation
Kawin Ethayarajh
Dorsa Sadigh
164
4
0
27 Apr 2020
Towards Minimal Supervision BERT-based Grammar Error Correction
Towards Minimal Supervision BERT-based Grammar Error CorrectionAAAI Conference on Artificial Intelligence (AAAI), 2020
Yiyuan Li
Antonios Anastasopoulos
A. Black
99
12
0
10 Jan 2020
Diamonds in the Rough: Generating Fluent Sentences from Early-Stage
  Drafts for Academic Writing Assistance
Diamonds in the Rough: Generating Fluent Sentences from Early-Stage Drafts for Academic Writing AssistanceInternational Conference on Natural Language Generation (INLG), 2019
Takumi Ito
Tatsuki Kuribayashi
Hayato Kobayashi
Ana Brassard
Masato Hagiwara
Jun Suzuki
Kentaro Inui
146
26
0
21 Oct 2019
Automatic Quality Estimation for Natural Language Generation: Ranting
  (Jointly Rating and Ranking)
Automatic Quality Estimation for Natural Language Generation: Ranting (Jointly Rating and Ranking)International Conference on Natural Language Generation (INLG), 2019
Ondrej Dusek
Karin Sevegnani
Ioannis Konstas
Verena Rieser
ALM
141
10
0
10 Oct 2019
On conducting better validation studies of automatic metrics in natural
  language generation evaluation
On conducting better validation studies of automatic metrics in natural language generation evaluation
Johnny Tian-Zheng Wei
123
1
0
31 Jul 2019
An Analysis of Source-Side Grammatical Errors in NMT
An Analysis of Source-Side Grammatical Errors in NMT
Antonios Anastasopoulos
101
18
0
24 May 2019
Reaching Human-level Performance in Automatic Grammatical Error
  Correction: An Empirical Study
Reaching Human-level Performance in Automatic Grammatical Error Correction: An Empirical Study
Tao Ge
Furu Wei
M. Zhou
353
87
0
03 Jul 2018
Inherent Biases in Reference based Evaluation for Grammatical Error
  Correction and Text Simplification
Inherent Biases in Reference based Evaluation for Grammatical Error Correction and Text Simplification
Leshem Choshen
Omri Abend
161
36
0
30 Apr 2018
Automatic Metric Validation for Grammatical Error Correction
Automatic Metric Validation for Grammatical Error Correction
Leshem Choshen
Omri Abend
143
33
0
30 Apr 2018
Reference-less Measure of Faithfulness for Grammatical Error Correction
Reference-less Measure of Faithfulness for Grammatical Error Correction
Leshem Choshen
Omri Abend
3DV
136
35
0
11 Apr 2018
Dear Sir or Madam, May I introduce the GYAFC Dataset: Corpus, Benchmarks
  and Metrics for Formality Style Transfer
Dear Sir or Madam, May I introduce the GYAFC Dataset: Corpus, Benchmarks and Metrics for Formality Style Transfer
Sudha Rao
Joel R. Tetreault
253
422
0
17 Mar 2018
Referenceless Quality Estimation for Natural Language Generation
Referenceless Quality Estimation for Natural Language Generation
Ondrej Dusek
Jekaterina Novikova
Verena Rieser
170
30
0
05 Aug 2017
Why We Need New Evaluation Metrics for NLG
Why We Need New Evaluation Metrics for NLG
Jekaterina Novikova
Ondrej Dusek
Amanda Cercas Curry
Verena Rieser
211
491
0
21 Jul 2017
Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep
  Reinforcement Learning
Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep Reinforcement Learning
Baolin Peng
Xiujun Li
Lihong Li
Jianfeng Gao
Asli Celikyilmaz
Sungjin Lee
Kam-Fai Wong
BDL
318
192
0
10 Apr 2017
1