ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1808.10006
  4. Cited By
Correcting Length Bias in Neural Machine Translation
v1v2 (latest)

Correcting Length Bias in Neural Machine Translation

29 August 2018
Kenton W. Murray
David Chiang
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Correcting Length Bias in Neural Machine Translation"

50 / 113 papers shown
Interactive Text Generation
Interactive Text GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Felix Faltings
Michel Galley
Baolin Peng
Kianté Brantley
Weixin Cai
Yizhe Zhang
Jianfeng Gao
Bill Dolan
313
0
0
02 Mar 2023
Semantic Uncertainty: Linguistic Invariances for Uncertainty Estimation
  in Natural Language Generation
Semantic Uncertainty: Linguistic Invariances for Uncertainty Estimation in Natural Language GenerationInternational Conference on Learning Representations (ICLR), 2023
Lorenz Kuhn
Y. Gal
Sebastian Farquhar
UQLM
589
474
0
19 Feb 2023
DC-MBR: Distributional Cooling for Minimum Bayesian Risk Decoding
DC-MBR: Distributional Cooling for Minimum Bayesian Risk DecodingInternational Conference on Language Resources and Evaluation (LREC), 2022
Jianhao Yan
Jin Xu
Fandong Meng
Jie Zhou
Yue Zhang
340
4
0
08 Dec 2022
Monotonic segmental attention for automatic speech recognition
Monotonic segmental attention for automatic speech recognitionSpoken Language Technology Workshop (SLT), 2022
Albert Zeyer
Robin Schmitt
Wei Zhou
Ralf Schluter
Hermann Ney
125
11
0
26 Oct 2022
A Continuum of Generation Tasks for Investigating Length Bias and
  Degenerate Repetition
A Continuum of Generation Tasks for Investigating Length Bias and Degenerate RepetitionBlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2022
Darcey Riley
David Chiang
192
6
0
19 Oct 2022
The Tail Wagging the Dog: Dataset Construction Biases of Social Bias
  Benchmarks
The Tail Wagging the Dog: Dataset Construction Biases of Social Bias BenchmarksAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Nikil Selvam
Sunipa Dev
Daniel Khashabi
Tushar Khot
Kai-Wei Chang
ALM
283
31
0
18 Oct 2022
CTC Alignments Improve Autoregressive Translation
CTC Alignments Improve Autoregressive TranslationConference of the European Chapter of the Association for Computational Linguistics (EACL), 2022
Brian Yan
Siddharth Dalmia
Yosuke Higuchi
Graham Neubig
Florian Metze
A. Black
Shinji Watanabe
181
36
0
11 Oct 2022
PEER: A Collaborative Language Model
PEER: A Collaborative Language ModelInternational Conference on Learning Representations (ICLR), 2022
Timo Schick
Jane Dwivedi-Yu
Zhengbao Jiang
Fabio Petroni
Patrick Lewis
Gautier Izacard
Qingfei You
Christoforos Nalmpantis
Edouard Grave
Sebastian Riedel
ALM
262
104
0
24 Aug 2022
Exploring Length Generalization in Large Language Models
Exploring Length Generalization in Large Language ModelsNeural Information Processing Systems (NeurIPS), 2022
Cem Anil
Yuhuai Wu
Anders Andreassen
Aitor Lewkowycz
Vedant Misra
V. Ramasesh
Ambrose Slone
Guy Gur-Ari
Ethan Dyer
Behnam Neyshabur
ReLMLRM
348
211
0
11 Jul 2022
So Different Yet So Alike! Constrained Unsupervised Text Style Transfer
So Different Yet So Alike! Constrained Unsupervised Text Style TransferAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Abhinav Ramesh Kashyap
Devamanyu Hazarika
Min-Yen Kan
Roger Zimmermann
Soujanya Poria
GAN
182
16
0
09 May 2022
Quality-Aware Decoding for Neural Machine Translation
Quality-Aware Decoding for Neural Machine TranslationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022
Patrick Fernandes
António Farinhas
Ricardo Rei
José G. C. de Souza
Perez Ogayo
Graham Neubig
Marcely Zanon Boito
264
61
0
02 May 2022
Jam or Cream First? Modeling Ambiguity in Neural Machine Translation
  with SCONES
Jam or Cream First? Modeling Ambiguity in Neural Machine Translation with SCONESNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022
Felix Stahlberg
Shankar Kumar
UQLM
195
13
0
02 May 2022
The Implicit Length Bias of Label Smoothing on Beam Search Decoding
The Implicit Length Bias of Label Smoothing on Beam Search Decoding
Bowen Liang
Pidong Wang
Yuan Cao
208
1
0
02 May 2022
On the Role of Pre-trained Language Models in Word Ordering: A Case
  Study with BART
On the Role of Pre-trained Language Models in Word Ordering: A Case Study with BARTInternational Conference on Computational Linguistics (COLING), 2022
Zebin Ou
Meishan Zhang
Yue Zhang
129
3
0
15 Apr 2022
A Call for Clarity in Beam Search: How It Works and When It Stops
A Call for Clarity in Beam Search: How It Works and When It StopsInternational Conference on Language Resources and Evaluation (LREC), 2022
Jungo Kasai
Keisuke Sakaguchi
Ronan Le Bras
Dragomir R. Radev
Yejin Choi
Noah A. Smith
283
10
0
11 Apr 2022
Uncertainty Determines the Adequacy of the Mode and the Tractability of
  Decoding in Sequence-to-Sequence Models
Uncertainty Determines the Adequacy of the Mode and the Tractability of Decoding in Sequence-to-Sequence ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Felix Stahlberg
Ilia Kulikov
Shankar Kumar
UQLM
246
12
0
01 Apr 2022
On Decoding Strategies for Neural Text Generators
On Decoding Strategies for Neural Text GeneratorsTransactions of the Association for Computational Linguistics (TACL), 2022
Gian Wiher
Clara Meister
Robert Bamler
260
88
0
29 Mar 2022
Sequence-to-Sequence Knowledge Graph Completion and Question Answering
Sequence-to-Sequence Knowledge Graph Completion and Question AnsweringAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Apoorv Saxena
Adrian Kochsiek
Rainer Gemulla
AIMat
288
162
0
19 Mar 2022
Pretrained Language Models for Text Generation: A Survey
Pretrained Language Models for Text Generation: A SurveyACM Computing Surveys (ACM CSUR), 2022
Junyi Li
Tianyi Tang
Wayne Xin Zhao
J. Nie
Ji-Rong Wen
AI4CE
518
251
0
14 Jan 2022
Does Entity Abstraction Help Generative Transformers Reason?
Does Entity Abstraction Help Generative Transformers Reason?
Nicolas Angelard-Gontier
Siva Reddy
C. Pal
234
6
0
05 Jan 2022
Characterizing and addressing the issue of oversmoothing in neural
  autoregressive sequence modeling
Characterizing and addressing the issue of oversmoothing in neural autoregressive sequence modeling
Ilia Kulikov
M. Eremeev
Dong Wang
228
9
0
16 Dec 2021
A Plug-and-Play Method for Controlled Text Generation
A Plug-and-Play Method for Controlled Text Generation
Damian Pascual
Béni Egressy
Clara Meister
Robert Bamler
Roger Wattenhofer
265
101
0
20 Sep 2021
Multi-Sentence Resampling: A Simple Approach to Alleviate Dataset Length
  Bias and Beam-Search Degradation
Multi-Sentence Resampling: A Simple Approach to Alleviate Dataset Length Bias and Beam-Search Degradation
Ivan Provilkov
A. Malinin
126
4
0
13 Sep 2021
Distilling the Knowledge of Large-scale Generative Models into Retrieval
  Models for Efficient Open-domain Conversation
Distilling the Knowledge of Large-scale Generative Models into Retrieval Models for Efficient Open-domain ConversationConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Beomsu Kim
Seokjun Seo
Seungju Han
Enkhbayar Erdenee
Buru Chang
RALM
207
6
0
28 Aug 2021
Sampling-Based Approximations to Minimum Bayes Risk Decoding for Neural
  Machine Translation
Sampling-Based Approximations to Minimum Bayes Risk Decoding for Neural Machine TranslationConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Bryan Eikema
Wilker Aziz
206
59
0
10 Aug 2021
What Do You Get When You Cross Beam Search with Nucleus Sampling?
What Do You Get When You Cross Beam Search with Nucleus Sampling?First Workshop on Insights from Negative Results in NLP (Insights), 2021
Uri Shaham
Omer Levy
246
11
0
20 Jul 2021
VAD-free Streaming Hybrid CTC/Attention ASR for Unsegmented Recording
VAD-free Streaming Hybrid CTC/Attention ASR for Unsegmented RecordingInterspeech (Interspeech), 2021
Hirofumi Inaguma
Tatsuya Kawahara
187
2
0
15 Jul 2021
StableEmit: Selection Probability Discount for Reducing Emission Latency
  of Streaming Monotonic Attention ASR
StableEmit: Selection Probability Discount for Reducing Emission Latency of Streaming Monotonic Attention ASR
Hirofumi Inaguma
Tatsuya Kawahara
165
4
0
01 Jul 2021
Digging Errors in NMT: Evaluating and Understanding Model Errors from
  Partial Hypothesis Space
Digging Errors in NMT: Evaluating and Understanding Model Errors from Partial Hypothesis SpaceConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Jianhao Yan
Chenming Wu
Fandong Meng
Jie Zhou
ELMLRM
143
2
0
29 Jun 2021
Automatic Document Sketching: Generating Drafts from Analogous Texts
Automatic Document Sketching: Generating Drafts from Analogous TextsFindings (Findings), 2021
Zeqiu Wu
Michel Galley
Chris Brockett
Yizhe Zhang
Bill Dolan
196
6
0
14 Jun 2021
Mode recovery in neural autoregressive sequence modeling
Mode recovery in neural autoregressive sequence modeling
Ilia Kulikov
Sean Welleck
Kyunghyun Cho
127
2
0
10 Jun 2021
Language Model Evaluation Beyond Perplexity
Language Model Evaluation Beyond PerplexityAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Clara Meister
Robert Bamler
440
99
0
31 May 2021
Searchable Hidden Intermediates for End-to-End Models of Decomposable
  Sequence Tasks
Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence TasksNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021
Siddharth Dalmia
Brian Yan
Vikas Raunak
Florian Metze
Shinji Watanabe
179
35
0
02 May 2021
Constrained Language Models Yield Few-Shot Semantic Parsers
Constrained Language Models Yield Few-Shot Semantic ParsersConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Richard Shin
C. H. Lin
Sam Thomson
Charles C. Chen
Subhro Roy
Emmanouil Antonios Platanios
Adam Pauls
Dan Klein
J. Eisner
Benjamin Van Durme
623
221
0
18 Apr 2021
Smoothing and Shrinking the Sparse Seq2Seq Search Space
Smoothing and Shrinking the Sparse Seq2Seq Search SpaceNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021
Ben Peters
André F. T. Martins
282
17
0
18 Mar 2021
Searching for Search Errors in Neural Morphological Inflection
Searching for Search Errors in Neural Morphological InflectionConference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
Martina Forster
Clara Meister
Robert Bamler
164
5
0
16 Feb 2021
Incremental Beam Manipulation for Natural Language Generation
Incremental Beam Manipulation for Natural Language GenerationConference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
J. Hargreaves
Andreas Vlachos
Guy Edward Toh Emerson
191
7
0
04 Feb 2021
Why Neural Machine Translation Prefers Empty Outputs
Why Neural Machine Translation Prefers Empty Outputs
Xing Shi
Yijun Xiao
Kevin Knight
AAML
131
9
0
24 Dec 2020
How Can We Know When Language Models Know? On the Calibration of
  Language Models for Question Answering
How Can We Know When Language Models Know? On the Calibration of Language Models for Question AnsweringTransactions of the Association for Computational Linguistics (TACL), 2020
Zhengbao Jiang
Jun Araki
Haibo Ding
Graham Neubig
UQCV
404
509
0
02 Dec 2020
The EOS Decision and Length Extrapolation
The EOS Decision and Length Extrapolation
Benjamin Newman
John Hewitt
Abigail Z. Jacobs
Christopher D. Manning
220
54
0
14 Oct 2020
On Long-Tailed Phenomena in Neural Machine Translation
On Long-Tailed Phenomena in Neural Machine TranslationFindings (Findings), 2020
Vikas Raunak
Siddharth Dalmia
Vivek Gupta
Florian Metze
145
30
0
10 Oct 2020
If beam search is the answer, what was the question?
If beam search is the answer, what was the question?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Clara Meister
Tim Vieira
Robert Bamler
369
153
0
06 Oct 2020
Task-Oriented Dialogue as Dataflow Synthesis
Task-Oriented Dialogue as Dataflow SynthesisTransactions of the Association for Computational Linguistics (TACL), 2020
Semantic Machines
Jacob Andreas
J. Bufe
David Burkett
Charles C. Chen
...
Izabela Witoszko
Jason Wolfe
A. Wray
Yuchen Zhang
Alexander Zotov
AIFin
539
170
0
24 Sep 2020
Text Generation by Learning from Demonstrations
Text Generation by Learning from Demonstrations
Richard Yuanzhe Pang
He He
OffRL
243
87
0
16 Sep 2020
Neural Language Generation: Formulation, Methods, and Evaluation
Neural Language Generation: Formulation, Methods, and Evaluation
Cristina Garbacea
Qiaozhu Mei
353
29
0
31 Jul 2020
Best-First Beam Search
Best-First Beam SearchTransactions of the Association for Computational Linguistics (TACL), 2020
Clara Meister
Tim Vieira
Robert Bamler
394
80
0
08 Jul 2020
MLE-guided parameter search for task loss minimization in neural
  sequence modeling
MLE-guided parameter search for task loss minimization in neural sequence modeling
Sean Welleck
Dong Wang
193
9
0
04 Jun 2020
Is MAP Decoding All You Need? The Inadequacy of the Mode in Neural
  Machine Translation
Is MAP Decoding All You Need? The Inadequacy of the Mode in Neural Machine Translation
Bryan Eikema
Wilker Aziz
272
152
0
20 May 2020
Early Stage LM Integration Using Local and Global Log-Linear Combination
Early Stage LM Integration Using Local and Global Log-Linear Combination
Wilfried Michel
Ralf Schluter
Hermann Ney
153
12
0
20 May 2020
On Exposure Bias, Hallucination and Domain Shift in Neural Machine
  Translation
On Exposure Bias, Hallucination and Domain Shift in Neural Machine Translation
Chaojun Wang
Rico Sennrich
226
181
0
07 May 2020
Previous
123
Next