ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2104.07541
  4. Cited By
Reward Optimization for Neural Machine Translation with Learned Metrics

Reward Optimization for Neural Machine Translation with Learned Metrics

15 April 2021
Raphael Shu
Kang Min Yoo
Jung-Woo Ha
ArXiv (abs)PDFHTMLGithub (25★)

Papers citing "Reward Optimization for Neural Machine Translation with Learned Metrics"

13 / 13 papers shown
Reward Models are Metrics in a Trench Coat
Reward Models are Metrics in a Trench Coat
Sebastian Gehrmann
188
0
0
03 Oct 2025
Adding Chocolate to Mint: Mitigating Metric Interference in Machine Translation
Adding Chocolate to Mint: Mitigating Metric Interference in Machine Translation
José P. Pombal
Nuno M. Guerreiro
Ricardo Rei
André F. T. Martins
451
5
0
11 Mar 2025
Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model
Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model
Yueqin Yin
Shentao Yang
Yujia Xie
Ziyi Yang
Yuting Sun
Hany Awadalla
Weizhu Chen
Mingyuan Zhou
402
9
0
07 Jan 2025
LLM See, LLM Do: Guiding Data Generation to Target Non-Differentiable
  Objectives
LLM See, LLM Do: Guiding Data Generation to Target Non-Differentiable Objectives
Luísa Shimabucoro
Sebastian Ruder
Julia Kreutzer
Marzieh Fadaee
Sara Hooker
SyDa
426
6
0
01 Jul 2024
A Dense Reward View on Aligning Text-to-Image Diffusion with Preference
A Dense Reward View on Aligning Text-to-Image Diffusion with Preference
Shentao Yang
Tianqi Chen
Mingyuan Zhou
EGVM
482
49
0
13 Feb 2024
Learning Evaluation Models from Large Language Models for Sequence Generation
Learning Evaluation Models from Large Language Models for Sequence Generation
Chenglong Wang
Hang Zhou
Kai-Chun Chang
Tongran Liu
Chunliang Zhang
Quan Du
Tong Xiao
Yue Zhang
Jingbo Zhu
ELM
745
5
0
08 Aug 2023
ESRL: Efficient Sampling-based Reinforcement Learning for Sequence
  Generation
ESRL: Efficient Sampling-based Reinforcement Learning for Sequence GenerationAAAI Conference on Artificial Intelligence (AAAI), 2023
Chenglong Wang
Hang Zhou
Yimin Hu
Yi Huo
Bei Li
Tongran Liu
Tong Xiao
Jingbo Zhu
293
13
0
04 Aug 2023
Preference-grounded Token-level Guidance for Language Model Fine-tuning
Preference-grounded Token-level Guidance for Language Model Fine-tuningNeural Information Processing Systems (NeurIPS), 2023
Shentao Yang
Shujian Zhang
Congying Xia
Yihao Feng
Caiming Xiong
Mi Zhou
564
33
0
01 Jun 2023
GROOT: Corrective Reward Optimization for Generative Sequential Labeling
GROOT: Corrective Reward Optimization for Generative Sequential Labeling
Kazuma Hashimoto
K. Raman
VLM
462
1
0
29 Sep 2022
Repairing the Cracked Foundation: A Survey of Obstacles in Evaluation
  Practices for Generated Text
Repairing the Cracked Foundation: A Survey of Obstacles in Evaluation Practices for Generated TextJournal of Artificial Intelligence Research (JAIR), 2022
Sebastian Gehrmann
Elizabeth Clark
Thibault Sellam
ELMAI4CE
780
230
0
14 Feb 2022
Learning Compact Metrics for MT
Learning Compact Metrics for MTConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Amy Pu
Hyung Won Chung
Ankur P. Parikh
Sebastian Gehrmann
Thibault Sellam
276
113
0
12 Oct 2021
Doubly-Trained Adversarial Data Augmentation for Neural Machine
  Translation
Doubly-Trained Adversarial Data Augmentation for Neural Machine TranslationConference of the Association for Machine Translation in the Americas (AMTA), 2021
Weiting Tan
Shuoyang Ding
Huda Khayrallah
Philipp Koehn
SILMAAML
232
1
0
12 Oct 2021
Convergence Properties of Stochastic Hypergradients
Convergence Properties of Stochastic HypergradientsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2020
Riccardo Grazzi
Massimiliano Pontil
Saverio Salzo
617
28
0
13 Nov 2020
1
Page 1 of 1