Bandit Structured Prediction for Neural Sequence-to-Sequence Learning

21 April 2017

Papers citing "Bandit Structured Prediction for Neural Sequence-to-Sequence Learning"

Title | Authors | Tag | Counts | Date
Fine-Grained Reward Optimization for Machine Translation using Error Severity Mappings | Miguel Moura Ramos, Tomás Almeida, Daniel Vareta, Filipe Azevedo, Sweta Agrawal, Patrick Fernandes, André F. T. Martins | - | 31 1 0 | 08 Nov 2024
State-of-the-art generalisation research in NLP: A taxonomy and review | Dieuwke Hupkes, Mario Giulianelli, Verna Dankers, Mikel Artetxe, Yanai Elazar, ..., Leila Khalatbari, Maria Ryskina, Rita Frieske, Ryan Cotterell, Zhijing Jin | - | 114 93 0 | 06 Oct 2022
Fixing exposure bias with imitation learning needs powerful oracles | L. Hormann, Artem Sokolov | - | 26 3 0 | 09 Sep 2021
Continual Learning for Grounded Instruction Generation by Observing Human Following Behavior | Noriyuki Kojima, Alane Suhr, Yoav Artzi | - | 25 24 0 | 10 Aug 2021
Revisiting the Weaknesses of Reinforcement Learning for Neural Machine Translation | Samuel Kiegeland, Julia Kreutzer | AAML | 31 46 0 | 16 Jun 2021
Interactive Learning from Activity Description | Khanh Nguyen, Dipendra Kumar Misra, Robert Schapire, Miroslav Dudík, Patrick Shafto | - | 47 34 0 | 13 Feb 2021
Machine Translation System Selection from Bandit Feedback | Jason Naradowsky, Xuan Zhang, Kevin Duh | OffRL | 11 8 0 | 22 Feb 2020
APRIL: Interactively Learning to Summarise by Combining Active Preference Learning and Reinforcement Learning | Yang Gao, Christian M. Meyer, Iryna Gurevych | - | 13 34 0 | 29 Aug 2018
Reliability and Learnability of Human Bandit Feedback for Sequence-to-Sequence Reinforcement Learning | Julia Kreutzer, Joshua Uyheng, Stefan Riezler | - | 25 83 0 | 27 May 2018
Reinforcement Learning for Bandit Neural Machine Translation with Simulated Human Feedback | Khanh Nguyen, Hal Daumé, Jordan L. Boyd-Graber | - | 27 135 0 | 24 Jul 2017
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation | Yonghui Wu, M. Schuster, Z. Chen, Quoc V. Le, Mohammad Norouzi, ..., Alex Rudnick, Oriol Vinyals, G. Corrado, Macduff Hughes, J. Dean | AIMat | 716 6,743 0 | 26 Sep 2016
Effective Approaches to Attention-based Neural Machine Translation | Thang Luong, Hieu H. Pham, Christopher D. Manning | - | 218 7,926 0 | 17 Aug 2015