ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1601.04468
  4. Cited By
Bandit Structured Prediction for Learning from Partial Feedback in
  Statistical Machine Translation

Bandit Structured Prediction for Learning from Partial Feedback in Statistical Machine Translation

18 January 2016
Artem Sokolov
Stefan Riezler
Tanguy Urvoy
ArXiv (abs)PDFHTML

Papers citing "Bandit Structured Prediction for Learning from Partial Feedback in Statistical Machine Translation"

9 / 9 papers shown
Reinforcement learning
Reinforcement learning
Florentin Wörgötter
734
3,169
0
16 May 2024
AlpacaFarm: A Simulation Framework for Methods that Learn from Human
  Feedback
AlpacaFarm: A Simulation Framework for Methods that Learn from Human FeedbackNeural Information Processing Systems (NeurIPS), 2023
Yann Dubois
Xuechen Li
Rohan Taori
Tianyi Zhang
Ishaan Gulrajani
Jimmy Ba
Carlos Guestrin
Abigail Z. Jacobs
Tatsunori B. Hashimoto
ALM
654
831
0
22 May 2023
Bandits Don't Follow Rules: Balancing Multi-Facet Machine Translation
  with Multi-Armed Bandits
Bandits Don't Follow Rules: Balancing Multi-Facet Machine Translation with Multi-Armed Bandits
Julia Kreutzer
David Vilar
Artem Sokolov
267
18
0
13 Oct 2021
Survey on reinforcement learning for language processing
Survey on reinforcement learning for language processingArtificial Intelligence Review (AIR), 2021
Víctor Uc Cetina
Nicolás Navarro-Guerrero
A. Martín-González
C. Weber
S. Wermter
OffRL
376
141
0
12 Apr 2021
Warm-starting Contextual Bandits: Robustly Combining Supervised and
  Bandit Feedback
Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback
Chicheng Zhang
Alekh Agarwal
Hal Daumé
John Langford
S. Negahban
432
44
0
02 Jan 2019
Preference-based Online Learning with Dueling Bandits: A Survey
Preference-based Online Learning with Dueling Bandits: A Survey
Viktor Bengs
R. Busa-Fekete
Adil El Mesaoudi-Paul
Eyke Hüllermeier
486
133
0
30 Jul 2018
A Shared Task on Bandit Learning for Machine Translation
A Shared Task on Bandit Learning for Machine Translation
Artem Sokolov
Julia Kreutzer
Kellen Sunderland
Pavel Danchenko
Witold Szymaniak
Hagen Fürstenau
Stefan Riezler
173
16
0
27 Jul 2017
Reinforcement Learning for Bandit Neural Machine Translation with
  Simulated Human Feedback
Reinforcement Learning for Bandit Neural Machine Translation with Simulated Human Feedback
Khanh Nguyen
Hal Daumé
Jordan L. Boyd-Graber
444
146
0
24 Jul 2017
Stochastic Structured Prediction under Bandit Feedback
Stochastic Structured Prediction under Bandit FeedbackNeural Information Processing Systems (NeurIPS), 2016
Artem Sokolov
Julia Kreutzer
Christopher Lo
Stefan Riezler
150
31
0
02 Jun 2016
1
Page 1 of 1