Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2011.07635
Cited By
DORB: Dynamically Optimizing Multiple Rewards with Bandits
15 November 2020
Ramakanth Pasunuru
Han Guo
Mohit Bansal
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DORB: Dynamically Optimizing Multiple Rewards with Bandits"
4 / 4 papers shown
Title
Why is constrained neural language generation particularly challenging?
Cristina Garbacea
Qiaozhu Mei
49
14
0
11 Jun 2022
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,724
0
26 Sep 2016
Deep Reinforcement Learning for Dialogue Generation
Jiwei Li
Will Monroe
Alan Ritter
Michel Galley
Jianfeng Gao
Dan Jurafsky
192
1,325
0
05 Jun 2016
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
214
7,687
0
17 Aug 2015
1