Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.00456
Cited By
Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog
30 June 2019
Natasha Jaques
Asma Ghandeharioun
J. Shen
Craig Ferguson
Àgata Lapedriza
Noah J. Jones
S. Gu
Rosalind W. Picard
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog"
2 / 102 papers shown
Title
Deep Reinforcement Learning for Dialogue Generation
Jiwei Li
Will Monroe
Alan Ritter
Michel Galley
Jianfeng Gao
Dan Jurafsky
220
1,328
0
05 Jun 2016
Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning
Y. Gal
Zoubin Ghahramani
UQCV
BDL
287
9,156
0
06 Jun 2015
Previous
1
2
3